Gene Information |
Locus tag | PHATR_18524 |
Symbol | |
ID | 7204357 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 413705 |
End bp | 417174 |
Gene Length | 3470 bp |
Protein Length | 766 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186340 |
Protein GI | 219113513 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
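The coordinate fields above can be cross-checked against the reported lengths. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the stop codon; the function and variable names are illustrative and not part of the record.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
start_bp = 413705
end_bp = 417174
protein_length_aa = 766

# Span of the gene on the replicon (assumption: inclusive coordinates).
gene_length = inclusive_length(start_bp, end_bp)

# Coding bases implied by the protein: three per residue plus one stop codon
# (assumption: the stop codon is not counted in the protein length).
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the span that are not accounted for by coding sequence.
# For a spliced gene these would mostly be introns.
non_coding_bp = gene_length - coding_bp

print(gene_length, coding_bp, non_coding_bp)
```

Under these assumptions the computed span matches the reported Gene Length of 3470 bp, and the difference between the span and the coding bases is consistent with the record marking the gene as spliced.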
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACAACATTT CGTTAAATTT GCGAGTAACA ATTCTCGGAA CCACCGAGGA AAATAGTCGC ACGAAGCCAA AATGGGCAAG GGCGACGGCA AAAAGGAGAA AAAGGGTTCC CCATCGGAAA CGATCGGTTG GGGCGCGACG CCCAAATTTG CCGTCTTGCA GTATCCGGGC GAAACGCAAC CCGTCGGAAC GTTGTCGACC GCTGCTTCCC ACGAAACCTC CAACGATGCC ACCACCAGCC TCACACCGGG ACGTCGAGTC CAAGCCGGTC GGGGTCCCAA AATGGCCAGT ACGGGCTTGG CAATTGGCCG AGACTCCTTC ACGGTCCGGG TCCGCGATCC GGCCTGGTTG ACCTCCCGTG CCGCCGTGTA CGACCGCATC CGCGCCTCGC GAGCGGCAGA ACTCGCGTTG AAAACGCCGA CACCCATCCG GGTCGTCATG CCGGATGGCA AGGTACTGGA GCAGGATCAG GAAGGCCAGA ACTTTAGGGC CTGGCAGACG ACGCCCTGGG ATGTAGCACG GGTGATTGCG CAGGGATTGG CCGACGCCGC CACGGTGGCT AGGGTGACCT ACGCGGACTT TGTAGCCGAC TACGATAAGG CTCAGGACGG TATGGAAGTG GAGGATACAC TGTCGGCGGC TATGGCCGAC GGGGGAGTCG AGTCCGACGC ACAGGAAAAT CAGTTGCTCT GGGATATGAC CAGACCCTTG GTAGGGAACG TGGCCAAACT GGAATTGCTC AAGTTTGAGG ATGATCAAGA CGCCAAAACG GTCTTCTGGC ATTCATCGGC TCATATGATG GGAGAGGCAT TGGAACACCT TTACGGATGC AAGTTGACCA TCGGACCGCC GTTGGCGGGA GGATTCTATT ACGATTCCTA CATGGGCAAG GATGCCTTTC GAGAAGAAGA TTGTACGTTG TGTGTTGCAG TGTCAATTGC TCCAACCACA GAGTAGACAC TCTTGCAGTA CTACCAAAAA GCGATTCTGT TTTCTGTCGT GCTGACCATT CTCTTGTCTC TTGTCTGGTA GACTCCCCCG TGGAAGGGGA AGTAGGCAAA ATTATCAAGC AAAAGCAAAA GTTCGAGCGC CTAGTCATTA CCAAGGAGGA AGGCTTGGAG TTGTTCGCCG ACAATCCCTT CAAGGTCAAT ATTCTTACTA CCAAGGTCCC CGACGGATCC CGCACCACCG TCTACAAATG TGGTGATCTA ATAGATCTGT GTCGGGGTCC ACACCTGTCC CATACCGGCA AGGTCAAGGC CTTTGCCGCG ACACGGCATT CGGCCACCAA TTGGCTGGGA GATACCAACA ACGATACCCT CCAGCGCATG TATGGAATTT CGTTCCCCGA CAAGAAAATG CTCAAGGTTT GGAAGGAAAA TCAAGAAAAG GTACGTGAGC TGCCTGCCTC AGTTCTGCGT TGGTGGAGAA ATTATAATAC GACAAGCAGT GAAGACAATA CACGAATCTC ATTCTCGGAG TAAACAATGT GACCGACAAC TTTCCAATCC GCTGATTCTT CTGAGCTTGT CACAATTAAC GTTGATGGGC AGACGCATTT TAATTGTACG CTAACTCGTA TTCTCTGTGT TTCTTGTCAG GCCAAAGAAC GCGATCATCG TCGTATCGCG GCCAAGCAGG ATCTCATAAT GTTCCATGAC TTATCTGCAG GGAGCGCCTT TTGGCTGCCG CACGGAGCCC GCATTTACAA CAAGCTCATT GATTTTATTA AATCACACTA CTGGAACCGC GGCTACGACG AAATCATCAC ACCCAACATT TACAATCTTG ATTTGTGGCA 
CAAGTCCGGC CACGCCCTCC ACTACAAGGA CGCTATGTTC TGTTTTGACG TGGAAGGTCA AGAATGGGCT ATGAAACCCA TGAATTGTCC TGGTCACTGT CTCATGTTCG CCAATTCAAT TCGTTCCTAC CGTGATTTGC CGCTGCGTTT TGCCGATTTC GGAGTGTTGC ATCGCAACGA ACTTTCCGGT GCTCTCACGG GCTTGACGCG CGTCCGACGC TTTCAGCAGG ATGATGCCCA TATTTACTGC CGCGAGGATC AAATTGAAAA AGAAGTCGTG GATGCGCTTA ATTTTATGAA GGATGTTTAC GATACATTCG GAATGACGTA CAAACTGGAA CTGTCCACAC GTCCTCAAAA GGCGCTCGGG GACGTTGCGC TTTGGGAGCG CGCGGAGGAA GCCCTGGCGA ACGCCATGGA TATGTTCGCT GGCAAGGGTG GCTGGAGAGT AAATCCGGGG GACGGCGCCT TTTATGGACC CAAAATCGAC ATCAAGGTTA TGGACGCCAT GGATCGTGTG CACCAGTGTG CTACTGTACA GCTGGATTTC CAACTACCCA TTCGATTTGA CCTCCAATAC ACCACGGCGA GCAAGGAAGA AGGCCAGCAG TTTGCTCGGC CAGTGATGAT CCATCGTGCC ATGCTTGGTA GTGTGGAGCG CATGTTTGCC GTCTTGTGCG AACACTATGG AGGAAAGTGG CCATTCTGGC TGAGTCCTCG ACAAGTGATG CTCGTGCCGG TTCATGCCGA ATTTTTCGAT TACAGCGAAG ATATTCGTGC GAAACTCCAT GCCGAAGGTT TCTATGTGGA TGTCGATACC TCGAAGAATA CATTTCAAAA GAAGGTTCGC AATGCACAGG TAGCGCAGTA TAACTTTCAG TTTGTCGTAG GCAAGGCCGA GGTCGCCAAC GGCTCGGTCA ACATCCGCAA TCGGGAAAAT CAGGTCGAGG GTGAGAAAAA GATTGATGAG ATGATCGCGA TGCTCAAGCA ATTGAGAGAG GAACACAAGT AGAAATTCGG TTTCGTTAAG TTCCGTGAAA CTAAGTATTT ATCAACATTC CCGTAACATC AGACACCGTC TTGTTCCTAG TTCTTCGTGC GCGCCCACAG CGCTTTGCTG CCGCTTCGAA CATGGCTCGG TTGTGCATGT CGGTCAATTC TACAGGTGGC GTGGCACCGC GCTTCATAAT GAGCGCAGCA CGAGGAAGAG CGAGTCTTTC AGTTCTCTTT TCGTCAAACT GCTTCGAGGC ATCCAGTACT GCCGTGCGCA TCTCGCGGTG TTTTCTGTAC AACGCACGGT TTATAACCAA GGGAGACATG TTCATATCTT TGATGACTTC TTCGTGCAAT CCGTCGACAT GTCCAGGGAG CATCACTGCC CGAGAAGAAT CTCGAAGATC CCCGGAAGCA TTTCTTTCAT TCTCCACCCA TAAAGGAGAG CCTGGCTCGT GTGAATGAGA ACGCTTCTTT TTATAGATCA AATGAGAGTC GGACTGTTGA TCTAGAAGTT CGAGATCGGA AGCACTGGCC CCGATGCCAT TCGTTTTCTC CTCTCGAACA GGTGTGAGAC CTTCCGTTGA TAGGCCACTC ACCAAATGCC CCGGGGCTGA AAGAAGATTT TTGTCTGTTG AATTTATTCC GACACTCACC TCCGCGTTGC GGAGGGCACT
|
Protein sequence | MGKGDGKKEK KGSPSETIGW GATPKFAVLQ YPGETQPVGT LSTAASHETS NDATTSLTPG RRVQAGRGPK MASTGLAIGR DSFTVRVRDP AWLTSRAAVY DRIRASRAAE LALKTPTPIR VVMPDGKVLE QDQEGQNFRA WQTTPWDVAR VIAQGLADAA TVARENQLLW DMTRPLVGNV AKLELLKFED DQDAKTVFWH SSAHMMGEAL EHLYGCKLTI GPPLAGGFYY DSYMGKDAFR EEDYSPVEGE VGKIIKQKQK FERLVITKEE GLELFADNPF KVNILTTKVP DGSRTTVYKC GDLIDLCRGP HLSHTGKVKA FAATRHSATN WLGDTNNDTL QRMYGISFPD KKMLKVWKEN QEKAKERDHR RIAAKQDLIM FHDLSAGSAF WLPHGARIYN KLIDFIKSHY WNRGYDEIIT PNIYNLDLWH KSGHALHYKD AMFCFDVEGQ EWAMKPMNCP GHCLMFANSI RSYRDLPLRF ADFGVLHRNE LSGALTGLTR VRRFQQDDAH IYCREDQIEK EVVDALNFMK DVYDTFGMTY KLELSTRPQK ALGDVALWER AEEALANAMD MFAGKGGWRV NPGDGAFYGP KIDIKVMDAM DRVHQCATVQ LDFQLPIRFD LQYTTASKEE GQQFARPVMI HRAMLGSVER MFAVLCEHYG GKWPFWLSPR QVMLVPVHAE FFDYSEDIRA KLHAEGFYVD VDTSKNTFQK KVRNAQVAQY NFQFVVGKAE VANGSVNIRN RENQVEGEKK IDEMIAMLKQ LREEHK
|
| |