Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37227 |
Symbol | |
ID | 7202180 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 450370 |
End bp | 455890 |
Gene Length | 5521 bp |
Protein Length | 1144 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181379 |
Protein GI | 219122075 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACCA AGTCTACTCC CAAAGACCTC ATTGACTCGT TTCCGCACAG CAAACTCACC CCGATTGCTA CCGCGACAAC CGAACCCGAT TACTTGTCGC TCCATCAGCT TCAGTATGAA ATCAACGACA ATGCGGAGAC CCTTTCCTCC ACTCTTGGAG ATGGCCAACA CGGTCACCTT TTTCTCGTAA TTTCCGAAAC CGAGTACCTC GAAATGACCG ACGGCATTCC ATGCATTCCT CCTGTGCAAC CGCCTTTCGA CCCAGTTCAC GCTGCCAACG CCACAGCCCC TCAAATTGTC GAAGCTAACC GCCAGAACGA CAAACGATAA AAGCTGTTTG ACCTCTATCA CAACGCCATC AAAGCGTTTC GCAATCAACT CCTTGAAGCC ATTCCCATCG AATACATCGA ATCTCTCGGT CATCCTACAC GAGGCTTTAA CAAAGTCTCT CCCCTCAAAA TCCTCTCTCA TCTTTGGGAA ACCTTTGGTA AAATTCAGGC TTCGGATCTC ATCGCCAACG ACGAACGCAT GAAAGCCGCC TGGCATCCAC CAACGCCTAT CCAGCAACTC TTCCAGCAGC TTGAAAAAGG CAATCAGTTT ATCATTGCGT CTGGCCAAGT CATGGACGAA CGTATTATCG CTCGCATCGG CTACCAGATC ATCGAAAAAA CCGGACTCTT TGATCTTGCT TCTCGCGACT GGCATTATAA AGATGAAGCC GATAAAACTT TGGCAAATTT CAAAAAACAT TTCCAGAAGG CCAACAAGGA TCTTGCCCTC ACCGCCACCA GCAGCTCTGC GGGTTACCAC ACCGCAAATC AGAGTACTGT CATCAAGGGC AAATCGTATT GCTGGACACA CGGCATCGTG CACAACACGA AGCACACCAG TGCAACATGT GAAAAACAGG CCCCGGGGCA CAAAACCGGC GCTACATTGC ACGACAAACA AGGCGGGTCG ACCAAGACCT ATCAATACAC GCCACCGGTG CCCAAATAGG AAAGAGGGAC GGCCAAACTG TTGAGTGTGC CGCTGAATAC TTATCATAAC AAAAATAAGT CTTCAGTTGC ACCAAACACT CCTCCGTTAG CTTCCTCCCC GCCATTTTTT CCTCCCGACG CCATTGCAGA CACTGGCTGT ACCGGACATT TTTTGAGCAC CAACATTGCT CACATACACT GCCAACCGAC AGTCCCCGGC ATCAACGTGG TCCTCCCTGA TGGTCGCACA ATTACTTCAA GTCACATCAC CGAACTCAAC ATTCCTTCGC TTCCTCCGGC AGCTCGTACC GCCCACATCT TTCCTGGTCT CTCGAATGGA TCCCTCATTT CCATCGGCCA ACTTTGTGAC CACGGCTGTA CCGCCACATT CACATCCGAC GCGGTCAGCA TTGAGCTCAA TAACACTGTC GTTCTTCGCG GCGGCCGTTC TCCTTGCACC CGATTGTGGA CCCTAGACTC CCCTGTAACG CCAAATCCTC CCGCCACTGA ATTGCATGCG CCTTTGCACG ACAAAAATTT TGCGAATCAC CTCGGAGACC ACTCAGGGAC CCTTGCCGAC CGCATTGCCT TTGTTCATGC ATCCTTATTC TCGCCACAAC TTTCGACATG GTGCAAGGCC ATTGACGAAG GCCGCCTCAC AACCTTTCCG GACATCACGT CTGCACAGGT AAAACGGCAC CCCCCACAGT CCGTCCCTAT GGTCAAGGGA CACCTTGACC AGCAACGGTC CAACTTGCGC TCAACCAAGC CCAAGGTCAC CCTGTGTGCC TCTGTTGATC CTGACGACAT TAATTTCGAC ACCAATCCCG TCGTACAAGA CCCTCCAGCC GCCAGGACGC AGTTTTTGTA CGCCGATTTC GCCGAAGTCA CCGGAAAAAT TTTTACCGAC CCCACCGGCC GTTTCGTTAC CACTTCAAGC TCCGGCAATG CTTACATGCT AGTGGTCTAT AATTACGACA GCAATTTTAT TCATGTCGAA GCCATGAAGA ACCGCACCGG TCCCGAGATC TTGAGCGCCT ACCAGCGTGC CCACGCCATG CTGTCCTCCA AAGGTCTGCG CCCCCAACTC CAACGCTTAG ACAACGAAGC CTCAACTGCA TTACAACAAT TCATGTCCTC TGTCAATATT GATTTTCAAT TAGCTCCGCC TCACGTGCAC CGTCGGAACG CCGCCGAACG GGCCATCCGC ACGTTCAAAA ACCACTTCAT TGCAGGTCTG TGCAGCACAG ACAAAAACTT TCCGCTTCAC CTTTGGGATC GCTTACTCCC ACAAGCCATC ATGACTCTCA ACCTTCTTCG AGGGTCTCGT ATCAACCCAA ATCTGTCGTC CTGGGCCCAA CTCCACGGCT CGTTCGACTA CAATCATACC CCTCTGGCTC CCCCGGGCAT CCGCGTGCTT GTACACGAAA AACCGTCAAT TCGCAGAACT TGGGCCCCCC ACGCAGCCGA CGGTTGGTAC GTTGGCCCCG CCATGAACCA TTACCGATGC TATCGCGTCT GGGTCAAGGA GACCACCAGC GAACGCATTT CGGACACTCT GACCTGGTTT CCCAGCCAAG TCAAAATGCC CAGCACCTCG TCTCGCGATA CAATTGTCGC CGCTGCTCAC GATCTTGCCC ATGCTCTGGC ACATCCCTCT CCTGCGTCGC CTTTATCACC TCTTTCGGTC AACGAACGCG AAGCCCTCTC GCAACTTTCA GATATTTTTT CCAAAGCCGC TAACCCAGTT GACTCGTCCC TCCCAGTTGC TCCCACGGCA ACCCTAAGTC CGCCAGCTGC ATCTACTTCT TCACCGCGTC AAGTCCGCTT CCGAAACCCG GTCACTGAAT CACTTCCAAG GGTGCCGACC GCCACAGCCG CCCCTCCGCA GTCACTTCCG AGGGTGCCTC CCCCGGACTC CGAGGCCGAG ACATACAAGC TTGTCACCTG CAACCCTCGC CAAGCACGTC GTAGGGCCGC TCGCAAACTG AAAGAAAAAA TTTCTGCTCC AACATCCGTT GTTCCTACCC AAGCAGCACC CGCACCCGTC GTACCATCTC CCAAGGTCCC AACACCTCCG CACAGTCACG GAACTCGCTT GCAAGCCGCT CTATACCCAG ACGCGTTCGA CAGCGCCAAC GCCGTCGTCG ACCCCAATTC TGGAGCCACT CTCGAGTATT CAAAACTCAA AAATTCCGAA CACGGCCCCG AATGGATTCA GGCCGCTGCC AACGAGATGG GCCGCCTATC CCAAGGCGTT AAACCCAACA TGCCCACCGG CACCGACACG ATGCATTTTA TTCCGCATAC CGCGAAGCCG CACGACCGCA AGGCCACTTA CCTGAAGATC GTAGCGGCTA TCAAGCCACA CAAGGCCGAA AAATACCGCA TCCGTTTCAC TGTCGGCGGC GACCGTATCG AGTACAACGG ACCCACAAGT ACCCCTACAG CTGCCTTACC AGCCATCAAG ATCCTCATCA ACAGCGTAAT TTCCACTGAA GGCGCACGCT TTATGACCTG CGACCTCAAG GATTTTTATT TGGGCACTCC TCTCCCCGTG TACGAGTACA TGCGCATCCC TGCAGTCCAT ATACCGGACT GCATCATGGA ACAGTACAAG CTTGCCCCGC TAGTTCACAA CGGCAACGTT CTAGTGGAAA TTCGAAAAGG AATGTACGGT CTCCCACACG CAGGCCGCAT TGCCAACAAC CGCCTCATCG ATCATTTAGC TCTCGACGGA TACCATCAGG CCCAGCATAC CCCAGGCTTC TTCACCCACG AAACGCGCCC TATTTCATTT TCACTAGTTG TTGACGATTT TGGTGTTAAA TACGTGGGCA AGGAACATGC CGAGCATTTA CTACACTGTC TCGAGAAGCT TTACACGGTA ACGACAGATT GGACCGGTGC CCTTTACTGC GGTCTCACCT TTACTTGGAA TTACAAACAG CGCCACGTTG ACATGGCCAT GCCTGGCTAC GTCGAAAAAG CTTTACAACG TTTCCAACAT CCGTCCCCAG CCCGACCTCA ACATTCTCCT CACGCGTGGG TTCCGCCATT GTACGGTGTA AAAATTCAGC TAACCAACGA AACTGATCTT TCTCCGCCGC TCGACAAGGC TGGCATCACT TGTCTTCAAG AAGTTATTGG TACCCTGTTA TATTATGCTC GCGCCGTGGA TTCAACCATG CTTGTTGCTC TCGGCACCCT CGCATCCGCT CAAACGAAAG GAACCGAAGC AACTGCCGAA GCCGTCACGC AACTACTAAA TTACAGCGCA ACGCACCCAG ACGCGACGGT ACGATACCAC GCCAGCGACA TGCACTTACA CGTTCACAGC GATGCTTCTT ATTTATCAGA ATCCAAAGCT CGCTCTCGCG CTGGTGGAAT TTTTTTTCTA AGTTCTGCAC CTACCAAGAA CCCCAAGCCA AATTCCAAGC CACCGCCATT AAACGGCGCC ATACACACGC ACTGTTCTAT CATGAAGTCC GTTCTTTCTT CTGCAACCGA AGCTGAACTG GGTGCACTAT TCTTTAACGC CAAGGACGGT GTGGAATTAC GTACTACCTT GGAAGCTATG TGACATCCCC AGTTAGCTAC TCCTATTCAA ACTGACAATG AATGTGCATC AGGAATTGTC AACAATACAG TGAAACAACG AAGATCTAAA GCCATAGACA TGCGTTTCTA TTGGATTAAA GACCGTGTCA AGCAAGGTCA ATTCAATGTT CATTGGAGAA AAGGAACTGA TAATCTTGCA GATTATTTCA CCAAGCATCA TTCGCCATCA CATCATCAAA TAATGCGATC TCGTTATTTG CTCGATCTCG ACCAAACTGC CTCCAACTCA AGTTTGAAAC GAGGGTGTGT TGATAATTCC ATTGGGCCTA CTAAATCTTT CCCATTCCGC TGTGACAAGA CTGATATTCT CCCAATCCGT TGTGACAAGA TTGATAGTAC CAAAGAACCT ATCACGGTTA CATACAGAGC TTTAACGTCC TACGAACGTT CTAGTCTTCC CGTTTACATT TCAAACAAGT CATCGCCATA TGAAACAGTC ATTGGCGATT CAGCTGCATC ATACAGACTC AATCACGATT CTCAAACTAG TCAGAAATCT TATTAGTTCA TCGATTTCCA TCATTCGCGT TCTTGCGTTT GTAGAGCTCA TCAATAGTAT GGACAGTTCT GGAAAAGGTG ATTGGTGTAG AAGCCCAGCT AGGTTGTCAA CTGGAATGAC AGTTTTGAGT ACCGGTTCGC GCGATGGCGC GCACTACTTG GGTCGTGGCA CCCAAGGCGA GCCTGTCGAA ACTAATCTTG TGTGTTTGGC CACAAATTCC GACACGGATG GTTTTGGACA GGCGCAATCC CAGCTGGGTA TTCAGAAACA TGACGTGACT GTTTCGGGAC TCGATCGTTG CCATGCTAAG GAGCATGGTA ACTATGATGA ATGTGCCCAC GGTGCCGTGT ATAACACCGT GGAGAATAGT GAAACGGACC ACAATGGCGC TTGGCAGATT GTGGAGCCAA ATGGAAAGTG A
|
Protein sequence | MTTKSTPKDL IDSFPHSKLT PIATATTEPD YLSLHQLQYE INDNAETLSS TLGDGQHGHL FLVISETEYL EMTDGIPCIP PVQPPFDPVH AANATAPQIL FDLYHNAIKA FRNQLLEAIP IEYIESLGHP TRGFNKVSPL KILSHLWETF GKIQASDLIA NDERMKAAWH PPTPIQQLFQ QLEKGNQFII ASGQVMDERI IARIGYQIIE KTGLFDLASR DWHYKDEADK TLANFKKHFQ KANKDLALTA TSSSAGYHTA NQSTVIKGKS YCWTHGIVHN TKHTSATCEK QAPGHKTGAT LHDKQGGSTK TYQYTPPSSV APNTPPLASS PPFFPPDAIA DTGCTGHFLS TNIAHIHCQP TVPGINVVLP DGRTITSSHI TELNIPSLPP AARTAHIFPG LSNGSLISIG QLCDHGCTAT FTSDAVSIEL NNTVVLRGGR SPCTRLWTLD SPVTPNPPAT ELHAPLHDKN FANHLGDHSG TLADRIAFVH ASLFSPQLST WCKAIDEGRL TTFPDITSAQ VKRHPPQSVP MVKGHLDQQR SNLRSTKPKV TLCASVDPDD INFDTNPVVQ DPPAARTQFL YADFAEVTGK IFTDPTGRFV TTSSSGNAYM LVVYNYDSNF IHVEAMKNRT GPEILSAYQR AHAMLSSKGL RPQLQRLDNE ASTALQQFMS SVNIDFQLAP PHVHRRNAAE RAIRTFKNHF IAGLCSTDKN FPLHLWDRLL PQAIMTLNLL RGSRINPNLS SWAQLHGSFD YNHTPLAPPG IRVLVHEKPS IRRTWAPHAA DGWYVGPAMN HYRCYRVWVK ETTSERISDT LTWFPSQVKM PSTSSRDTIV AAAHDLAHAL AHPSPASPLS PLSVNEREAL SQLSDIFSKA ANPVDSSLPV APTATLSPPA ASTSSPRQVR FRNPVTESLP RVPTATAAPP QSLPRVPPPD SEAETYKLVT CNPRQARPPA PVVPSPKVPT PPHSHGTRLQ AALYPDAFDS ANAVSLAIQL HHTDSITILK LVRNLISSSI SIIRVLAFVE LINSMDSSGK GDWCRSPARL STGMTVLSTG SRDGAHYLGR GTQGEPVETN LVCLATNSDT DGFGQAQSQL GIQKHDVTVS GLDRCHAKEH GNYDECAHGA VYNTVENSET DHNGAWQIVE PNGK
|
| |