Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_00471 |
Symbol | |
ID | 5730886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 45233 |
End bp | 47074 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641284389 |
Product | flavoprotein |
Protein accession | YP_001549932 |
Protein GI | 159902588 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0426] Uncharacterized flavoproteins [COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.818961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAATA CAACAAGTCA TCCATTATCT CAAACAAAAA CTGACAAACA TGTTGTCAGT ATTCCTATAG AAGAAGATTT AATTTGTCTG AAATGTATGA GTCCAAGAAA ATTAAGATTT GAAATTGAAT ATGCATTAGA AAAGGGAACA ACAGCCAATG CTTTTCTATT TACTCAAATA AATGAAACCA ATTCATCTGC TGTTTTAGTT CATCCACCAG GAATTAATTT TGAGGAAGTT TTTATTGCAG AACTAATCAA CATTATTTCT CATAAAGATA CAAACTTATT AGTTGTCATT GGGCATATCA ACCCTAATAG GGTCGCCTTA TTAAAAAAAC TAGCGGAGAT ATTCAACCAT ATTGAGTTTG TGGCTTCTAA TCCTGCAGCA AAATTATTAA AAGATCTTTG GTATCAAGTT AAGCCTTCAC AACTCAGAAA TAATGAAGAG AGCAATACAA TCATTCCGCC TCTACCAAAT ATTAAGTTAA TAAAGCAAGA GCAGACCATA GCTCTTTTTA ATGAGTATGA AATGCAGCTC ATACCTAGTC CTACTGCTCG ATGGCCTGGT GGATTAATCA GTTTTGAGCG AAAATTAGGC CTACTAATGA GTGACAAGTT ATTTGGTGCT CATTTGTGCA ATGACCTCTG GGCAGAACCC AACAGAAGCA GTACCGAAGA AGAGCGTCGT CATTATTTTG ACTGTCTAAT GAGTCCTATG ATCAGCCAAG TAAGTTCAAT TATTGAAAAA CTTGAAGATC TGGATATTCA AACTATTGCG CCTGGACATG GGCCAGCCAT AGAAACCAGT TGGCGTAGTT TACTTAATGA CTATCAAAGA TGGGGCGAAG GTCAACAGAA GGCCTCCTTA AAAGTAGTTT TACTATTTGC AAGTGCCTAT GGAAACACAG CATCTATTGC TGATTCTCTT GCAAAAGGAA TTAATTCAAC TGGTGTCAAG GTAGACAGTT TAAATTGCGA GTTCACTCCT GCAAATGAAC TAGTTCAGGC CATAAAAGAA GCGGATGCCT ATCTTATAGG ATCACCTACT CTGGGTGGAC ATGCACCAAC TCCAATAGTT GCAGCACTGG GAACCTTACT TGCTGAAGGA GATAGGCAGA AACCAGTAGG AATTTTTGGT AGCTATGGCT GGAGCGGAGA AGCATTAGAT CTTCTAGAAA ATAAGCTCCG AGATGGAGGT TTTGAATTTG GTTTTAACCC AATAAAAATA AAATTTAGTC CTAATAATAA TATTATTAAA ACCCTTGAAG AAACAGGAAC CCAATTTGGG AGACAACTAT TAAAAGAACA ACGTCGCAAA AAACGTCGAC TAGGGGGAGG TATTAGTACA ACAAAAAGTG ATCCTGCATT ATTGGCTCTT GGGAAAGTAA TAGGTTCTTT ATGTATCTTA ACTGCCTTCA AAAATACTGA AGAAGAGAGC CTTTCAGGTG CAATGGTCGC CAGTTGGGTA AGTCAAGCAA GCTTCAATCC TCCAGGTATA ACAATTGCAG TGGCTAAAGA CAGAGCTGTC GAAACCCTAC TGCATAAAGA AGATTTATTT GCGTTGAACA TTCTTAATGA GGAAAACTAC CACAAACTTT TAAAACAATT TCTACAACCT TTTAAACCAG GTGCAGATCG ATTTAAAGGA ATACAGGTTG ACCAAAGTCC AGGGAAACAA CCAATACTTC CAGAAGCATT GGCCTGGCTT GAAGGCTCTG TCCAGCAAAG AATGGAGTGT GGTGATCATT GGCTCATCTA TGCACAGATT CACCATGGCA AAGTACTAAG TTCAGATGGA GTCACAGCAG TCCACCATCG AAACACAGGA GCAAATTATT AA
|
Protein sequence | MSNTTSHPLS QTKTDKHVVS IPIEEDLICL KCMSPRKLRF EIEYALEKGT TANAFLFTQI NETNSSAVLV HPPGINFEEV FIAELINIIS HKDTNLLVVI GHINPNRVAL LKKLAEIFNH IEFVASNPAA KLLKDLWYQV KPSQLRNNEE SNTIIPPLPN IKLIKQEQTI ALFNEYEMQL IPSPTARWPG GLISFERKLG LLMSDKLFGA HLCNDLWAEP NRSSTEEERR HYFDCLMSPM ISQVSSIIEK LEDLDIQTIA PGHGPAIETS WRSLLNDYQR WGEGQQKASL KVVLLFASAY GNTASIADSL AKGINSTGVK VDSLNCEFTP ANELVQAIKE ADAYLIGSPT LGGHAPTPIV AALGTLLAEG DRQKPVGIFG SYGWSGEALD LLENKLRDGG FEFGFNPIKI KFSPNNNIIK TLEETGTQFG RQLLKEQRRK KRRLGGGIST TKSDPALLAL GKVIGSLCIL TAFKNTEEES LSGAMVASWV SQASFNPPGI TIAVAKDRAV ETLLHKEDLF ALNILNEENY HKLLKQFLQP FKPGADRFKG IQVDQSPGKQ PILPEALAWL EGSVQQRMEC GDHWLIYAQI HHGKVLSSDG VTAVHHRNTG ANY
|
| |