Gene P9211_00471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_00471 
Symbol 
ID5730886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp45233 
End bp47074 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content38% 
IMG OID641284389 
Productflavoprotein 
Protein accessionYP_001549932 
Protein GI159902588 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0426] Uncharacterized flavoproteins
[COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.818961 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAATA CAACAAGTCA TCCATTATCT CAAACAAAAA CTGACAAACA TGTTGTCAGT 
ATTCCTATAG AAGAAGATTT AATTTGTCTG AAATGTATGA GTCCAAGAAA ATTAAGATTT
GAAATTGAAT ATGCATTAGA AAAGGGAACA ACAGCCAATG CTTTTCTATT TACTCAAATA
AATGAAACCA ATTCATCTGC TGTTTTAGTT CATCCACCAG GAATTAATTT TGAGGAAGTT
TTTATTGCAG AACTAATCAA CATTATTTCT CATAAAGATA CAAACTTATT AGTTGTCATT
GGGCATATCA ACCCTAATAG GGTCGCCTTA TTAAAAAAAC TAGCGGAGAT ATTCAACCAT
ATTGAGTTTG TGGCTTCTAA TCCTGCAGCA AAATTATTAA AAGATCTTTG GTATCAAGTT
AAGCCTTCAC AACTCAGAAA TAATGAAGAG AGCAATACAA TCATTCCGCC TCTACCAAAT
ATTAAGTTAA TAAAGCAAGA GCAGACCATA GCTCTTTTTA ATGAGTATGA AATGCAGCTC
ATACCTAGTC CTACTGCTCG ATGGCCTGGT GGATTAATCA GTTTTGAGCG AAAATTAGGC
CTACTAATGA GTGACAAGTT ATTTGGTGCT CATTTGTGCA ATGACCTCTG GGCAGAACCC
AACAGAAGCA GTACCGAAGA AGAGCGTCGT CATTATTTTG ACTGTCTAAT GAGTCCTATG
ATCAGCCAAG TAAGTTCAAT TATTGAAAAA CTTGAAGATC TGGATATTCA AACTATTGCG
CCTGGACATG GGCCAGCCAT AGAAACCAGT TGGCGTAGTT TACTTAATGA CTATCAAAGA
TGGGGCGAAG GTCAACAGAA GGCCTCCTTA AAAGTAGTTT TACTATTTGC AAGTGCCTAT
GGAAACACAG CATCTATTGC TGATTCTCTT GCAAAAGGAA TTAATTCAAC TGGTGTCAAG
GTAGACAGTT TAAATTGCGA GTTCACTCCT GCAAATGAAC TAGTTCAGGC CATAAAAGAA
GCGGATGCCT ATCTTATAGG ATCACCTACT CTGGGTGGAC ATGCACCAAC TCCAATAGTT
GCAGCACTGG GAACCTTACT TGCTGAAGGA GATAGGCAGA AACCAGTAGG AATTTTTGGT
AGCTATGGCT GGAGCGGAGA AGCATTAGAT CTTCTAGAAA ATAAGCTCCG AGATGGAGGT
TTTGAATTTG GTTTTAACCC AATAAAAATA AAATTTAGTC CTAATAATAA TATTATTAAA
ACCCTTGAAG AAACAGGAAC CCAATTTGGG AGACAACTAT TAAAAGAACA ACGTCGCAAA
AAACGTCGAC TAGGGGGAGG TATTAGTACA ACAAAAAGTG ATCCTGCATT ATTGGCTCTT
GGGAAAGTAA TAGGTTCTTT ATGTATCTTA ACTGCCTTCA AAAATACTGA AGAAGAGAGC
CTTTCAGGTG CAATGGTCGC CAGTTGGGTA AGTCAAGCAA GCTTCAATCC TCCAGGTATA
ACAATTGCAG TGGCTAAAGA CAGAGCTGTC GAAACCCTAC TGCATAAAGA AGATTTATTT
GCGTTGAACA TTCTTAATGA GGAAAACTAC CACAAACTTT TAAAACAATT TCTACAACCT
TTTAAACCAG GTGCAGATCG ATTTAAAGGA ATACAGGTTG ACCAAAGTCC AGGGAAACAA
CCAATACTTC CAGAAGCATT GGCCTGGCTT GAAGGCTCTG TCCAGCAAAG AATGGAGTGT
GGTGATCATT GGCTCATCTA TGCACAGATT CACCATGGCA AAGTACTAAG TTCAGATGGA
GTCACAGCAG TCCACCATCG AAACACAGGA GCAAATTATT AA
 
Protein sequence
MSNTTSHPLS QTKTDKHVVS IPIEEDLICL KCMSPRKLRF EIEYALEKGT TANAFLFTQI 
NETNSSAVLV HPPGINFEEV FIAELINIIS HKDTNLLVVI GHINPNRVAL LKKLAEIFNH
IEFVASNPAA KLLKDLWYQV KPSQLRNNEE SNTIIPPLPN IKLIKQEQTI ALFNEYEMQL
IPSPTARWPG GLISFERKLG LLMSDKLFGA HLCNDLWAEP NRSSTEEERR HYFDCLMSPM
ISQVSSIIEK LEDLDIQTIA PGHGPAIETS WRSLLNDYQR WGEGQQKASL KVVLLFASAY
GNTASIADSL AKGINSTGVK VDSLNCEFTP ANELVQAIKE ADAYLIGSPT LGGHAPTPIV
AALGTLLAEG DRQKPVGIFG SYGWSGEALD LLENKLRDGG FEFGFNPIKI KFSPNNNIIK
TLEETGTQFG RQLLKEQRRK KRRLGGGIST TKSDPALLAL GKVIGSLCIL TAFKNTEEES
LSGAMVASWV SQASFNPPGI TIAVAKDRAV ETLLHKEDLF ALNILNEENY HKLLKQFLQP
FKPGADRFKG IQVDQSPGKQ PILPEALAWL EGSVQQRMEC GDHWLIYAQI HHGKVLSSDG
VTAVHHRNTG ANY