Gene P9211_00481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_00481 
Symbol 
ID5730885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp47074 
End bp48843 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content40% 
IMG OID641284390 
Productflavoprotein 
Protein accessionYP_001549933 
Protein GI159902589 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0426] Uncharacterized flavoproteins
[COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACC CTGTTGCTGT AATTGATCAA TCCCAACCTG AGCTTCAGCT ATCTTTGCAA 
TGCGAGATAA TTGCGCCCAA TACTACTACT ATTCGTTCAT TAGATTGGAA GCGTAGTCGG
TTCGATATTG AATTTGGTCT CAGAAACGGA ACAACCTACA ATAGTTTCAT TATAGAAGGT
GAGAAGAAAG CCTTAATAGA TACCAGTCAT GAAAAATTTC GAGATAGCTG GCTAAAACTT
TTTAAAGAGC AAGTCAATCC ACAAGAACTT GATTTTTTGG TTGTTAGCCA TACAGAACCA
GATCATTCTG GCTTAATCAG TTACCTTCTT GATTTCAATC CAGAAATTCA AATTATTGGC
TCAAAAGTTG CAATTCAATT TTTAGAAAGT CAAGTTCATC GCCCCTTTAA ATCCCAAGCT
ATAAAAACTG GATGCGAGCT TGATCTAGGC ATTAACCCAT CTAATGGAAT CCATCACAAA
TTGGAATTTC TAAGTGCACC AAATTTACAT TGGCCAGATA CCATTTTTTC CTTTGATCAT
GCCACTGCAA TCCTCTTTAC TTGTGATGCC TTTGGCCTCC ATTACTGCTC AAGTGACATG
TTCGATATTG ATCCAGAATT AATACTTCCT GATTTCCGTT ATTACTACGA TTGTCTCATG
GGCCCAAATG CACGTAGTGT TTTACAAGCA CTAAAGCGAA TAAAAAATTT ACCCAAAATA
ACAACAATCG CCGTTGGGCA TGGTCCCCTT TTACGACACA ACATTGATCT TTGGCTAAGC
AGTTATCTAA GCTGGAGTGA AAAACGTAAC AAAGGGGAAG GGTATGCAGC TGTTTGTTAT
GTAAGCCAAT ATGGTTTTTG TGACCGCCTA AGTCAGGCAA TTGCATTAGG TATTAATAAA
GCCGATGCGC AAGTTCAACT TATTGATCTG AGAGCTTCTG ATTCTCAAGA AATCAGTGCA
CTCATAGGAG AAGCTAATGC AATCATTGTT CCTACTTGGC CTCATAACAC TGATGCTGAA
CTTCAAAGCG CTATAGGAAC ACTGCTTGCA GCTCTTAAAC AAAAACAATG GGTAGCTGTA
TACGATGCTT ATGGTGGAAA TGATGAACCA ATTGATGTAG TAGCCAATCA ATTAAGAAGC
CTGGGTCAAA AAGAAGCCTT CTCGCCACTA AGAGTTCGTG GTACACCGGA TGCAAATACT
TTTCAACGTT TTGAAGAAGC AGGCACTGAT TTAGGACAAC TACTTAATCG TAAAAAAAAT
ATTGCAAATA TTAAAAGTTT TAGTGGCGAT CTAATGAAAG CAATGGGTCG AATAAGCGGG
GGTTTATATG TAGTAACAGC CAGTCAAGGT AAAGGAAAAG AGCAACGACG AGGCGCAATG
GTAGCCAGTT GGGTAAGTCA AGCAAGCTTC AATCCTCCAG GCATCACTGT TGCCGTTGCA
AAAGATCGTG CAATAGAAAC TCTTATGCAA GTGGGAGATC GTTTTGTAAT CAATGTACTT
CAAGAAAATA ACTATCAAAA ATTATTCCGT CAGTTTCTAA AAAGATTTCC ACCGGGAGCT
GATCGTTTTG AAGGTATCTC CATATTAGAA GATGTCACAA AAGGAGGTCC AGTATTGGTC
GATGCCCTCG CATACTTAGA TTGTCTAGTC AAGCAAAGAT TAGAGACTAC AGATCATTGG
GTTATCTACG CGTTAGTAGA GCATGGAAAC GTCGCTAACG TAGAGTCCAA AACAGCTGTC
CATCATCGCA AAGTAGGCAC GTCATATTGA
 
Protein sequence
MTDPVAVIDQ SQPELQLSLQ CEIIAPNTTT IRSLDWKRSR FDIEFGLRNG TTYNSFIIEG 
EKKALIDTSH EKFRDSWLKL FKEQVNPQEL DFLVVSHTEP DHSGLISYLL DFNPEIQIIG
SKVAIQFLES QVHRPFKSQA IKTGCELDLG INPSNGIHHK LEFLSAPNLH WPDTIFSFDH
ATAILFTCDA FGLHYCSSDM FDIDPELILP DFRYYYDCLM GPNARSVLQA LKRIKNLPKI
TTIAVGHGPL LRHNIDLWLS SYLSWSEKRN KGEGYAAVCY VSQYGFCDRL SQAIALGINK
ADAQVQLIDL RASDSQEISA LIGEANAIIV PTWPHNTDAE LQSAIGTLLA ALKQKQWVAV
YDAYGGNDEP IDVVANQLRS LGQKEAFSPL RVRGTPDANT FQRFEEAGTD LGQLLNRKKN
IANIKSFSGD LMKAMGRISG GLYVVTASQG KGKEQRRGAM VASWVSQASF NPPGITVAVA
KDRAIETLMQ VGDRFVINVL QENNYQKLFR QFLKRFPPGA DRFEGISILE DVTKGGPVLV
DALAYLDCLV KQRLETTDHW VIYALVEHGN VANVESKTAV HHRKVGTSY