Gene P9301_00451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_00451 
Symbol 
ID4912211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp45500 
End bp47302 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content33% 
IMG OID640159609 
Productflavoprotein 
Protein accessionYP_001090269 
Protein GI126695383 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0426] Uncharacterized flavoproteins
[COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGAGA TTAATATAAG CTTACTTTCA GAAAATCAAA ACTTATCAAA ATTTGAAATT 
GCTGATAATT TTACTTGCAT CAGATTTCTA GATCGTCAAA AAGAAAGATT TGAACTTGAG
TTTAATCTCG AAAAAGGAAC CTCTTTTAAC ACTTTTTTCA TAAGAAATAA TGAAGAACTT
CTTATTATTC ATCCACCTGA AAAGCAATAT TTAAATTCAT TTATTGAGAT AATATCTAAC
TTTTTTAATC AATATAAATT AAAAAAAATT AACTTCATAT CTGGACATAT CAATCCTCAA
ATCATAGAAA CCATAAAAAA CGTTAGTAAG CGATTTCAAG ACTTAACTAT AACTTGCTCA
AATCCTGGCT TCAAACTTAT TAGTGAACTT TGGAATCAAA GAAATCCTAA CTTAGAAGAA
TTTTTTGAAA TTCAATTACC CAAAATAAAC ATAGTCAAAA AGGAACTTAA TTTAGAATTA
AGTAATATTT CATTAGAGTT AATTCCTATT CCAACTGCGC GTTGGCCTGG AGGTCTAATC
ATTTATGAAC GTAATCAAGA AATACTTCTA AGTGAAAAAA TTTTCTCTGC ACACATTGCA
TCTGAATATT GGTCTGAGAC GAATCGAATT AGTACAGAAA TTGATAGAAA ACACTTTTAT
GATTGTTTGA TGGCACCAAT GTCTAATCAA GTTGTATCAA TAACTGAAAA AATTGCAGAC
TATGACATTA AAACTATTGC ACCATTACAT GGGCCAGCTA TCGAATATAG CTTAAAAAGC
TTTTTCAATG ACTATATTCG ATGGGGAGAG AATCTCTCCA CAAATAATCC TAAGATCGCT
CTTATTTATG CAAGTGCATA TGGAAATACA GCCTCAATTG GAGATGCATT GGCCAAAGGG
ATAAATAGAA CTTCTGTTGA AGTAGAAAGT ATTAATTGTG AATTTACATC AAATGATGTA
CTTATAAAAT CCATCCACAA TGCTGATGGA TATTTAATAG GATCTCCCAC TCTTGGTGGA
CACGCCCCAA CTCCTATAGT AAGTGCACTA GGAACATTGT TAGCAGAGGG TAATAGAGAT
AAGCCAGTTG GGATTTTTGG TAGTTTTGGG TGGAGCGGCG AGGCGATAGA TTTACTTGAA
TCAAAATTGA AAGACGGCGG TTTTCAGTTT AGTTTTGATC CTATAAGAAT TAAATTCAGT
CCAAATAAAC CAAAAATTAA GGAGCTAGAA GAAATAGGAA CTCACTTTGG GAGAAAAATT
ATAAAGAAAG CTAAGAAAAA ACCTAGAAAA TCAGATACTG GAATGATTAC AAGTAAAACA
GATCCAAAGT TGCAAGCTCT TGGAAGAGTA ATTGGTTCGC TTTGTGTCCT AACAGCCTCC
AAAGGCAAAG ATGAGAATAA TATTAAAGGA GCTATGCTTG CATCTTGGGT TAGTCAAGCA
AGTTTTTCGC CTCCGGGGTT AAGTATTGCA GTAGCAAAAG ATAGATCGGT AGAGTCTCTT
CTACAAATAG GAGATTCATT CGCATTGAAT ATTTTAAGTG AAAAAGATTT TAAAGAACCT
TTGAAAAGAT TCACAAAACC TTTTTCACCT GGTGAAGATA GATTTGAAGG TATAAATATT
GAATTAACCC CAAATGAACA GATAATAATT CCTAACTCTC TAGCATGGTT AGAAGCGGCT
GTAAAAGAAC GAATGGAATG TGGAGATCAT TGGGTAATAT ACGCTGAAGT ACTACACGGT
AATATACTAA AATCCGATTG TCTAACAGCA GTTCATCACC GTAAAACAGG TGCTAACTAT
TAA
 
Protein sequence
MSEINISLLS ENQNLSKFEI ADNFTCIRFL DRQKERFELE FNLEKGTSFN TFFIRNNEEL 
LIIHPPEKQY LNSFIEIISN FFNQYKLKKI NFISGHINPQ IIETIKNVSK RFQDLTITCS
NPGFKLISEL WNQRNPNLEE FFEIQLPKIN IVKKELNLEL SNISLELIPI PTARWPGGLI
IYERNQEILL SEKIFSAHIA SEYWSETNRI STEIDRKHFY DCLMAPMSNQ VVSITEKIAD
YDIKTIAPLH GPAIEYSLKS FFNDYIRWGE NLSTNNPKIA LIYASAYGNT ASIGDALAKG
INRTSVEVES INCEFTSNDV LIKSIHNADG YLIGSPTLGG HAPTPIVSAL GTLLAEGNRD
KPVGIFGSFG WSGEAIDLLE SKLKDGGFQF SFDPIRIKFS PNKPKIKELE EIGTHFGRKI
IKKAKKKPRK SDTGMITSKT DPKLQALGRV IGSLCVLTAS KGKDENNIKG AMLASWVSQA
SFSPPGLSIA VAKDRSVESL LQIGDSFALN ILSEKDFKEP LKRFTKPFSP GEDRFEGINI
ELTPNEQIII PNSLAWLEAA VKERMECGDH WVIYAEVLHG NILKSDCLTA VHHRKTGANY