Gene A9601_00431 details

Gene Information

Locus tag: A9601_00431
Symbol:
ID: 4716725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 42497
End bp: 44299
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 33%
IMG OID: 640077740
Product: flavoprotein
Protein accession: YP_001008438
Protein GI: 123967580
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG0426] Uncharacterized flavoproteins
        [COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family
TIGRFAM ID:
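
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 42497
END_BP = 44299


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1803
print(protein_length(length_bp))  # 600
```

Under these assumptions the computed values agree with the reported Gene Length (1803 bp) and Protein Length (600 aa).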


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.175879
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGATA TTAGTATAGG CTTACTTGCA GAAAATCAAA ACTTATCAGA ATTTGAAATT 
ACTGACAATT TCTCTTGTGT AAGATTCTTA GACAAAAAAA AAGAAAGATT TGAACTTGAA
TTTAATCTTG AAAAAGGAAC CTCTTTTAAT ACTTTTTTCA TTAGAAGTAA TGCGGAACTT
TTTATTATTC ATCCACCTGA AAAACAATAT TTAAATTTAT TTAATAAAGT AATTTCTAGG
TTTTGTGATC AATTTAAATT AGACAATATC AACTTCATTT CTGGTCATAT TAATCCCCAA
ATTATTGAAA CTATAAAAAA TATAAGTACC CAATTTCAAA ACACAACTAT TACTTGCTCT
AATCCAGGTT ATAAACTTAT AAGCGAACTT TGGAATCAAA GAAATCCTAA CTTAGAGAAC
TTTATTGAAA TTCAATTACC TGAAATAAAC ATAATCAAAA AAGAACTTAA TTTAGAATTA
GATAAAGTTT CACTAGAGTT AATTCCTATT CCAACAGCTC GCTGGCCTGG TGGCTTGATA
ATTTATGAAC GTAACCAAGA GATACTTCTT AGTGAAAAAA TTTTCTCTGC ACACATTGCA
TCTAAATATT GGTCTGAAAC AAATCGAATT AGTACAGAAA TTGATAGGAA ACATTTTTAT
GATTGCTTAA TGGCACCAAT GTCTAATCAA GTAGTATCAA TAACTGAAAA AATTTCAGAA
TATGAGATTA AAACTATTGC ACCATTACAT GGTCCAGCAA TCGAATATAG CTTAAAAAGC
TTTTTAAATG ACTACATTAG ATGGGGAGAG AATCTCTTAA CAAATAATCC TAAGATCGCT
CTGATATATG CAAGTGCATA TGGAAATACA GCCTCAATAG GTGATGCATT AGCTAAAGGG
ATAAATCGAA CCTCTGTAGA AGTTGAAAGT ATTAATTGTG AATTCACACC TAATGATGTA
CTCGTAAAAT CTATCCAAAA TGCTGATGGA TATTTAATAG GATCACCCAC ATTGGGTGGT
CACGCCCCAA CTCCAATAGT TAGTGCACTA GGAACATTAT TAGCAGAGGG GAATAGAGAT
AAGCCAGTGG GAATTTTTGG CAGTTTTGGC TGGAGCGGCG AAGCTATAGA TTTACTTGAA
ACAAAATTAA AAGACGGAGG TTTTAAGTTT GGTTTTGACC CTGTAAGGAT TAAGTTCAGT
CCAAATAAAC CAAAGATTAA AGAGTTAGAA GAAATAGGAA CTCACTTTGG AAGAAAAATC
ATAAAAAAAG CAAAGATAAA ACCTAGAAAA TCAGATACTG GAATGATTAC AAGCAAAACA
GATCCAAAGC TACAAGCTCT TGGAAGAGTA ATTGGGTCAC TCTGTGTCTT GACAGCCTCT
AAAGGCAAAG ATGAGAATAA TATTAAAGGG GCTATGCTTG CATCTTGGGT TAGTCAAGCA
AGTTTTTCAC CGCCCGGGTT AAGTATTGCA GTAGCAAAAG ATAGATCGGT TGAGTCTCTT
TTGCAAATAG GAGATTCATT CGCATTAAAT ATTCTTAGTG AAAAAGATTA TAAAGAACCT
TTAAAAAGAT TCACAAAGCC CTTTGCACCT GGAGAAGATA GATTTGAAGG AATAAAAATT
GAATTAACTC CTAACGAACA GATAATCATT CCTGAATCGC TCGCATGGTT AGATGCTTCT
GTAAAAGAAA GAATGGAATG TGGAGATCAT TGGGTAATTT ACGCCGAAGT ACTACACGGT
AATATACTAA AATCCGATAG TTTAGCGGCA GTTCATCATC GCAAAACCGG CGCTAACTAT
TAA
 
Protein sequence
MSDISIGLLA ENQNLSEFEI TDNFSCVRFL DKKKERFELE FNLEKGTSFN TFFIRSNAEL 
FIIHPPEKQY LNLFNKVISR FCDQFKLDNI NFISGHINPQ IIETIKNIST QFQNTTITCS
NPGYKLISEL WNQRNPNLEN FIEIQLPEIN IIKKELNLEL DKVSLELIPI PTARWPGGLI
IYERNQEILL SEKIFSAHIA SKYWSETNRI STEIDRKHFY DCLMAPMSNQ VVSITEKISE
YEIKTIAPLH GPAIEYSLKS FLNDYIRWGE NLLTNNPKIA LIYASAYGNT ASIGDALAKG
INRTSVEVES INCEFTPNDV LVKSIQNADG YLIGSPTLGG HAPTPIVSAL GTLLAEGNRD
KPVGIFGSFG WSGEAIDLLE TKLKDGGFKF GFDPVRIKFS PNKPKIKELE EIGTHFGRKI
IKKAKIKPRK SDTGMITSKT DPKLQALGRV IGSLCVLTAS KGKDENNIKG AMLASWVSQA
SFSPPGLSIA VAKDRSVESL LQIGDSFALN ILSEKDYKEP LKRFTKPFAP GEDRFEGIKI
ELTPNEQIII PESLAWLDAS VKERMECGDH WVIYAEVLHG NILKSDSLAA VHHRKTGANY
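
As a consistency check, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal example, not the database's own procedure. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ only in which codons may act as start codons). The `translate` function and constant names are hypothetical.

```python
# Codon table in NCBI order: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna):
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        b1, b2, b3 = (BASES.index(base) for base in dna[i:i + 3])
        residues.append(AMINO_ACIDS[16 * b1 + 4 * b2 + b3])
    return "".join(residues)


# First 60 bases of the gene sequence in this record.
first_row = "ATGTCAGATA TTAGTATAGG CTTACTTGCA GAAAATCAAA ACTTATCAGA ATTTGAAATT"
print(translate(first_row))    # MSDISIGLLAENQNLSEFEI

# Last three codons of the gene sequence, ending in the stop codon.
print(translate("AACTATTAA"))  # NY*
```

The first 20 residues and the final two residues before the stop match the start (MSDISIGLLA ENQNLSEFEI) and end (NY) of the protein sequence in this record.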