Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_00431 |
Symbol | |
ID | 4716725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 42497 |
End bp | 44299 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640077740 |
Product | flavoprotein |
Protein accession | YP_001008438 |
Protein GI | 123967580 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0426] Uncharacterized flavoproteins [COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.175879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGATA TTAGTATAGG CTTACTTGCA GAAAATCAAA ACTTATCAGA ATTTGAAATT ACTGACAATT TCTCTTGTGT AAGATTCTTA GACAAAAAAA AAGAAAGATT TGAACTTGAA TTTAATCTTG AAAAAGGAAC CTCTTTTAAT ACTTTTTTCA TTAGAAGTAA TGCGGAACTT TTTATTATTC ATCCACCTGA AAAACAATAT TTAAATTTAT TTAATAAAGT AATTTCTAGG TTTTGTGATC AATTTAAATT AGACAATATC AACTTCATTT CTGGTCATAT TAATCCCCAA ATTATTGAAA CTATAAAAAA TATAAGTACC CAATTTCAAA ACACAACTAT TACTTGCTCT AATCCAGGTT ATAAACTTAT AAGCGAACTT TGGAATCAAA GAAATCCTAA CTTAGAGAAC TTTATTGAAA TTCAATTACC TGAAATAAAC ATAATCAAAA AAGAACTTAA TTTAGAATTA GATAAAGTTT CACTAGAGTT AATTCCTATT CCAACAGCTC GCTGGCCTGG TGGCTTGATA ATTTATGAAC GTAACCAAGA GATACTTCTT AGTGAAAAAA TTTTCTCTGC ACACATTGCA TCTAAATATT GGTCTGAAAC AAATCGAATT AGTACAGAAA TTGATAGGAA ACATTTTTAT GATTGCTTAA TGGCACCAAT GTCTAATCAA GTAGTATCAA TAACTGAAAA AATTTCAGAA TATGAGATTA AAACTATTGC ACCATTACAT GGTCCAGCAA TCGAATATAG CTTAAAAAGC TTTTTAAATG ACTACATTAG ATGGGGAGAG AATCTCTTAA CAAATAATCC TAAGATCGCT CTGATATATG CAAGTGCATA TGGAAATACA GCCTCAATAG GTGATGCATT AGCTAAAGGG ATAAATCGAA CCTCTGTAGA AGTTGAAAGT ATTAATTGTG AATTCACACC TAATGATGTA CTCGTAAAAT CTATCCAAAA TGCTGATGGA TATTTAATAG GATCACCCAC ATTGGGTGGT CACGCCCCAA CTCCAATAGT TAGTGCACTA GGAACATTAT TAGCAGAGGG GAATAGAGAT AAGCCAGTGG GAATTTTTGG CAGTTTTGGC TGGAGCGGCG AAGCTATAGA TTTACTTGAA ACAAAATTAA AAGACGGAGG TTTTAAGTTT GGTTTTGACC CTGTAAGGAT TAAGTTCAGT CCAAATAAAC CAAAGATTAA AGAGTTAGAA GAAATAGGAA CTCACTTTGG AAGAAAAATC ATAAAAAAAG CAAAGATAAA ACCTAGAAAA TCAGATACTG GAATGATTAC AAGCAAAACA GATCCAAAGC TACAAGCTCT TGGAAGAGTA ATTGGGTCAC TCTGTGTCTT GACAGCCTCT AAAGGCAAAG ATGAGAATAA TATTAAAGGG GCTATGCTTG CATCTTGGGT TAGTCAAGCA AGTTTTTCAC CGCCCGGGTT AAGTATTGCA GTAGCAAAAG ATAGATCGGT TGAGTCTCTT TTGCAAATAG GAGATTCATT CGCATTAAAT ATTCTTAGTG AAAAAGATTA TAAAGAACCT TTAAAAAGAT TCACAAAGCC CTTTGCACCT GGAGAAGATA GATTTGAAGG AATAAAAATT GAATTAACTC CTAACGAACA GATAATCATT CCTGAATCGC TCGCATGGTT AGATGCTTCT GTAAAAGAAA GAATGGAATG TGGAGATCAT TGGGTAATTT ACGCCGAAGT ACTACACGGT AATATACTAA AATCCGATAG TTTAGCGGCA GTTCATCATC GCAAAACCGG CGCTAACTAT TAA
|
Protein sequence | MSDISIGLLA ENQNLSEFEI TDNFSCVRFL DKKKERFELE FNLEKGTSFN TFFIRSNAEL FIIHPPEKQY LNLFNKVISR FCDQFKLDNI NFISGHINPQ IIETIKNIST QFQNTTITCS NPGYKLISEL WNQRNPNLEN FIEIQLPEIN IIKKELNLEL DKVSLELIPI PTARWPGGLI IYERNQEILL SEKIFSAHIA SKYWSETNRI STEIDRKHFY DCLMAPMSNQ VVSITEKISE YEIKTIAPLH GPAIEYSLKS FLNDYIRWGE NLLTNNPKIA LIYASAYGNT ASIGDALAKG INRTSVEVES INCEFTPNDV LVKSIQNADG YLIGSPTLGG HAPTPIVSAL GTLLAEGNRD KPVGIFGSFG WSGEAIDLLE TKLKDGGFKF GFDPVRIKFS PNKPKIKELE EIGTHFGRKI IKKAKIKPRK SDTGMITSKT DPKLQALGRV IGSLCVLTAS KGKDENNIKG AMLASWVSQA SFSPPGLSIA VAKDRSVESL LQIGDSFALN ILSEKDYKEP LKRFTKPFAP GEDRFEGIKI ELTPNEQIII PESLAWLDAS VKERMECGDH WVIYAEVLHG NILKSDSLAA VHHRKTGANY
|
| |