Gene Information |
Locus tag | P9301_00451 |
Symbol | |
ID | 4912211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 45500 |
End bp | 47302 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640159609 |
Product | flavoprotein |
Protein accession | YP_001090269 |
Protein GI | 126695383 |
COG category | [C] Energy production and conversion; [R] General function prediction only |
COG ID | [COG0426] Uncharacterized flavoproteins; [COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family |
TIGRFAM ID | |
|
|
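The coordinate and length fields above are mutually consistent. The short sketch below shows the arithmetic, under two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based inclusive positions, and that the CDS ends with a single stop codon that is not counted in the protein length (the gene sequence in this record does end in TAA). The variable names are illustrative only.

```python
# Values copied from the Gene Information table in this record.
start_bp = 45500
end_bp = 47302

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

With these assumptions the computed values match the Gene Length (1803 bp) and Protein Length (600 aa) fields listed above.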
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGAGA TTAATATAAG CTTACTTTCA GAAAATCAAA ACTTATCAAA ATTTGAAATT GCTGATAATT TTACTTGCAT CAGATTTCTA GATCGTCAAA AAGAAAGATT TGAACTTGAG TTTAATCTCG AAAAAGGAAC CTCTTTTAAC ACTTTTTTCA TAAGAAATAA TGAAGAACTT CTTATTATTC ATCCACCTGA AAAGCAATAT TTAAATTCAT TTATTGAGAT AATATCTAAC TTTTTTAATC AATATAAATT AAAAAAAATT AACTTCATAT CTGGACATAT CAATCCTCAA ATCATAGAAA CCATAAAAAA CGTTAGTAAG CGATTTCAAG ACTTAACTAT AACTTGCTCA AATCCTGGCT TCAAACTTAT TAGTGAACTT TGGAATCAAA GAAATCCTAA CTTAGAAGAA TTTTTTGAAA TTCAATTACC CAAAATAAAC ATAGTCAAAA AGGAACTTAA TTTAGAATTA AGTAATATTT CATTAGAGTT AATTCCTATT CCAACTGCGC GTTGGCCTGG AGGTCTAATC ATTTATGAAC GTAATCAAGA AATACTTCTA AGTGAAAAAA TTTTCTCTGC ACACATTGCA TCTGAATATT GGTCTGAGAC GAATCGAATT AGTACAGAAA TTGATAGAAA ACACTTTTAT GATTGTTTGA TGGCACCAAT GTCTAATCAA GTTGTATCAA TAACTGAAAA AATTGCAGAC TATGACATTA AAACTATTGC ACCATTACAT GGGCCAGCTA TCGAATATAG CTTAAAAAGC TTTTTCAATG ACTATATTCG ATGGGGAGAG AATCTCTCCA CAAATAATCC TAAGATCGCT CTTATTTATG CAAGTGCATA TGGAAATACA GCCTCAATTG GAGATGCATT GGCCAAAGGG ATAAATAGAA CTTCTGTTGA AGTAGAAAGT ATTAATTGTG AATTTACATC AAATGATGTA CTTATAAAAT CCATCCACAA TGCTGATGGA TATTTAATAG GATCTCCCAC TCTTGGTGGA CACGCCCCAA CTCCTATAGT AAGTGCACTA GGAACATTGT TAGCAGAGGG TAATAGAGAT AAGCCAGTTG GGATTTTTGG TAGTTTTGGG TGGAGCGGCG AGGCGATAGA TTTACTTGAA TCAAAATTGA AAGACGGCGG TTTTCAGTTT AGTTTTGATC CTATAAGAAT TAAATTCAGT CCAAATAAAC CAAAAATTAA GGAGCTAGAA GAAATAGGAA CTCACTTTGG GAGAAAAATT ATAAAGAAAG CTAAGAAAAA ACCTAGAAAA TCAGATACTG GAATGATTAC AAGTAAAACA GATCCAAAGT TGCAAGCTCT TGGAAGAGTA ATTGGTTCGC TTTGTGTCCT AACAGCCTCC AAAGGCAAAG ATGAGAATAA TATTAAAGGA GCTATGCTTG CATCTTGGGT TAGTCAAGCA AGTTTTTCGC CTCCGGGGTT AAGTATTGCA GTAGCAAAAG ATAGATCGGT AGAGTCTCTT CTACAAATAG GAGATTCATT CGCATTGAAT ATTTTAAGTG AAAAAGATTT TAAAGAACCT TTGAAAAGAT TCACAAAACC TTTTTCACCT GGTGAAGATA GATTTGAAGG TATAAATATT GAATTAACCC CAAATGAACA GATAATAATT CCTAACTCTC TAGCATGGTT AGAAGCGGCT GTAAAAGAAC GAATGGAATG TGGAGATCAT TGGGTAATAT ACGCTGAAGT ACTACACGGT AATATACTAA AATCCGATTG TCTAACAGCA GTTCATCACC GTAAAACAGG TGCTAACTAT TAA
|
Protein sequence | MSEINISLLS ENQNLSKFEI ADNFTCIRFL DRQKERFELE FNLEKGTSFN TFFIRNNEEL LIIHPPEKQY LNSFIEIISN FFNQYKLKKI NFISGHINPQ IIETIKNVSK RFQDLTITCS NPGFKLISEL WNQRNPNLEE FFEIQLPKIN IVKKELNLEL SNISLELIPI PTARWPGGLI IYERNQEILL SEKIFSAHIA SEYWSETNRI STEIDRKHFY DCLMAPMSNQ VVSITEKIAD YDIKTIAPLH GPAIEYSLKS FFNDYIRWGE NLSTNNPKIA LIYASAYGNT ASIGDALAKG INRTSVEVES INCEFTSNDV LIKSIHNADG YLIGSPTLGG HAPTPIVSAL GTLLAEGNRD KPVGIFGSFG WSGEAIDLLE SKLKDGGFQF SFDPIRIKFS PNKPKIKELE EIGTHFGRKI IKKAKKKPRK SDTGMITSKT DPKLQALGRV IGSLCVLTAS KGKDENNIKG AMLASWVSQA SFSPPGLSIA VAKDRSVESL LQIGDSFALN ILSEKDFKEP LKRFTKPFSP GEDRFEGINI ELTPNEQIII PNSLAWLEAA VKERMECGDH WVIYAEVLHG NILKSDCLTA VHHRKTGANY