Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_00481 |
Symbol | |
ID | 5730885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 47074 |
End bp | 48843 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641284390 |
Product | flavoprotein |
Protein accession | YP_001549933 |
Protein GI | 159902589 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0426] Uncharacterized flavoproteins [COG1853] Conserved protein/domain typically associated with flavoprotein oxygenases, DIM6/NTAB family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGACC CTGTTGCTGT AATTGATCAA TCCCAACCTG AGCTTCAGCT ATCTTTGCAA TGCGAGATAA TTGCGCCCAA TACTACTACT ATTCGTTCAT TAGATTGGAA GCGTAGTCGG TTCGATATTG AATTTGGTCT CAGAAACGGA ACAACCTACA ATAGTTTCAT TATAGAAGGT GAGAAGAAAG CCTTAATAGA TACCAGTCAT GAAAAATTTC GAGATAGCTG GCTAAAACTT TTTAAAGAGC AAGTCAATCC ACAAGAACTT GATTTTTTGG TTGTTAGCCA TACAGAACCA GATCATTCTG GCTTAATCAG TTACCTTCTT GATTTCAATC CAGAAATTCA AATTATTGGC TCAAAAGTTG CAATTCAATT TTTAGAAAGT CAAGTTCATC GCCCCTTTAA ATCCCAAGCT ATAAAAACTG GATGCGAGCT TGATCTAGGC ATTAACCCAT CTAATGGAAT CCATCACAAA TTGGAATTTC TAAGTGCACC AAATTTACAT TGGCCAGATA CCATTTTTTC CTTTGATCAT GCCACTGCAA TCCTCTTTAC TTGTGATGCC TTTGGCCTCC ATTACTGCTC AAGTGACATG TTCGATATTG ATCCAGAATT AATACTTCCT GATTTCCGTT ATTACTACGA TTGTCTCATG GGCCCAAATG CACGTAGTGT TTTACAAGCA CTAAAGCGAA TAAAAAATTT ACCCAAAATA ACAACAATCG CCGTTGGGCA TGGTCCCCTT TTACGACACA ACATTGATCT TTGGCTAAGC AGTTATCTAA GCTGGAGTGA AAAACGTAAC AAAGGGGAAG GGTATGCAGC TGTTTGTTAT GTAAGCCAAT ATGGTTTTTG TGACCGCCTA AGTCAGGCAA TTGCATTAGG TATTAATAAA GCCGATGCGC AAGTTCAACT TATTGATCTG AGAGCTTCTG ATTCTCAAGA AATCAGTGCA CTCATAGGAG AAGCTAATGC AATCATTGTT CCTACTTGGC CTCATAACAC TGATGCTGAA CTTCAAAGCG CTATAGGAAC ACTGCTTGCA GCTCTTAAAC AAAAACAATG GGTAGCTGTA TACGATGCTT ATGGTGGAAA TGATGAACCA ATTGATGTAG TAGCCAATCA ATTAAGAAGC CTGGGTCAAA AAGAAGCCTT CTCGCCACTA AGAGTTCGTG GTACACCGGA TGCAAATACT TTTCAACGTT TTGAAGAAGC AGGCACTGAT TTAGGACAAC TACTTAATCG TAAAAAAAAT ATTGCAAATA TTAAAAGTTT TAGTGGCGAT CTAATGAAAG CAATGGGTCG AATAAGCGGG GGTTTATATG TAGTAACAGC CAGTCAAGGT AAAGGAAAAG AGCAACGACG AGGCGCAATG GTAGCCAGTT GGGTAAGTCA AGCAAGCTTC AATCCTCCAG GCATCACTGT TGCCGTTGCA AAAGATCGTG CAATAGAAAC TCTTATGCAA GTGGGAGATC GTTTTGTAAT CAATGTACTT CAAGAAAATA ACTATCAAAA ATTATTCCGT CAGTTTCTAA AAAGATTTCC ACCGGGAGCT GATCGTTTTG AAGGTATCTC CATATTAGAA GATGTCACAA AAGGAGGTCC AGTATTGGTC GATGCCCTCG CATACTTAGA TTGTCTAGTC AAGCAAAGAT TAGAGACTAC AGATCATTGG GTTATCTACG CGTTAGTAGA GCATGGAAAC GTCGCTAACG TAGAGTCCAA AACAGCTGTC CATCATCGCA AAGTAGGCAC GTCATATTGA
|
Protein sequence | MTDPVAVIDQ SQPELQLSLQ CEIIAPNTTT IRSLDWKRSR FDIEFGLRNG TTYNSFIIEG EKKALIDTSH EKFRDSWLKL FKEQVNPQEL DFLVVSHTEP DHSGLISYLL DFNPEIQIIG SKVAIQFLES QVHRPFKSQA IKTGCELDLG INPSNGIHHK LEFLSAPNLH WPDTIFSFDH ATAILFTCDA FGLHYCSSDM FDIDPELILP DFRYYYDCLM GPNARSVLQA LKRIKNLPKI TTIAVGHGPL LRHNIDLWLS SYLSWSEKRN KGEGYAAVCY VSQYGFCDRL SQAIALGINK ADAQVQLIDL RASDSQEISA LIGEANAIIV PTWPHNTDAE LQSAIGTLLA ALKQKQWVAV YDAYGGNDEP IDVVANQLRS LGQKEAFSPL RVRGTPDANT FQRFEEAGTD LGQLLNRKKN IANIKSFSGD LMKAMGRISG GLYVVTASQG KGKEQRRGAM VASWVSQASF NPPGITVAVA KDRAIETLMQ VGDRFVINVL QENNYQKLFR QFLKRFPPGA DRFEGISILE DVTKGGPVLV DALAYLDCLV KQRLETTDHW VIYALVEHGN VANVESKTAV HHRKVGTSY
|
| |