Gene Information |
Locus tag | P9211_03681 |
Symbol | |
ID | 5731859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 350808 |
End bp | 352157 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641284722 |
Product | NAD-dependent aldehyde dehydrogenase |
Protein accession | YP_001550253 |
Protein GI | 159902909 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
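The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes 1-based, inclusive coordinates (the record does not state its convention, but this is the one consistent with the listed gene length) and a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length_bp(350808, 352157)
print(length, protein_length_aa(length))  # 1350 449
```

Both values agree with the Gene Length and Protein Length rows of this record.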
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.975414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGAGC CAGTTATCTC TGGCCTGACA AGATCAGAGA ACTGGCGAAG AATTCAGCTC AAAAGATTAA CAGCAATTAC TGAAGAGCAT GAAAGTGAAA TCCTCTCATC ATTAGCTACT GACCTTGGCA AGCCACCTAC GGAAGGTCTC TTTGAAATTA TATCGTTAAA GCAAGAATTA AAAGTTGCCC AGAAAAACCT AACTGACTGG ATGCGTCCAA AATTAGTAAA TGTGCCGCTA TTTCTGAAGC CCGGGAAAGC AAAACTTCAA AATGAGCCAC TGGGTTGCGT ATTGATTATT GGTGCATGGA ACTATCCTTT TATGCTTACT CTTCAACCAT TAATATCAGC ATTGGCTGCA GGCAACACAG CTGTTCTTAA GCCTTCAGAG TTTGCTCCCT CGACTTCTAA TTTAATAGCA AGACTGATTA GCAAGCATTT TCCAAAAGAT ATAGTGAGAG TATTAGAAGG AGATTCAGAA TTTTCCAAAC AATTGATGAA TAATAAATTT GATCATATTT TTTTTACAGG GGGAAGCAAA ACTGGCGCAA AGATTATGGA AGCTGCGGCA AAACATTTGA CACCAGTAAC ACTTGAACTT GGCGGGAAGA ATCCTGCAAT AGTTCTCAAA GGCTCAGATC TGAAGACAAC AGCGAAAAGA CTTGTATGGG GAAAATCCAT TAATTCTGGT CAAACTTGCC TGGCTCCTAA TCATCTTCTG GTCGAAGAGG GTATAAAAGA TGATTTAGTT GAATGTATGT GCAAATCTAT TATTGAATTT TACGGGGAAA ATCCATTGGA CTCTCCTGAT TTAGGTAAAA TAATAAATAG CAATCAATTC AATAGACTAA TTGATCTATT AAATCAAATA AAAACTAAGA ACCAAATACT GTTTGGTGGA GATGTTGACA ATAAGAACAA GAGAATTAGC CCAACATTAA TTGAGCTAGA TTCAATTAAT GATCCTTTAA TCGAAGAGGA ATTATTTGGA CCAATACTAC CTATTTTAAG TATTCCAAAT CTAGATTTTG CAATTTCTGA AATAAGGAAG CAACCCAAGC CGCTTGCTAT TTATATGTTT GGCGGTTCAG GTGAACAGCA AAAAACACTA TTGGATAAAT CTAGCTCTGG AGGAGTTTGT TTTAACGATG TAGTGATGCA GGCTGCGATC CCTGAACTCC CATTTGGTGG CGTAGGGAAT AGCGGAATGG GTAGATATCA TGGAAAAGCA GGATTCGATA ATTTTTCTCA TCAAAAATCA ATCCTTGAAC GACCATTTTG GCTAGACATT CAATTCCGAT ACCCTCCATA CAAGATTGAT ATTTCCTTGT TTAAAAAGCT CTTCAGGTGA
Protein sequence | MLEPVISGLT RSENWRRIQL KRLTAITEEH ESEILSSLAT DLGKPPTEGL FEIISLKQEL KVAQKNLTDW MRPKLVNVPL FLKPGKAKLQ NEPLGCVLII GAWNYPFMLT LQPLISALAA GNTAVLKPSE FAPSTSNLIA RLISKHFPKD IVRVLEGDSE FSKQLMNNKF DHIFFTGGSK TGAKIMEAAA KHLTPVTLEL GGKNPAIVLK GSDLKTTAKR LVWGKSINSG QTCLAPNHLL VEEGIKDDLV ECMCKSIIEF YGENPLDSPD LGKIINSNQF NRLIDLLNQI KTKNQILFGG DVDNKNKRIS PTLIELDSIN DPLIEEELFG PILPILSIPN LDFAISEIRK QPKPLAIYMF GGSGEQQKTL LDKSSSGGVC FNDVVMQAAI PELPFGGVGN SGMGRYHGKA GFDNFSHQKS ILERPFWLDI QFRYPPYKID ISLFKKLFR
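The gene sequence as displayed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows how the protein sequence and GC content can be rechecked from the gene sequence. It assumes the space-separated 10-base blocks are display formatting only, and it uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# NCBI codon-table layout: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace between blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGCTTGAGC CAGTTATCTC TGGCCTGACA"
print(translate(first_blocks))  # MLEPVISGLT
```

The output matches the first block of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` gives a way to compare against the listed protein sequence and the 38% GC content field.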