Gene Information |
Locus tag | P9303_21711 |
Symbol | |
ID | 4776458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1926550 |
End bp | 1927929 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087681 |
Product | putative aldehyde dehydrogenase |
Protein accession | YP_001018171 |
Protein GI | 124023864 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
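The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1926550
end_bp = 1927929

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an uninterrupted reading frame (the record says it is
# not spliced) and ends with one stop codon that encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1380 bp) and Protein Length (459 aa) fields.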
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0450332 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTCCCTG AGAACTTTCT GCTGAGTGGG CTACGCAAGC CAGTCATTTC GGGCCTCACA AGACCAGAAG CCTGGCGCAG AAAGCAACTC AAGCAAATTG AGGTCCTCAT CGAAAAGCAC CAAGACGAGG TACTTGATGC CCTGGCAACG GATCTAGGCA AACCACCAAC CGAGGCATTG TTTGAGCTGA TCGCGCTGCG TGGAGAGCTC AAACTGGCTC AACGGCAGCT CAGCCGCTGG ATGCAACCCA GGCACGTACA AGTGCCTTTA GCCCATCAGC CAGGTCAAGC CGAGGTGATC CTCGACCCCC TGGGTTGCGT TCTGATCATC GGCCCCTGGA ATTACCCCTT CTCTCTAACC CTCCAACCAC TGATTAGTGC TCTGGCAGCA GGCAATACAG CAGTGCTCAA ACCTTCCGAG CATGCCCCTG CCACCTCTCG ACTGATCGCC CATGTGATCC CTCAACATTT CTCCAGCGAG GTCGTACAGG TGATCGAGGG GGATGGGGCC ATTGCAGCGG CACTGATCAA GCAACCCTTT GATCACATCT TCTTTACCGG CAGCGGCGCC ATCGGCCAGA AGGTCATGGC TGCTGCCGCC GAACATCTCA CTCCGGTGAC CCTGGAGCTA GGAGGGAAAA GCCCAGCCAT CGTGATTGAT GGCGCCGATC TCTCGGTCAC GGCTAGGCGA TTGGTATGGG GGAAGGGTCT CAATGCTGGT CAAACCTGCA TCGCCCCAGA CCATCTACTT ATCCAAGAAC AACTCAAACA GCCTCTACTG CAGGCGATGA AAGGAGCCAT TACTGAGCTC TATGGAGGCG ATCCGCTGCG ATCACCCCAC CTCGCAAAAA TCATCAACGA TTGTCATTTC CAGCGACTAC AACACTTGCT TGATCAAGCA AAGCAGCGCG GCAAGGTGCT CTCAGGCGGA CAAATTGACC CCGATCAAAG ACGCATCGCT CCCACTCTGA TTGACGTAGA CAAGCGCGAC GATCCGCTGA TGGAGGAAGA GCTCTTCGGC CCACTGCTGC CTGTCATCAG CGTGCACAGC CTTAATGAAG CTCTCGCTGA GGTCCGACAA CAACCAAAGC CCCTGGCCCT CTATCTCTTT GGCGGAACAC ATGCCGACCA ACAACAGCTC CTCAACACAA CCAGCTCAGG CGGAGTTTGT TTCAACGATG TGGTGATGCA TGTAGGCATT CCTGAGCTGC CCTTTGGTGG AGTCGGGGCC AGTGGCATGG GGCGTTATCA CGGCCTGGCT GGTTTCGAAA CCTTTTCCCA TCAAAAGTCT GTCTTACGTC GCCCCTTCTG GCTAGATCTC AAATTGCGCT ACCCCCCCTA CAAGGCCAAT CTGGCGATGC TCAAAAAGCT GCTGGGATAA
|
Protein sequence | MLPENFLLSG LRKPVISGLT RPEAWRRKQL KQIEVLIEKH QDEVLDALAT DLGKPPTEAL FELIALRGEL KLAQRQLSRW MQPRHVQVPL AHQPGQAEVI LDPLGCVLII GPWNYPFSLT LQPLISALAA GNTAVLKPSE HAPATSRLIA HVIPQHFSSE VVQVIEGDGA IAAALIKQPF DHIFFTGSGA IGQKVMAAAA EHLTPVTLEL GGKSPAIVID GADLSVTARR LVWGKGLNAG QTCIAPDHLL IQEQLKQPLL QAMKGAITEL YGGDPLRSPH LAKIINDCHF QRLQHLLDQA KQRGKVLSGG QIDPDQRRIA PTLIDVDKRD DPLMEEELFG PLLPVISVHS LNEALAEVRQ QPKPLALYLF GGTHADQQQL LNTTSSGGVC FNDVVMHVGI PELPFGGVGA SGMGRYHGLA GFETFSHQKS VLRRPFWLDL KLRYPPYKAN LAMLKKLLG