Gene Information |
Locus tag | RPB_3806 |
Symbol | |
ID | 3911609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4343970 |
End bp | 4345025 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885707 |
Product | alcohol dehydrogenase |
Protein accession | YP_487411 |
Protein GI | 86750915 |
COG category | [C] Energy production and conversion |
COG ID | [COG1062] Zn-dependent alcohol dehydrogenases, class III |
TIGRFAM ID | |
|
|
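The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon, which is not translated. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive span
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(4343970, 4345025)
print(gene_len, protein_len)  # 1056 351
```

Under these assumptions the result agrees with the Gene Length (1056 bp) and Protein Length (351 aa) fields.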
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.925036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTGA CCTTCCGCGC CGCCGTTCTT CGTGAGCTGA ATCGGCCGCT CGCCATCGAG ACCGTCGAGG CGCCGCCGCT CGCGGCCGGG CAGGTGCTGG TCAAGCTGGC GTATTCCGGG GTGTGCCACA GCCAGGTGAT GGAAGCGCGG GGCGGCCGCG GCGTCGATCG CTATCTGCCG CACATGCTCG GCCACGAGGG CTCGGGCGTC GTGGTCGAGA CCGGCGCCGG CGTGACCAAG GTGAAGACGG GCGATCGCGT CATCCTCGGC TGGATCAAGG GCAAAGGCGC CGACGCGCAG GGCATCCGCT ACAAGAGCGG CGACGGCTTC ATCAATGCCG GCGCGGTGAC GACGTTCAAC GAATACGCCG TGGTCGCGGA GAACCGCGTG ACCCTGCTGC CGCAGGGCCT GCCGATGGAC GTCGCGGTGC TGTTCGGCTG CGCGCTGCCG GTCGGCGCCG GCATCGTCAT CAATATCGCC AAGCCCGCGC CCGGCAGCAC GCTCGCGGTG TTCGGGCTCG GCGGCATCGG GTTGTCGGCG CTGATGGCGT GCAAGCTGTT CGACTGCAGG CAACTGATCG CCGTCGATGT CGAGCCCGCG AAGCTGGCGA TGGCGCGCGA ACTCGGCGCC ACCGCGACGA TCGATGCGTC GCAGCAGGAT CCGGTCGCTG CGATCCGGGA GCTGACCGGC GGGCTCGGTG TCGACTACGC CATTGAATCC GCCGGCCTGG TGCGCGTCAT CGAGCAGGCG TTCGACGCCA CGCGGCGGTT CGGCGGGCTG TGCGTGTTCG CCTCGCATCC GCGTTCCGGC GAAAAGATCG CGCTCGACCC GTTCGAACTG ATCTGCGGCA AGCGCATCCT CGGCACCTGG GGTGGCGACG CCAATCCGGA CCGCGACGTC GACCTGCTCG CCGGCCTGTT CCGCGCCGGC AAGCTGCCGC TGGCCTCGAT GTTCAGCCGC CGCTACGCGC TCGACGAGAT CAACATCGCC CTCGACGATC TCGAACAGCG CCGCAGCGTG CGGCCGCTGA TCGAAATCGA TGCGACATTG GGCTGA
|
Protein sequence | MPVTFRAAVL RELNRPLAIE TVEAPPLAAG QVLVKLAYSG VCHSQVMEAR GGRGVDRYLP HMLGHEGSGV VVETGAGVTK VKTGDRVILG WIKGKGADAQ GIRYKSGDGF INAGAVTTFN EYAVVAENRV TLLPQGLPMD VAVLFGCALP VGAGIVINIA KPAPGSTLAV FGLGGIGLSA LMACKLFDCR QLIAVDVEPA KLAMARELGA TATIDASQQD PVAAIRELTG GLGVDYAIES AGLVRVIEQA FDATRRFGGL CVFASHPRSG EKIALDPFEL ICGKRILGTW GGDANPDRDV DLLAGLFRAG KLPLASMFSR RYALDEINIA LDDLEQRRSV RPLIEIDATL G
|
| |