Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4433 |
Symbol | |
ID | 3912248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5023422 |
End bp | 5024483 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637886338 |
Product | threonine aldolase |
Protein accession | YP_488030 |
Protein GI | 86751534 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2008] Threonine aldolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTTG CCAGCGACAA CGCCGTCGGC GCGAGCCCGC GTGTGCTCGA GGCTCTCCTC GCGGCCAATG ACGGCGCCGA GCCGGCCTAT GGCCACGATT GCTACAGCCA CCGCGCGCGC GCGCTGCTGA ACGAGGTGTT CGAATGCGAG GTCTCCGCTT ACTTCGTCGC GACCGGAACG GCGTCGAACG CGCTCGCGCT CGGCGCGATC ACCCCGCCTT GGGGCGCGGT GTTCTGTCAT CACCAGGCCC ACATCGCCAA TGACGAATGC GGCGCGCCGG AAATGTTCAC CGCCGGCGCC AAACTGATCG GCGTCGACGG CGTGCAGGGC AAGATCGATC CGGCCGCCCT GCGCGACATC CTCAGCGGAT TTCCCGCGGG CACGGTGCGG CAGGTGCAGC CGGCCTCGCT GTCGCTGTCG CAGGCGACCG AATGCGGCAC TCTTTACGAC TGCGGCGAGA TCGCCGAACT CGCCGCCATC GCCCACGATC GCGGCCTGGC GGTGCATATG GACGGCGCGC GCTTCGCCAA TGCGCTGGTC GCGATCGGAT GCACGGCGGC CGAGATGAGC CGGAAGGCCG GCATCGACGT CCTCTCCTTC GGCGCCACCA AGAACGGCGC TTTGGCCTGC GAGGCCGTGA TCTTCTTCGA CGAGGCGAAG GCCGCGGCGT TCGCCTATCA GTGCAAGCGC GCCGGACACG TCCTGTCCAA GGGGCGGATG CTCGGCGCGC AGATGGCGGC CTATCTCGCC GGCGGACACT GGCTCGACCT CGCGCGGCTG GCCAACCGGC GCGCCGCCGA ATTGTCGGAC GGCCTCACCA AGGTGCCTGG CGTGCGGCTG GCTTTCGAGC CGCGCGGCAA TCAGCTCTTT GCCGCGCTGC CCCGTCCGGT CGATGCGGCG CTCAGGAAGG CCGGCGCGCG CTACTACGAA TGGGGCGACC GCGGCTTCGG GCGGATGCTG ACGCTCGGGA CGAACGACGT TCTGGTTCGG CTGGTGACGT CCTTCGCGAC CTCGGCCGAC GACGTCCGCG CCTTCGTGAG CGCCGCCCGG GGCGCGGCGT GA
|
Protein sequence | MDFASDNAVG ASPRVLEALL AANDGAEPAY GHDCYSHRAR ALLNEVFECE VSAYFVATGT ASNALALGAI TPPWGAVFCH HQAHIANDEC GAPEMFTAGA KLIGVDGVQG KIDPAALRDI LSGFPAGTVR QVQPASLSLS QATECGTLYD CGEIAELAAI AHDRGLAVHM DGARFANALV AIGCTAAEMS RKAGIDVLSF GATKNGALAC EAVIFFDEAK AAAFAYQCKR AGHVLSKGRM LGAQMAAYLA GGHWLDLARL ANRRAAELSD GLTKVPGVRL AFEPRGNQLF AALPRPVDAA LRKAGARYYE WGDRGFGRML TLGTNDVLVR LVTSFATSAD DVRAFVSAAR GAA
|
| |