Gene RPB_4433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4433 
Symbol 
ID3912248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5023422 
End bp5024483 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content70% 
IMG OID637886338 
Productthreonine aldolase 
Protein accessionYP_488030 
Protein GI86751534 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTG CCAGCGACAA CGCCGTCGGC GCGAGCCCGC GTGTGCTCGA GGCTCTCCTC 
GCGGCCAATG ACGGCGCCGA GCCGGCCTAT GGCCACGATT GCTACAGCCA CCGCGCGCGC
GCGCTGCTGA ACGAGGTGTT CGAATGCGAG GTCTCCGCTT ACTTCGTCGC GACCGGAACG
GCGTCGAACG CGCTCGCGCT CGGCGCGATC ACCCCGCCTT GGGGCGCGGT GTTCTGTCAT
CACCAGGCCC ACATCGCCAA TGACGAATGC GGCGCGCCGG AAATGTTCAC CGCCGGCGCC
AAACTGATCG GCGTCGACGG CGTGCAGGGC AAGATCGATC CGGCCGCCCT GCGCGACATC
CTCAGCGGAT TTCCCGCGGG CACGGTGCGG CAGGTGCAGC CGGCCTCGCT GTCGCTGTCG
CAGGCGACCG AATGCGGCAC TCTTTACGAC TGCGGCGAGA TCGCCGAACT CGCCGCCATC
GCCCACGATC GCGGCCTGGC GGTGCATATG GACGGCGCGC GCTTCGCCAA TGCGCTGGTC
GCGATCGGAT GCACGGCGGC CGAGATGAGC CGGAAGGCCG GCATCGACGT CCTCTCCTTC
GGCGCCACCA AGAACGGCGC TTTGGCCTGC GAGGCCGTGA TCTTCTTCGA CGAGGCGAAG
GCCGCGGCGT TCGCCTATCA GTGCAAGCGC GCCGGACACG TCCTGTCCAA GGGGCGGATG
CTCGGCGCGC AGATGGCGGC CTATCTCGCC GGCGGACACT GGCTCGACCT CGCGCGGCTG
GCCAACCGGC GCGCCGCCGA ATTGTCGGAC GGCCTCACCA AGGTGCCTGG CGTGCGGCTG
GCTTTCGAGC CGCGCGGCAA TCAGCTCTTT GCCGCGCTGC CCCGTCCGGT CGATGCGGCG
CTCAGGAAGG CCGGCGCGCG CTACTACGAA TGGGGCGACC GCGGCTTCGG GCGGATGCTG
ACGCTCGGGA CGAACGACGT TCTGGTTCGG CTGGTGACGT CCTTCGCGAC CTCGGCCGAC
GACGTCCGCG CCTTCGTGAG CGCCGCCCGG GGCGCGGCGT GA
 
Protein sequence
MDFASDNAVG ASPRVLEALL AANDGAEPAY GHDCYSHRAR ALLNEVFECE VSAYFVATGT 
ASNALALGAI TPPWGAVFCH HQAHIANDEC GAPEMFTAGA KLIGVDGVQG KIDPAALRDI
LSGFPAGTVR QVQPASLSLS QATECGTLYD CGEIAELAAI AHDRGLAVHM DGARFANALV
AIGCTAAEMS RKAGIDVLSF GATKNGALAC EAVIFFDEAK AAAFAYQCKR AGHVLSKGRM
LGAQMAAYLA GGHWLDLARL ANRRAAELSD GLTKVPGVRL AFEPRGNQLF AALPRPVDAA
LRKAGARYYE WGDRGFGRML TLGTNDVLVR LVTSFATSAD DVRAFVSAAR GAA