Gene RPB_4420 details


Gene Information

Locus tag: RPB_4420
Symbol:
ID: 3912235
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5006990
End bp: 5008252
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 68%
IMG OID: 637886325
Product: threonine synthase
Protein accession: YP_488017
Protein GI: 86751521
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
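
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5006990
end_bp = 5008252

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1263
print(protein_length)   # 420
```

Both values agree with the reported Gene Length (1263 bp) and Protein Length (420 aa).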


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.698202
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAGG TCAAAGGGCT GCAGTGCCTG CGCTGCGGCG CCCTTTATGC CCCGGATCAT 
TACGCCGAGG ACTGCCCGGC CTGCCGGCCG GTGGTGCGGA GCAATCTCAT CGTCGTCTAC
GATGAGCCGC TCGCCTTGCG CAAGCCGGAC GCGGCCGGCG CCGGGCCGTC GAGCGGCCTG
TGGCGCTACG GCGACGTGCT GCCGGTGAGC GAGGCCGACG CCGTGTCGCT CGGCGAGGGC
GGTTCGCCGC TCCGCCAACT GCGCGCGGTG GGCGATCAAC TCGGTCTCAA ACGGCTCTAC
GGCAAGGACG AGAGCGGCAA TCCGACCTGG TCGTTCAAGG ACCGTCTCGC CTGCATCGCG
GTGTCGGTCG CCAAGCAGAT GGGCGCCAAG ACTATCGTGT CGAGTTCATC CGGCAACGCC
GGCGCCGCCG CCGCCGCTTA TGCGGCGAAA GCCGGCATCC CCTGCGTGGT GTTCACGTTC
GGCTGGGCCG CAGGTCCGAT GGTGACGCAG ATGCGCGCCT ATGGGGCCAA GGTCGTCACC
GTGCCGCAGA AGGAAGACCG CTGGCGGTTC ATGGAGCACG CCGTGCGGCA GTATGGCTGG
TTTCCGACTT CGCCGTTTTT CGGCCCGGCC GTCGGCTCCA ATCCTTACGG CATCGAAGGC
TACAAGACGC TGGCCTACGA GACCGTCGAG CAGCTCGGCT GGCGGGCCCC GGATTGGTGC
ATCCTGCCGG TGTGCTACGG CGACGCGCTG ATCGGGATGT GGCGCGGCTT CACCGAGATG
AAGGCGGCGG GCTGGATCGA TCGGATGCCG AAGATGGTCG CCGCGGAGGT CTACGGCTCG
ATCGGCCGGG CGCTCGACGA CGACCTCGAA GCGCCGCCGG CGATGCCGAA GACCTTCGAC
ACGGTGTCGG GCTCGATCGG CGCCGTGCAG GGCACCTATC AGGCGCTCGA GATCGTGCGA
AAATCCGGCG GCCGAGCGGT GACGATCTCC AACGACGACA CCATGCGATG GCAGCGTCTG
CTGGCGACGC GTGAGGCCCG CTATCTCGAG CCGGCGTCCG CCGGCGGACT GGTCGCGGTC
GAGCGGCTCG CAAAATCGGG AATCATCAAG CCGGACGACG TCGTCGTCTC GCTGCTCACG
GCGTCGGGTC TGAAAGACCC GGCGGTCACT GCCGCCACCC AGGGCGACAC CATGGCGGTG
CCGTCCGATC TGTCGGCAGC CTGGCAAATC CTGCAATCGG CGGGAATAGT TCCAAGCAAC
TGA
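
The gene sequence is displayed in blocks of 10 bases, so the spacing and line breaks have to be removed before the sequence is measured. The following is a minimal sketch of how the length and GC content could be recomputed from the pasted text; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tooling.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, copied from this record.
first_line = "ATGGCAAAGG TCAAAGGGCT GCAGTGCCTG CGCTGCGGCG CCCTTTATGC CCCGGATCAT"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of `first_line` gives a figure that can be compared with the GC content field.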
 
Protein sequence
MAKVKGLQCL RCGALYAPDH YAEDCPACRP VVRSNLIVVY DEPLALRKPD AAGAGPSSGL 
WRYGDVLPVS EADAVSLGEG GSPLRQLRAV GDQLGLKRLY GKDESGNPTW SFKDRLACIA
VSVAKQMGAK TIVSSSSGNA GAAAAAYAAK AGIPCVVFTF GWAAGPMVTQ MRAYGAKVVT
VPQKEDRWRF MEHAVRQYGW FPTSPFFGPA VGSNPYGIEG YKTLAYETVE QLGWRAPDWC
ILPVCYGDAL IGMWRGFTEM KAAGWIDRMP KMVAAEVYGS IGRALDDDLE APPAMPKTFD
TVSGSIGAVQ GTYQALEIVR KSGGRAVTIS NDDTMRWQRL LATREARYLE PASAGGLVAV
ERLAKSGIIK PDDVVVSLLT ASGLKDPAVT AATQGDTMAV PSDLSAAWQI LQSAGIVPSN
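
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below builds the codon table from the standard NCBI base ordering (T, C, A, G) and translates up to the first stop codon. It is an illustration under stated assumptions: `translate` is a hypothetical helper, and the sketch ignores the alternative start codons that table 11 allows, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments for NCBI translation table 11,
# listed in the conventional order: each base cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string, ignoring whitespace, until the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGCAAAGG TCAAAGGGCT GCAGTGCCTG"))  # MAKVKGLQCL
```

The output matches the first block of the protein sequence above.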