Gene Rsph17029_2872 details


Gene Information

Locus tag: Rsph17029_2872
Symbol:
ID: 4897901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 3028651
End bp: 3029898
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 69%
IMG OID: 640113475
Product: threonine dehydratase
Protein accession: YP_001044746
Protein GI: 126463632
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR02079] threonine dehydratase
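
The start, end, gene length, and protein length fields can be cross-checked against one another. The sketch below is illustrative only and is not part of the database record; it assumes the coordinates are 1-based and inclusive (which matches the stated gene length) and that the final codon of the CDS is an untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 3028651
end_bp = 3029898

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1248 0 415
```

Both derived values agree with the Gene Length and Protein Length fields listed in the record.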


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGT TCGCCCGCAA CGCCCGTGCC GCCACCCGCG CCCTGCGCGA TCTCTTCCCC 
GAGACGCCGC TCCAGCGCAA CGATCACCTT TCGGCCCGCT ACGGTGCCGA CATCTGGCTC
AAGCGCGAGG ATCTGACGCC GGTGCGCAGC TACAAGCTGC GCGGCGCCTT CACGGCGATG
CGGAAGGTGC GCGATGCGCG CCCCGACCAG CGCTCCTTCG TCTGCGCCTC GGCGGGAAAT
CATGCGCAGG GCGTGGCTTA CGCCTGCCGG CATTTCGGTG TGAAGGGCAC GATCTTCATG
CCGGTGACGA CCCCGCAGCA GAAGATCGCC AAGACGCGGA CCTTCGGCGG CGAAGCGGTC
GAGATCGTGC TGACGGGCGA CTATTTCGAC CAGACCCTCG CCGCGGCCCA GGCCTGGTGC
GCCGAGCAGA AGGCCCATTT CCTCGCGCCC TTCGACGATC CCGACGTGAT CGAGGGGCAG
GCGAGTGTGG GGGTCGAGCT GCTCGAACAG CTCGGCCGGG CGCCGGATCT GGTGGTGCTG
CCGGTGGGCG GCGGCGGGCT TGCTTCGGGT GTCACGGCCT TCCTGCGGAG CGAGGCGCCG
GAGACCGACT TCCGGTTCGT CGAGCCTGCG GGCGGGGCCA GCCTTCTGGC CGCGCTGGAA
GCGGGAGGTC CCACGGCGCT GCCGCGCGTG AACAGCTTCG TCGACGGGGC CGCCGTGGCG
CGGCTGGGAC AGCTGCCCTT CTCGATGCTC GACTGGGTGC GCCCCGATCA GGTGCATCTG
GCGCCCGAGG ACCGGATCTG CATCACCATG CTCGAGATGC TGAACGTCGA GGGCATCGTG
CTCGAGCCTG CGGGAGCGCT GTCGGTGGAC GTCCTGCCGG AGCTGGCCGA CCGGATCCGC
GGCCGCACGG TCGTCTGCGT GACCTCGGGC GGCAATTTCG ACTTCGAGCG GCTGCCCGAG
GTGAAGGAGC GGGCGCAGCG CTACTCGGGC CTCAAGAAAT ACTTCATCCT GCGGATGCCG
CAGCGCCCCG GCGCGCTGCG CGAGTTCCTG ATGATGCTCG GCCCCGACGA CGACATCGCG
CGCTTCGAAT ATCTCAAGAA GTCCGCGCGC AACTTCGGCT CGGTCCTGAT CGGGATCGAG
ACCCGCGAGG CCGGGAATTT CGCGCGACTG ACGGCGGTGA TGGAGGAGGC GGGGCTGAAC
TACCGCGACA TCACCGGCGA CGATGCCCTG GCCGAGTTCC TGGTCTAG
 
Protein sequence
MTQFARNARA ATRALRDLFP ETPLQRNDHL SARYGADIWL KREDLTPVRS YKLRGAFTAM 
RKVRDARPDQ RSFVCASAGN HAQGVAYACR HFGVKGTIFM PVTTPQQKIA KTRTFGGEAV
EIVLTGDYFD QTLAAAQAWC AEQKAHFLAP FDDPDVIEGQ ASVGVELLEQ LGRAPDLVVL
PVGGGGLASG VTAFLRSEAP ETDFRFVEPA GGASLLAALE AGGPTALPRV NSFVDGAAVA
RLGQLPFSML DWVRPDQVHL APEDRICITM LEMLNVEGIV LEPAGALSVD VLPELADRIR
GRTVVCVTSG GNFDFERLPE VKERAQRYSG LKKYFILRMP QRPGALREFL MMLGPDDDIA
RFEYLKKSAR NFGSVLIGIE TREAGNFARL TAVMEEAGLN YRDITGDDAL AEFLV
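
As a rough consistency check between the two sequences above, the following sketch translates the first and last few codons of the gene sequence and compares them with the ends of the protein sequence. It is a minimal illustration, not the method used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons); the helper name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and last 18 bases of the gene sequence above.
first_bases = "ATGACCCAGT TCGCCCGCAA CGCCCGTGCC"
last_bases = "GCCGAGTTCC TGGTCTAG"

print(translate(first_bases))  # prints: MTQFARNARA
print(translate(last_bases))   # prints: AEFLV*
```

The first output matches the opening residues of the protein sequence, and the second matches its final five residues followed by the stop codon (shown as `*`), which is not part of the 415-residue protein.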