Gene Rsph17029_2445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2445 
Symbol 
ID4897954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2579646 
End bp2580620 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content75% 
IMG OID640113043 
Productaminotransferase, class I and II 
Protein accessionYP_001044319 
Protein GI126463205 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01140] L-threonine-O-3-phosphate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0228466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.020665 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGATC ACGGCGGCAA TCTCGACAGT GCGGCGGCGC TCTTTGGCGG GGCGGTCGAG 
GACTGGCTCG ACCTCTCGAC GGGGATCAAT CGGGTGCCCT ATCCGATGCC GCCCCTGCCC
GCCCGCGCGC TGACCGCCCT GCCCGATGCA GCGGCCGAGG CGCGCCTTCT GGCCGCGGCG
CGCCTGGCCT TCCGCACGGA GGCGCCGATG CTGGCCGTGG CGGGGGCACA GGCGGCGATT
CAGCTGGTGC CGCGGCTCAC CCCGCCCGGC CGGGCGCGCG TCCTTGGCCC CACCTACAAC
GAACATGCGG CAAGCCTCCG CGCCGCGGGC TGGCAGGTCG AGGAGGTGAG CGAGCTTGCG
GCGCTCGAAG GCGCCGATCT CGCCGTGCTC GTCAATCCGA ACAACCCCGA CGGCCGCCGC
CACCCGCCCG AGGCGCTCCG GGCGCTTCTG CCCCGCGTGG GCCGGCTTCT GGTCGATGAG
AGCTTCGGCG ATCCGCTGCC CGATCTGTCG CTCGCCCCCG AGGCCGGGGT GCCGGGGCTT
CTGGTGCTGC GCTCCTTCGG CAAGTTCTAC GGGCTGGCGG GGCTGCGTCT GGGCTTCGTG
CTGGGAAATG CCGAGGATGT GGCGGCGCTG GCGCGGATGG CGGGGCCGTG GGCGGTCTCG
GGCCCGGCCA TCGCCGCGGG AACCGTGGCC CTGGCCGACC ACGACTGGGC GGAGACCACG
ACCGCGCGGC TCGAGGCCGA GGGGCCGCGC CTCGATGCGC TGGCCGCCCG GATGGGCTGG
CGGCTGGCGG GCGGGGCGCA TCTCTTCCGG CTCTACGACA CGCCGAATGC CCGTGCGGCG
CAGGACCACC TTGCCCGCGC CCGGATCTGG AGCCGGATCT TCCCCTGGTC TGACCGGCTG
ATCCGCCTCG GGCTGCCGGG CGGCGAAGCG GAATGGGCGC GTCTCGATGC GGCCTTCGGT
GCCGGGATCC GCTAG
 
Protein sequence
MRDHGGNLDS AAALFGGAVE DWLDLSTGIN RVPYPMPPLP ARALTALPDA AAEARLLAAA 
RLAFRTEAPM LAVAGAQAAI QLVPRLTPPG RARVLGPTYN EHAASLRAAG WQVEEVSELA
ALEGADLAVL VNPNNPDGRR HPPEALRALL PRVGRLLVDE SFGDPLPDLS LAPEAGVPGL
LVLRSFGKFY GLAGLRLGFV LGNAEDVAAL ARMAGPWAVS GPAIAAGTVA LADHDWAETT
TARLEAEGPR LDALAARMGW RLAGGAHLFR LYDTPNARAA QDHLARARIW SRIFPWSDRL
IRLGLPGGEA EWARLDAAFG AGIR