Gene Rsph17029_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1604 
Symbol 
ID4895080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1687991 
End bp1688992 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content65% 
IMG OID640112195 
Productglyceraldehyde-3-phosphate dehydrogenase, type I 
Protein accessionYP_001043486 
Protein GI126462372 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.531435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0505585 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTGA AAGTGGCAAT CAACGGCTTC GGCCGCATCG GGCGGAACGT GCTCCGCGCC 
ATCATCGAAT CGGGCCGGAC CGATATCGAG GTGGTGGCGA TCAACGATCT CGGCCCGGTC
GAGACCAACG CGCACCTGCT GCGCTTCGAC TCGGTCCACG GCCGCTTCCC CGCCACCGTC
ACCACCACCG AGAAGACCAT CGACGTGGGC CGCGGCCCGA TGGATGTGAC CGCGATCCGC
AACCCGGCCG AACTGCCCTG GGGCCATGTC GACATCGTGC TCGAATGCAC CGGCATCTTC
ACCGACAAGG AGAAGGCGAA GGTCCACCTC GAGAGCGGCG CCAAGCGCGT GCTGGTCTCG
GCCCCCTCGA CCGGCGCCGA CAAGACCATC GTCTATGGCG TGAACCACGA GACCCTGACC
AAGGACGACC TCATCGTCTC GAACGCCTCC TGCACGACGA ACTGCCTCTC GCCGGTCGCC
AAGGTGCTGA ACGACACGAT CGGCATCACC AAGGGCTTCA TGACGACGAT CCACAGCTAT
ACGGGCGACC AGCCGACGCT CGACACGATG CACAAGGATC TCTACCGCGC CCGCGCCGCG
GCGCTGAGCA TGATCCCGAC CTCGACCGGC GCCGCCAAGG CCGTGGGCCT CGTGCTGCCG
GAACTGAAGG GCAAGCTCGA CGGCGTGGCG ATCCGGGTGC CGACGCCGAA CGTCTCGGTG
GTGGACCTCG TGTTCGAAGC CTCGCGCGCG ACCAGCGTCG AGGAAGTGAA CGCCGCCATC
CGCGAGGCTG CCGACGGCAA GCTGAAGGGC ATCCTCGGCT ATACCGACCA GCCCAACGTC
TCGATGGACT TCAACCACGA TCCGCACAGC TCGATCTTCC ACCTCGACCA GACCAAGGTC
ATGGAAGGCA ACATGGTGCG GATCCTCACC TGGTACGACA ACGAATGGGG CTTCTCGAAC
CGCATGGCCG ATACGGCCGT GGCCATGGGC AAGCTCATCT GA
 
Protein sequence
MTVKVAINGF GRIGRNVLRA IIESGRTDIE VVAINDLGPV ETNAHLLRFD SVHGRFPATV 
TTTEKTIDVG RGPMDVTAIR NPAELPWGHV DIVLECTGIF TDKEKAKVHL ESGAKRVLVS
APSTGADKTI VYGVNHETLT KDDLIVSNAS CTTNCLSPVA KVLNDTIGIT KGFMTTIHSY
TGDQPTLDTM HKDLYRARAA ALSMIPTSTG AAKAVGLVLP ELKGKLDGVA IRVPTPNVSV
VDLVFEASRA TSVEEVNAAI REAADGKLKG ILGYTDQPNV SMDFNHDPHS SIFHLDQTKV
MEGNMVRILT WYDNEWGFSN RMADTAVAMG KLI