Gene Rsph17029_4002 details


Gene Information

Locus tag: Rsph17029_4002
Symbol:
ID: 4899141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 1145891
End bp: 1146892
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 67%
IMG OID: 640114605
Product: glyceraldehyde-3-phosphate dehydrogenase, type I
Protein accession: YP_001045852
Protein GI: 126464739
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
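
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record; it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon.
    """
    gene_length = end_bp - start_bp + 1
    # One codon is three bases; subtract one for the terminal stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the Start bp and End bp fields of this record.
print(cds_lengths(1145891, 1146892))  # (1002, 333)
```

Under these assumptions the result agrees with the Gene Length (1002 bp) and Protein Length (333 aa) fields.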


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATCA GGGTTGCCAT CAACGGCTTC GGCCGTATCG GCCGCAACGT GCTGCGGGCC 
ATCGTGGAGT CGGGGCGCAC CGATATCGAG GTGGTCGCGA TCAACGATCT GGGTCCGGTC
GAGACCAACG CGCATCTTCT GCGCTTCGAC AGCGTGCACG GCCGCTTCCC GGCCAAGGTC
ACCAGCGGAG ACGACTGGAT CGATGTGGGC CGCGGCCCGA TCAAGGTGAC GGCGATCCGC
AATCCGGCGG AGCTGCCCTG GGCGGGTGTC GACGTGGCGA TGGAATGCAC GGGCATCTTC
ACCTCGAAGG AGAAGGCCGC GGCCCATCTG CAGAACGGGG CGAAGCGGGT GCTCGTCTCG
GCGCCCTGCG ACGGGGCCGA CCGGACCATC GTCTATGGGG TGAACCATGC GACGCTCACC
GCGGACGACC TCGTGGTCTC GAATGCCTCC TGCACCACCA ACTGCCTCTC GCCGGTGGCC
AAGGTGCTTC ACGATGCGAT CGGCATCGCC AAGGGCTTCA TGACCACGAT CCACAGCTAC
ACGGGCGACC AGCCCACCCT CGACACGATG CACAAGGATC TCTACCGCGC GCGGGCCGCG
GCGCTGAGCA TGATCCCGAC CTCGACCGGC GCCGCGAAGG CCGTGGGCCT CGTGCTGCCC
GAGCTCAAGG GCCGGCTCGA CGGCGTGTCG ATCCGGGTGC CCACGCCCAA TGTCTCGGTG
GTGGATCTGG TGTTCGAGGC CGCCCGCGAC ACGACGGTGG AGGAGGTGAA TGCGGCCATC
GAGGCCGCCG CCCGCGGACC GTTGAAGGGC GTGCTGGGCT TCACGACCGA GCCCAACGTC
TCGTCCGACT TCAACCACGA CCCGCATTCG TCGGTGTTCC ACATGGACCA GACCAAGGTG
ATGGAGGGCC GCATGGTCCG CATCCTCAGC TGGTACGACA ACGAATGGGG CTTCTCGAAC
CGGATGGCCG ACACCGCCGT GGCGATGGGC CGGCTTCTCT GA
 
Protein sequence
MTIRVAINGF GRIGRNVLRA IVESGRTDIE VVAINDLGPV ETNAHLLRFD SVHGRFPAKV 
TSGDDWIDVG RGPIKVTAIR NPAELPWAGV DVAMECTGIF TSKEKAAAHL QNGAKRVLVS
APCDGADRTI VYGVNHATLT ADDLVVSNAS CTTNCLSPVA KVLHDAIGIA KGFMTTIHSY
TGDQPTLDTM HKDLYRARAA ALSMIPTSTG AAKAVGLVLP ELKGRLDGVS IRVPTPNVSV
VDLVFEAARD TTVEEVNAAI EAAARGPLKG VLGFTTEPNV SSDFNHDPHS SVFHMDQTKV
MEGRMVRILS WYDNEWGFSN RMADTAVAMG RLL
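
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is an illustrative example rather than the tool that produced the record. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs in the start codons it permits), and it stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGACAATCA GGGTTGCCAT CAACGGCTTC"))  # MTIRVAINGF
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.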