Gene Rsph17029_3892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3892 
Symbol 
ID4899180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1023355 
End bp1025007 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content74% 
IMG OID640114496 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001045743 
Protein GI126464630 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.289685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0243729 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACC GCCGCCCCTT CCTGCTTCGC GGAGCGGATC TTCTCGTCTC CACGCTGGTC 
GATTGCGGCG TGACCCGGAT CTTCAGCCTG TCGGGCAACC AGATCATGCC GCTCTACGAC
GCCTGTCTGG GCTCGGGGAT CGAGATCGTC CATGTGCGTC ACGAGGCGGC CGCCGTTTTC
ATGGCCGAGG GCTGGGCGCA GCTTACGGGC GAAGTGGGCG TGGCGCTGGT CACCGCGGGC
CCGGGGGCTG CCAATGCGGT GGGGCCGCTG ATGAGCGCGC GCCAGTCCGA GACGCCGCTG
CTGCTCCTCA GCGGGGACTC GCCCCGGGCG CAGGACGGGA TGGGGGCGTT CCAGACCCTC
GATCAGGTGG CCGCGACCGC GCCCTTCACC AAACATTCCG CGCGCGTGGG CTCGGTGCCG
ACGCTGGCGC ACGAGCTCCG GGCGGCGCTG ACGCTGGCCC GGGCGGGGCG GCCGGGGCCG
GTGCATCTGG CGCTGCCGCA GGATCTTCTG ACCCAGCGCG CCGAGGGTGT GCCGAGCAGC
AGCCGGACCG CAGCCGCCAC GGTGCCGGCC AGCGAGCGCG AGACCGCGAG GCTCGCGGCC
GAGATCGCGG AAGCGCGCCG CCCGCTGATC GTGACGAGCG GCCTCTTCAG CCCGACCCGC
GGCGGCGATC TGGCCCGCAG GCTTGGTCAG AGGCTCGCCG TGCCGGTCGT GGCGCTGGAA
AGCCCGCGCG GCCTGCGCGA TCCGGCCCAG CCCGGCCTGC GGCAGCGGAT GGCCGACGCG
GATCTCGTGA TCTCACTCGG CAAATCCGTC GATTACATGC TCGACTTCGG TCGGGCCACC
TCCGCGGACT GCGGCTGGAT CGTGGTCGAG CCCGAGGACG GGGCGGGCGA AGAGGCCGTC
CGCAACCTCG GCCCGAAGCT GCGCCGCCTG ATCGCGGCCG ATCCGCGGGC GCTGGCGCAG
GGGCTGGCCG AGCTGCCCGA GGCCGGCTGC GACGCGGGAC GCCGCGACTG GTGCGCCCGC
TTCGGGCGCC GGCCCGCGCC GCCCGTGCTG GACGAGACGC CGGGGCCGAT CGATTCCGGG
CTGCTCTGCG CCACCCTGTC CGACGTTCTG GGCGCGGACG GGGGCGAGAC GATCCTCGTC
TCGGACGGCG GCGAGTTCGG CCAGTGGGCG CAGGCCCTCG TGCAGGCCGA CCGACGGCTC
ATCAACGGGC CCGCAGGCGG GATCGGCGGG GCGCTCGGTC ATGCGGTGGC GGCAAGCCTC
GCCTGCCCCG AGGCCCGCGT TGCGGTCGCC AGCGGGGATG GCAGCATCGG CTTTCATCTG
GCAGAACTCG AGACGGCGGT GCGCGCGGGC GCGGCCTTCG TGGTGGTGAT CGGCAACGAC
CGCCGCTGGA ATGCCGAGCA TCTGCTGCAG ATTCGCGAAT TCGGCCCCGA CCGCGTCCAT
GGATGCGAGC TCTCGGGCGC CCGTTACGAT CTGGTCGCCG CGGCCCTCGG GGGCACGGGC
GCCCATGTGA CATGCCGGTC CGAGCTTCGG GGCGCCCTGC GGCGCGCCTT CGCCGCCGGC
GGAGTGGTTC TGGTGAATGT GGAGATCGAG GGCCGCGCAG CCCCTTCCGG AGAGGAGCCG
GAAGCGGCGG AGACGGCGCA GCCCGACGGA TGA
 
Protein sequence
MNDRRPFLLR GADLLVSTLV DCGVTRIFSL SGNQIMPLYD ACLGSGIEIV HVRHEAAAVF 
MAEGWAQLTG EVGVALVTAG PGAANAVGPL MSARQSETPL LLLSGDSPRA QDGMGAFQTL
DQVAATAPFT KHSARVGSVP TLAHELRAAL TLARAGRPGP VHLALPQDLL TQRAEGVPSS
SRTAAATVPA SERETARLAA EIAEARRPLI VTSGLFSPTR GGDLARRLGQ RLAVPVVALE
SPRGLRDPAQ PGLRQRMADA DLVISLGKSV DYMLDFGRAT SADCGWIVVE PEDGAGEEAV
RNLGPKLRRL IAADPRALAQ GLAELPEAGC DAGRRDWCAR FGRRPAPPVL DETPGPIDSG
LLCATLSDVL GADGGETILV SDGGEFGQWA QALVQADRRL INGPAGGIGG ALGHAVAASL
ACPEARVAVA SGDGSIGFHL AELETAVRAG AAFVVVIGND RRWNAEHLLQ IREFGPDRVH
GCELSGARYD LVAAALGGTG AHVTCRSELR GALRRAFAAG GVVLVNVEIE GRAAPSGEEP
EAAETAQPDG