Gene Rsph17029_4016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4016 
Symbol 
ID4898569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp1160350 
End bp1162194 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content72% 
IMG OID640114619 
Productthiamine pyrophosphate enzyme, central region 
Protein accessionYP_001045866 
Protein GI126464753 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGGCA CGGACCAGGA CGACGCCGGG ACGCTGCGGC TGACGGCGGC GCAGGCGATG 
GTGCGCTGGC TCTCGGTGCA GCGGACCGAG GAGGGCGGGC GCTTCATCGA GGGCGTCTGG
GCGATCTTCG GCCATGGCAA TGTGGCGGGC CTGGGCGAGG CGCTGCAGGG GATCGGCGAC
GCGCTGCCCA CCTGGCGCGG GCAGAATGAA CAGACCATGG CCCATGCCGC CGTGGCCTTC
GCCAAGCAGA AGCGGCGGCG GCAGGCGATG GCCGTCACCT CCTCGATCGG GCCGGGCGCC
ACGAACATGG TCACGGCCGC GGCGCTGGCC CATGTGAACC GGCTGCCCGT GCTGTTCCTG
CCGGGCTGCG TCTTCGCCAA CCGCCGCCCC GACCCGGTGC TCCAGCAGAT CGAGGATTTC
GACGACGGCA CGGTGTCGGC CAACGACTGT TTCCGGCCGG TGGTGCGCTA TTTCGACCGG
ATCCAGCGGC CCGAGCATCT GCTGACGGCG CTGCCGCGGG CGCTGCAGGT GATGACGGAC
CCGGCGAACT GCGGCCCGGT CTGCCTCGCC TTCTGTCAGG ACGTGCAGGC CGAGGCCTGC
AACTATCCCA CGCGCTTCTT CGCCCCGCGC GTCTGGCGCA TCCGGCGGCC CGAACCCGAT
CCGGTCGAGG TCGAAGCGGT GGCGGTGCGG CTGCGCGGGG CGCGGCGGCC GGTGATCGTG
GCCGGAGGCG GCGTGCTCTA TTCCGGCGCC GAGGCGGAGC TTGCGGCCTT CTCGCAGCGC
CATCGGATCC CGGTGGTCGA GACGCAGGCC GGCAAGGGCA TCCTCGACTG GCAGGAGCCG
CTCAACTTCG GCTCGCCCGG CGTGACGGGA TCCGAGTGCG GCAACCGTCT GTGCGCCGAG
GCCGACGTCA TCCTCGGCGT GGGCACAAGG TTTCAGGATT TTACCACCGG CAGCCGGACG
ATCTTCGCCG AGGCCGACCT TCTGTCGGTC AACCTCCATC CCTACGATGC GCATAAGCAC
GGGGCGCTGC CGCTCGTCGC GGATGCGAAG GCCGCGCTGG CGCGCCTGAC GGAGGCGCTC
GGGGATGCGC AGTGGCCCGA GCCCGACGCG GCTCTGCGGG CGGACTGGTT TGCGGCCACC
GAGGCGGCGA TGGCGCGGCC CTCGGACAAT GCGCTGCCCA CGGACGCGCA GGTCATCGGC
GCGGTGCAGC GGGTCGCGAC CGCCCGGACG GTCGTCATGG GCGCCGCGGG CACGATGCCG
GGCGCGCTGC AGCTGCTCTG GCGGGCCTCG CCCGGCGGCT ATCACATGGA ATACGGCTAC
AGCTGCATGG GCTACGAGGT GGCCGGCGCC CTCGGAATCG CCCTGGCCGA GCCTGACCGC
GAGGTCATCT GCTTTGCAGG CGACGGCAGC TACATGATGG CCAACTCGGA ACTGGCGACC
GCGGTCATGC GGCGCGTGCC CTTCACGGTC GTTCTGACCG ACAACCGCGG CTATGGCTGC
ATCAACCGCC TGCAGCAGGC GTGCGGCGGC GCTCCGTTCA ACAACCTGTA TCGGGATGCG
CGCGTCGAAG CCCAGCCCGA GATCGATTTC GTGGGCCATG CCGCCGCGAT GGGCGCCCGT
GCGGTCAAGG CGGACGGCAT CCCCGCGCTC GAGGCCGAGA TCGTGGCCGC GCGCGGGCGC
GACCGGCCGA CCGTGATCGT CATCGAGACC GATCCGGAAC CGGGGACGGG TGTCGGCGGT
CACTGGTGGG ACGTGGCCGT GCCGCAGGCA GGAGAGGGCG CGCGGCTGGC AGAGGCCCAA
GCCCGCTACG CCACCCACGC CGCGCGGCAG CGCACCTTCG ATTAG
 
Protein sequence
MTGTDQDDAG TLRLTAAQAM VRWLSVQRTE EGGRFIEGVW AIFGHGNVAG LGEALQGIGD 
ALPTWRGQNE QTMAHAAVAF AKQKRRRQAM AVTSSIGPGA TNMVTAAALA HVNRLPVLFL
PGCVFANRRP DPVLQQIEDF DDGTVSANDC FRPVVRYFDR IQRPEHLLTA LPRALQVMTD
PANCGPVCLA FCQDVQAEAC NYPTRFFAPR VWRIRRPEPD PVEVEAVAVR LRGARRPVIV
AGGGVLYSGA EAELAAFSQR HRIPVVETQA GKGILDWQEP LNFGSPGVTG SECGNRLCAE
ADVILGVGTR FQDFTTGSRT IFAEADLLSV NLHPYDAHKH GALPLVADAK AALARLTEAL
GDAQWPEPDA ALRADWFAAT EAAMARPSDN ALPTDAQVIG AVQRVATART VVMGAAGTMP
GALQLLWRAS PGGYHMEYGY SCMGYEVAGA LGIALAEPDR EVICFAGDGS YMMANSELAT
AVMRRVPFTV VLTDNRGYGC INRLQQACGG APFNNLYRDA RVEAQPEIDF VGHAAAMGAR
AVKADGIPAL EAEIVAARGR DRPTVIVIET DPEPGTGVGG HWWDVAVPQA GEGARLAEAQ
ARYATHAARQ RTFD