Gene RSP_4011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_4011 
SymbolilvB2 
ID3712050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007488 
Strand
Start bp35085 
End bp36866 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content68% 
IMG OID640069327 
Productsulfoacetaldehyde acetyltransferase 
Protein accessionYP_345194 
Protein GI77404620 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR03457] sulfoacetaldehyde acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0930701 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATGA CCACCGAAGA GGCCTTCGTG AAGGTCCTGC AGATGCACGG CATCGAACAT 
GCGTTCGGGA TCATCGGCTC GGCCATGATG CCGGTGTCGG ATCTGTTCCC GAAGGCCGGG
ATCACCTTCT GGGATTGCGC CCACGAGACG AACGCGGGCC TGATGGCCGA TGGCTTCACC
CGCTCGACGG GCAAGATGTC GATGGCCATC GCCCAGAACG GCCCCGGCGT CACCGGCTTC
GTGACGCCGG TCAAGACCGC CTACTGGAAC CATACGCCGC TTCTGCTGGT GACGCCGCAG
GCGGCCAACA AGACCATCGG GCAGGGCGGC TTCCAGGAGA TGGAGCAGAT GCGGCTCTTT
GCCGATTGCG TCTGCTATCA GGAGGAGGTG CGCGACGCCT CGCGCATCCC CGAGGTGCTG
AACCGGGTGA TCCTGCAGGC CTGGCGCAAC AGCGCGCCGG CGCAGATCAA CATTCCCCGG
GACATGTGGA CCCAGGTCAT CGATGTCGAG CTGCCGCAGA TCGTGGCCTT CGAGCGGCCC
GCGGGCGGCG AGGAGGCGGT GGCCGAGGCG GCGCGGCTTC TGTCGGGGGC GCGCTTCCCG
GTGATCCTCT CCGGGGCGGG CGTGGTGCTC TCGGGCGCGA TCCCGGATCT CGCGCGGCTG
GCCGAGCGGC TCGACGCGCC GGTGGCCTCG AACTACCAGC ACAACGACAG CTTCCCCGGC
AGCCATCCGC TCGCCGTGGG CCCTCTGGGC TACAACGGCT CGAAGGCCGC GATGGAGCTG
ATCGCCCGCG CCGACGTGGT GCTGGCGCTC GGCACGCGGC TCAATCCCTT CTCGACGCTG
CCGGGCTACG GCATCGACTA CTGGCCGCGC GAGGCCAGGA TCATTCAGGT CGACATCAAT
GCCGACCGGA TCGGGCTGAC GAAGAAGGTC ACCGTGGGCA TTCAGGGCGA TGCGGCCAAG
GTGGCGCGCG CGATCCTGGC CCAGCTGGGC GAGGGCGCGG GCGATGCGGG CCGCGAGGAG
CGGCGGCATC TCGTGGCGCA GACCAAGTCG CGGTGGGCGC AGGAGCTGTC GAGCCTCGAC
CATGAAGAGG ACGATCCGGG CACCGAATGG AACGCGGGCG CGCGCACGCG CGATGCCGAT
CTGATGAGCC CGCGGCAGGC CTGGCGCGCG ATCATGCAGG CGGTGCCGGC CGAGGCCATC
GTCTCGTCCG ACATCGGCAA CAACTGCGCC ATCGGCAACG CCTATCCGAG CTTCGAGGCC
GGGCGGAAAT ATCTGGCGCC GGGTCTCTTC GGCCCTTGCG GTTACGGCTT CCCGGCGATC
CTCGGCGCCA AGATCGGCAA TCCCGACACG CCGGTGATCG GCTTTGCGGG CGACGGCGCC
TTCGGCATCT CGATGAACGA GATGACCGCC TGCGGCCGCG AGGACTGGCC CGCCATCACC
ATGGTGATCT TCCGCAACTA CCAGTGGGGC GCGGAAAAGC GCAACACGAC GCTGTGGTAC
GACAACAACT TCGTGGGCAC CGAGCTCGAC CGCGACACGA GCTATGCGGC CATCGCCCGG
GCCTGCGGCG CGCATGGGGT GCAGGTGCGC AGCCAGTCCG AACTGACGGC GGCCTTGCAC
GAGGCGGTCG AGCGGCAGAT GAAGGCGCGA GAGACCACCT TCATCGAGGT GCTGCTCAAT
CAGGAGCTGG GCGAGCCCTT CCGCCGCGAC GCGATGAAGA AGCCGGTGGT GGTGGCGGGG
ATCGACCCGG CCGACATGCG CCCGCAGAAG GGCGCGGCCT GA
 
Protein sequence
MRMTTEEAFV KVLQMHGIEH AFGIIGSAMM PVSDLFPKAG ITFWDCAHET NAGLMADGFT 
RSTGKMSMAI AQNGPGVTGF VTPVKTAYWN HTPLLLVTPQ AANKTIGQGG FQEMEQMRLF
ADCVCYQEEV RDASRIPEVL NRVILQAWRN SAPAQINIPR DMWTQVIDVE LPQIVAFERP
AGGEEAVAEA ARLLSGARFP VILSGAGVVL SGAIPDLARL AERLDAPVAS NYQHNDSFPG
SHPLAVGPLG YNGSKAAMEL IARADVVLAL GTRLNPFSTL PGYGIDYWPR EARIIQVDIN
ADRIGLTKKV TVGIQGDAAK VARAILAQLG EGAGDAGREE RRHLVAQTKS RWAQELSSLD
HEEDDPGTEW NAGARTRDAD LMSPRQAWRA IMQAVPAEAI VSSDIGNNCA IGNAYPSFEA
GRKYLAPGLF GPCGYGFPAI LGAKIGNPDT PVIGFAGDGA FGISMNEMTA CGREDWPAIT
MVIFRNYQWG AEKRNTTLWY DNNFVGTELD RDTSYAAIAR ACGAHGVQVR SQSELTAALH
EAVERQMKAR ETTFIEVLLN QELGEPFRRD AMKKPVVVAG IDPADMRPQK GAA