Gene RPB_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3802 
Symbol 
ID3911605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4338928 
End bp4340730 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content68% 
IMG OID637885703 
Productacetolactate synthase large subunit 
Protein accessionYP_487407 
Protein GI86750911 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.880362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCTCT CCGACTACGT CATCGACTTC CTCGCGCAGC GCGGCGTCAG CCATGTGTTC 
GGCATTTCCG GCGGCGCGGC GGTGCATATG TTCGATTCGG CGGCGAAGCA TCCGGATGTC
ACGCCGATCT TCCCGCAGCA CGAGCAGGCC GCCGCGATCG CCGCCGACGG CTACGCGCGC
GCCACCGGCA AGCTCGGCGT CGCCATCACC ACCTCCGGCC CCGGCGCGAC CAATCTGCTG
ACCGGGGTGT GCTGCGCGTA TTATGATTCA GTGCCGACGC TGATGATCAC CGGGCAGGTC
GCGACGCATC GGCTCAAGGG CAACAACGAC GTCCGCCAGC TAGGCTTCCA GGAGACCGAC
GTGACGTCGA TCTTCGCCAC GGTGACGAAA TATGCGGTGC AGATCTCCGA TCCCGCGACG
ATCCGCTATC ATCTGGAAAA GGCCTACTAT CTCGCCTTCG AGGGCCGGCC CGGCGCGGTG
CTGATCGATC TGCCGGACGA TCTGCAGCGC GCCGAGATCG ATCCGGAGGC GCTGGCGTCG
TTCGTGCCGG AGACGCAGAT CGCCACGACC GATCTCGACG CCGAGATCGT CGCCTTGCTG
CCGCTGATCG CGCAGGCGAA GCGACCGGTG CTGGTGCTGG GCGGCGGGCT GTCGACGCCG
CGGATCGGCG CCGCGCTCGA TCAACTGATC GACCGGCTCG CCATGCCGGT GCTGACGACC
TGGGCCGCGA CCGATCTGAT CGCGCATGAT CATCCGCTGC GGGTCGGGCC GTTCGGTGTT
TACGGGCCGC GGCTCGGCAA TTTCACCGTG CAGAATGCCG ACCTCATTCT CTGCCTCGGC
AGCCGGCTGT CGCAGAACGT CACCGGCGGC ATCCTGCCGT CGTTCGCGCG CGAGGCGACG
ATCGTGATGG TCGACGCCAG CCGCGGCGAG ATGGACAAGT TCGACGCGCG CGGCATCGCC
GTCGCGACGC GGATCGAGGC GCGGCTCGAC GGGTTCGTGC CGAAGCTGCT CGGAGCGATC
GAGGCCGCGC CGCCGCGCGA CGAATGGCTG GCGCAGATCG CGCATTGGCG TAGCGCGCTG
CCGGACGATC GTCCCGGTCC CGCGCCCGCC AATGCAGGCT TCGTGGACGC CTACGACTTC
GTCGACAAGT TGAGCGAGAC CGCGCCCGCC GACGAGCTGA TCTATGTCGA CACCGGCGGC
AATCTGACCT GGACCTGCAA CGGCTTCCGC ATCCAGCGCG GCCAGCGGCT GATCTCCGAC
TGGAACAACA CCGCGATGGG CTATGCGCTG CCGGCCGCGA TCGGCGCGGC GGTGCAGGCG
AAGGGCGGGG TGAGCTGCAT CATCGGCGAT GGCGGCCTGA TGCTGTCGCT CGGCGAACTG
GCGCTGCTGT CGCGTCACAG GCTGCCGGTG CGGCTGTTCC TGTTCAACAA TCACGGGCAC
GGCATCCAGA AGCAGACATT GGAGACCTGG CTCGACGGCA ACTATGTCGG CGTCGATGCG
CCGAGCGGAT TGTCGTTCGT CGATGTCGCC AAGGTCGCCG AGGCGATGGA CCTGCCGGTG
GTCACGATCA GCCGCAGCGC GGACATCGCC GCCAAGCTGC GCGAGGTCTA TGCGCGGCAG
GGACCGGTGT TCTGCAATGT CGAGATCAAT CCCGCGCAGA AGCTGTATCC GGTGCTGAAA
TTCGGCGCGC CGCTGGAAAG CCAGATGCCG GCGATCGACG ACGCGCTGAT CGCGCGCGAG
ATGATCGTGC CGCCGTTCGT CCCCGGCGCG GCGCCGAAGC ACAGCGGCGG CGCGGGGGTG
TGA
 
Protein sequence
MKLSDYVIDF LAQRGVSHVF GISGGAAVHM FDSAAKHPDV TPIFPQHEQA AAIAADGYAR 
ATGKLGVAIT TSGPGATNLL TGVCCAYYDS VPTLMITGQV ATHRLKGNND VRQLGFQETD
VTSIFATVTK YAVQISDPAT IRYHLEKAYY LAFEGRPGAV LIDLPDDLQR AEIDPEALAS
FVPETQIATT DLDAEIVALL PLIAQAKRPV LVLGGGLSTP RIGAALDQLI DRLAMPVLTT
WAATDLIAHD HPLRVGPFGV YGPRLGNFTV QNADLILCLG SRLSQNVTGG ILPSFAREAT
IVMVDASRGE MDKFDARGIA VATRIEARLD GFVPKLLGAI EAAPPRDEWL AQIAHWRSAL
PDDRPGPAPA NAGFVDAYDF VDKLSETAPA DELIYVDTGG NLTWTCNGFR IQRGQRLISD
WNNTAMGYAL PAAIGAAVQA KGGVSCIIGD GGLMLSLGEL ALLSRHRLPV RLFLFNNHGH
GIQKQTLETW LDGNYVGVDA PSGLSFVDVA KVAEAMDLPV VTISRSADIA AKLREVYARQ
GPVFCNVEIN PAQKLYPVLK FGAPLESQMP AIDDALIARE MIVPPFVPGA APKHSGGAGV