Gene RPB_3345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3345 
Symbol 
ID3911147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3825959 
End bp3827737 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content66% 
IMG OID637885248 
Productacetolactate synthase 3 catalytic subunit 
Protein accessionYP_486952 
Protein GI86750456 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.615353 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0219835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA GCACCAGCCA CGATCCCAAC CAGATGACCG GCGCCGCGAT GATCGTCCGC 
GCGATGAAGG ATCACGGCGT CGAGCACATC TTCGGCTATC CCGGCGGCGC GGTGCTTCCG
ATCTATGACG AGATCTTCCA GCAGTCCGAC GTGCAGCACA TCCTGGTCCG GCACGAGCAG
GGCGCCGGCC ATGCCGCCGA GGGCTATGCG CGCTCGACCG GCAAGCCGGG CGTGGTGCTG
GTGACCTCGG GCCCGGGCGC CACCAACATG GTGACGCCGC TGGCCGACGC GCTGATGGAT
TCGATCCCGA TCGTCTGCAT CACCGGGCAG GTGCCGACGC ATCTGATCGG CAATGACGCG
TTCCAGGAAT GCGACACCGT CGGCATCACC CGGCCTTGCA CCAAGCACAA TTGGTTGGTG
CGCGACATCA AGGATCTCGC CCGCGTGCTG CACGAGGCGT TCTACGTCGC GACCTCCGGC
CGTCCGGGCC CGGTCGTGGT CGACGTGCCG AAGGACGTGC AGTTCGCCAC CGGCACCTAT
CATCCGCCGC GCAAGGACGA CGTCCACGTC TCCTACAAGC CGGTCACCAA GGGCGATCCG
AACCAGATCC GCAAGGCGGT GGCGATGCTC GCCGGCGCCA AGCGGCCGGT GATCTATTCC
GGCGGCGGCG TGGTCAATTC CGGCGACGAA GCCTGCCGGC TGCTGCGCGA GCTGGTCGAG
GTCACCGATT TCCCGATCAC CTCGACGCTG ATGGGGCTCG GCGCCTATCC GGCGTCGGGC
AAGAACTGGC TCGGCATGCT CGGCATGCAC GGCACCTACG AAGCCAACAT GACGATGCAT
GGCTGCGACG TCATGCTGTG CATCGGCGCG CGCTTCGACG ACCGCATCAC CGGCCGCACC
GACGCGTTCG CGCCGCACGC CAAGAAGATC CACATCGACA TCGACCCGTC GTCGATCAAC
AAGAACATCC GCGTCGACGT GCCGATCATC GGCGACGTCG CCAGCGTGCT GACCGATCTG
CTCGCGGTGT TCAAGGCCGA GGCGAAGAAG CCCGACATCA AGCCGTGGTG GCACCAGGTC
GCGACCTGGC GCGCGCGCAA TTCGCTGGCC TACAAGAAGA ACAACGACCT GATCATGCCG
CAATACGCGA TCCAGCGGCT TTACGAGGCG ACGCGTGGCC GCGACACCTA CATCACCACC
GAAGTCGGCC AGCATCAGAT GTGGGCGGCG CAGTTCTACG GCTTCGAACA GCCGAAGCGC
TGGATGACCT CGGGCGGCCT CGGCACCATG GGCTACGGCC TGCCGGCGGC GCTCGGCGTC
CAGGTGGCGC ATCGCGATGC GCTGGTGATC GACATCGCCG GCGACGCCTC GGTGCAGATG
ACGATGCAGG AGATGGCGAC GGCGGTGCAG TACGAACTGC CGATCAAGAT CTTCATCCTC
AACAACCAGT ATATGGGGAT GGTGCGGCAG TGGCAGCAGC TGTTGCATGG TAATCGGCTG
TCGCACTCCT ACACCGAGGC GATGCCGGAC TTCGTCAAGC TCGCCGAGGC CTATGGCGGC
GTCGGCATGC AGGTGACCAA GCCGGCCGAT CTCGACGGCG CGATCATGGA CATGATCAAG
GTCAACAAGC CGGTGCTGTT CGACTGCCGC GTCGCCGCGC TGGAGAACTG CTTCCCGATG
ATCCCGTCCG GCAAGGCCCA CAACGAGATG CTGCTGCCGG CCGAAGCCAC CGACGAAGCC
ACCGCGGCGG CCTTCGCCGG CGGCAAGGCG CTGGTGTGA
 
Protein sequence
MSDSTSHDPN QMTGAAMIVR AMKDHGVEHI FGYPGGAVLP IYDEIFQQSD VQHILVRHEQ 
GAGHAAEGYA RSTGKPGVVL VTSGPGATNM VTPLADALMD SIPIVCITGQ VPTHLIGNDA
FQECDTVGIT RPCTKHNWLV RDIKDLARVL HEAFYVATSG RPGPVVVDVP KDVQFATGTY
HPPRKDDVHV SYKPVTKGDP NQIRKAVAML AGAKRPVIYS GGGVVNSGDE ACRLLRELVE
VTDFPITSTL MGLGAYPASG KNWLGMLGMH GTYEANMTMH GCDVMLCIGA RFDDRITGRT
DAFAPHAKKI HIDIDPSSIN KNIRVDVPII GDVASVLTDL LAVFKAEAKK PDIKPWWHQV
ATWRARNSLA YKKNNDLIMP QYAIQRLYEA TRGRDTYITT EVGQHQMWAA QFYGFEQPKR
WMTSGGLGTM GYGLPAALGV QVAHRDALVI DIAGDASVQM TMQEMATAVQ YELPIKIFIL
NNQYMGMVRQ WQQLLHGNRL SHSYTEAMPD FVKLAEAYGG VGMQVTKPAD LDGAIMDMIK
VNKPVLFDCR VAALENCFPM IPSGKAHNEM LLPAEATDEA TAAAFAGGKA LV