Gene RPD_2098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2098 
Symbol 
ID4022580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2348381 
End bp2350159 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content64% 
IMG OID637962291 
Productacetolactate synthase 3 catalytic subunit 
Protein accessionYP_569234 
Protein GI91976575 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.477274 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGT CCAAGAGCCA CGATCCGAAC CAGATGACCG GCGCCGCGAT GATCGTTCGC 
GCGATGATCG ATCATGGCGT GCAGCATCTT TTCGGCTATC CGGGCGGCTC GGTCCTTCCG
ATCTATGACG AGCTGTTCCA GCAGGATCAG TTGCAGCACA TCCTGGTCCG CCACGAGCAG
GGCGCCGGCC ACGCCGCCGA GGGCTATGCG CGCTCGACCG GCAAGCCCGG CGTGGTGCTG
GTGACGTCCG GTCCCGGCGC AACCAACATG GTGACGCCGC TTGCCGACGC GCTGATGGAT
TCGATCCCGC TGGTGTGCAT CACCGGCCAG GTGCCGACGC ATCTGATCGG CAACGACGCC
TTCCAGGAAT GCGACACCGT CGGCATCACC CGTCCTTGCA CCAAGCACAA CTGGCTGGTG
CGGGACATCA AGGATCTGGC CCGCGTGCTG CACGAGGCGT TCTACGTCGC CAGCTCCGGA
CGGCCGGGGC CGGTGGTGGT CGACGTGCCG AAGGACGTTC AGTTCGCGGT CGGCACCTAT
CATCCGCCGC GCAAGGCCGA CGTGCATATT TCCTACACGC CGCAGGTGAA GGGCGATCCC
GCCGCGATCC GCAAGGCGGT TGCGCTGTTG GCCGGCGCGA AGCGGCCGGT GATCTATTCC
GGCGGCGGCG TGGTCAATTC CGGCGACGAG GGCTGCAGGC TGCTCCGCGA ACTGGTCGAG
GTCACCGACT TCCCGATCAC CTCGACCTTG ATGGGGCTCG GCGCCTATCC GGCGTCCGGC
AAGAACTGGC TCGGAATGCT GGGGATGCAC GGCACCTACG AAGCCAATAT GACGATGCAT
GATTGCGACG TCATGCTGTG CATCGGCGCG CGGTTCGACG ACCGTATCAC CGGCCGCACC
GACGCGTTTT CGCCGAACTC GAAGAAGATC CACATCGACA TCGATCCGTC GTCGATCAAC
AAGAACATCC GCGTCGACGT GCCGATCATC GGCGACGTCG CCAACGTGCT CGGCGATCTG
CTCGCGATGT TCAAGGCCGA GGCGAAGAAG CCCGACATCA AGCCGTGGTG GCAGCAGGTC
GCGACCTGGC GCGCGCGCAA TTCGCTCGCC TACAAGAAGA ACAACGACCT GATCATGCCG
CAATATGCGA TCCAGCGGCT GTACGAGGCG ACGCGGGGCC GCGACACCTA CATCACGACC
GAAGTGGGCC AGCATCAGAT GTGGGCGGCG CAGTTCTTCG GCTTCGAACA GCCGAAGCGG
TGGATGACCT CGGGCGGCCT CGGCACCATG GGCTACGGCC TGCCGGCCGC GCTCGGCGTG
CAGGTCGCGC ATCCGGACAG TCTGGTGATC GACATCGCCG GCGACGCCTC GGTGCAGATG
ACGATGCAGG AGATGGCCAC GGCGGTGCAG TACGAACTGC CGATCAAGAT CTTCATCCTC
AACAACCAGT ACATGGGGAT GGTGCGGCAG TGGCAGCAGC TCTTGCACGG CAACCGGCTG
TCGCATTCCT ACACCGAGGC GATGCCTGAC TTCGTCAAGC TCGCCGAGGC TTATGGCGGC
GTCGGCATGC AGGTGACCAA GCCCGGCGAT CTCGACGGCG CGATCATGGA CATGATCAAG
GTGAAGAAGC CGGTGCTGTT CGACTGCCGC GTCGCGGCGC TCGAGAACTG CTTCCCGATG
ATCCCGTCCG GCAAGGCGCA TAATGAAATG CTTCTGCCGT CGGAAGCGAC CGACGAAGCC
ACCGCAGCCG CCTTCGCTGG CGGCAAGGCG CTGGTGTGA
 
Protein sequence
MSESKSHDPN QMTGAAMIVR AMIDHGVQHL FGYPGGSVLP IYDELFQQDQ LQHILVRHEQ 
GAGHAAEGYA RSTGKPGVVL VTSGPGATNM VTPLADALMD SIPLVCITGQ VPTHLIGNDA
FQECDTVGIT RPCTKHNWLV RDIKDLARVL HEAFYVASSG RPGPVVVDVP KDVQFAVGTY
HPPRKADVHI SYTPQVKGDP AAIRKAVALL AGAKRPVIYS GGGVVNSGDE GCRLLRELVE
VTDFPITSTL MGLGAYPASG KNWLGMLGMH GTYEANMTMH DCDVMLCIGA RFDDRITGRT
DAFSPNSKKI HIDIDPSSIN KNIRVDVPII GDVANVLGDL LAMFKAEAKK PDIKPWWQQV
ATWRARNSLA YKKNNDLIMP QYAIQRLYEA TRGRDTYITT EVGQHQMWAA QFFGFEQPKR
WMTSGGLGTM GYGLPAALGV QVAHPDSLVI DIAGDASVQM TMQEMATAVQ YELPIKIFIL
NNQYMGMVRQ WQQLLHGNRL SHSYTEAMPD FVKLAEAYGG VGMQVTKPGD LDGAIMDMIK
VKKPVLFDCR VAALENCFPM IPSGKAHNEM LLPSEATDEA TAAAFAGGKA LV