Gene Information |
Locus tag | RPB_3802 |
Symbol | |
ID | 3911605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4338928 |
End bp | 4340730 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885703 |
Product | acetolactate synthase large subunit |
Protein accession | YP_487407 |
Protein GI | 86750911 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.880362 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGCTCT CCGACTACGT CATCGACTTC CTCGCGCAGC GCGGCGTCAG CCATGTGTTC GGCATTTCCG GCGGCGCGGC GGTGCATATG TTCGATTCGG CGGCGAAGCA TCCGGATGTC ACGCCGATCT TCCCGCAGCA CGAGCAGGCC GCCGCGATCG CCGCCGACGG CTACGCGCGC GCCACCGGCA AGCTCGGCGT CGCCATCACC ACCTCCGGCC CCGGCGCGAC CAATCTGCTG ACCGGGGTGT GCTGCGCGTA TTATGATTCA GTGCCGACGC TGATGATCAC CGGGCAGGTC GCGACGCATC GGCTCAAGGG CAACAACGAC GTCCGCCAGC TAGGCTTCCA GGAGACCGAC GTGACGTCGA TCTTCGCCAC GGTGACGAAA TATGCGGTGC AGATCTCCGA TCCCGCGACG ATCCGCTATC ATCTGGAAAA GGCCTACTAT CTCGCCTTCG AGGGCCGGCC CGGCGCGGTG CTGATCGATC TGCCGGACGA TCTGCAGCGC GCCGAGATCG ATCCGGAGGC GCTGGCGTCG TTCGTGCCGG AGACGCAGAT CGCCACGACC GATCTCGACG CCGAGATCGT CGCCTTGCTG CCGCTGATCG CGCAGGCGAA GCGACCGGTG CTGGTGCTGG GCGGCGGGCT GTCGACGCCG CGGATCGGCG CCGCGCTCGA TCAACTGATC GACCGGCTCG CCATGCCGGT GCTGACGACC TGGGCCGCGA CCGATCTGAT CGCGCATGAT CATCCGCTGC GGGTCGGGCC GTTCGGTGTT TACGGGCCGC GGCTCGGCAA TTTCACCGTG CAGAATGCCG ACCTCATTCT CTGCCTCGGC AGCCGGCTGT CGCAGAACGT CACCGGCGGC ATCCTGCCGT CGTTCGCGCG CGAGGCGACG ATCGTGATGG TCGACGCCAG CCGCGGCGAG ATGGACAAGT TCGACGCGCG CGGCATCGCC GTCGCGACGC GGATCGAGGC GCGGCTCGAC GGGTTCGTGC CGAAGCTGCT CGGAGCGATC GAGGCCGCGC CGCCGCGCGA CGAATGGCTG GCGCAGATCG CGCATTGGCG TAGCGCGCTG CCGGACGATC GTCCCGGTCC CGCGCCCGCC AATGCAGGCT TCGTGGACGC CTACGACTTC GTCGACAAGT TGAGCGAGAC CGCGCCCGCC GACGAGCTGA TCTATGTCGA CACCGGCGGC AATCTGACCT GGACCTGCAA CGGCTTCCGC ATCCAGCGCG GCCAGCGGCT GATCTCCGAC TGGAACAACA CCGCGATGGG CTATGCGCTG CCGGCCGCGA TCGGCGCGGC GGTGCAGGCG AAGGGCGGGG TGAGCTGCAT CATCGGCGAT GGCGGCCTGA TGCTGTCGCT CGGCGAACTG GCGCTGCTGT CGCGTCACAG GCTGCCGGTG CGGCTGTTCC TGTTCAACAA TCACGGGCAC GGCATCCAGA AGCAGACATT GGAGACCTGG CTCGACGGCA ACTATGTCGG CGTCGATGCG CCGAGCGGAT TGTCGTTCGT CGATGTCGCC AAGGTCGCCG AGGCGATGGA CCTGCCGGTG GTCACGATCA GCCGCAGCGC GGACATCGCC GCCAAGCTGC GCGAGGTCTA TGCGCGGCAG GGACCGGTGT TCTGCAATGT CGAGATCAAT CCCGCGCAGA AGCTGTATCC GGTGCTGAAA TTCGGCGCGC CGCTGGAAAG CCAGATGCCG GCGATCGACG ACGCGCTGAT CGCGCGCGAG ATGATCGTGC CGCCGTTCGT CCCCGGCGCG GCGCCGAAGC ACAGCGGCGG CGCGGGGGTG TGA
|
Protein sequence | MKLSDYVIDF LAQRGVSHVF GISGGAAVHM FDSAAKHPDV TPIFPQHEQA AAIAADGYAR ATGKLGVAIT TSGPGATNLL TGVCCAYYDS VPTLMITGQV ATHRLKGNND VRQLGFQETD VTSIFATVTK YAVQISDPAT IRYHLEKAYY LAFEGRPGAV LIDLPDDLQR AEIDPEALAS FVPETQIATT DLDAEIVALL PLIAQAKRPV LVLGGGLSTP RIGAALDQLI DRLAMPVLTT WAATDLIAHD HPLRVGPFGV YGPRLGNFTV QNADLILCLG SRLSQNVTGG ILPSFAREAT IVMVDASRGE MDKFDARGIA VATRIEARLD GFVPKLLGAI EAAPPRDEWL AQIAHWRSAL PDDRPGPAPA NAGFVDAYDF VDKLSETAPA DELIYVDTGG NLTWTCNGFR IQRGQRLISD WNNTAMGYAL PAAIGAAVQA KGGVSCIIGD GGLMLSLGEL ALLSRHRLPV RLFLFNNHGH GIQKQTLETW LDGNYVGVDA PSGLSFVDVA KVAEAMDLPV VTISRSADIA AKLREVYARQ GPVFCNVEIN PAQKLYPVLK FGAPLESQMP AIDDALIARE MIVPPFVPGA APKHSGGAGV
|
| |
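The numeric fields in this record are related by simple arithmetic, and the following minimal sketch shows one way to cross-check them. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene sequence ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 4338928
END_BP = 4340730


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("Gene length (bp):", length)
    print("Protein length (aa):", protein_length(length))
```

Under these assumptions the coordinates give 4340730 - 4338928 + 1 = 1803 bp, and 1803 / 3 - 1 = 600 aa, matching the Gene Length and Protein Length fields. Passing the Gene sequence string to `gc_percent` gives a value that can be compared against the GC content field. The sequence begins with GTG, which translation table 11 permits as an alternative start codon read as methionine, in line with the protein sequence starting with M.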