Gene Information |
Locus tag | RPB_4466 |
Symbol | |
ID | 3912282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5056019 |
End bp | 5058007 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637886369 |
Product | transketolase |
Protein accession | YP_488060 |
Protein GI | 86751564 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0021] Transketolase |
TIGRFAM ID | [TIGR00232] transketolase, bacterial and yeast |
|
|
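The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly, but it is the convention that reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5056019
end_bp = 5058007

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.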
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.913477 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGG TCGATCATTC CCGTATGGCG AACGCAATCC GGGCGCTGGC GATGGACGCG GTCGAGAAGG CCAAATCCGG CCACCCCGGC CTGCCGATGG GCGCCGCCGA CGTCGCCACA GTGCTGTTCA CCGAGTTCCT GAAATTCGAC GCCGCCGATG CGCACTGGCC GGATCGCGAC CGCTTCATCC TGTCCGCCGG CCACGGCTCG ATGCTGCTCT ATGCGCTGTT GTACCTCACC GGTAATTCCG AGCTGACGCT CGACCAGATC AAGGCGTTCC GCCAGCTCGA CTCCAAGACG CCCGGGCACC CGGAGAACTG CATCACAGAT TCGGTTGAAA CCACCACCGG CCCGCTCGGC CAGGGCGTGG CGTCGTCGGT CGGCACCGCG CTGGCGGAGC GTCTGCTCGC CGCCGAATTC GGCGAGATCG TCGATCACAC CACCTACGTG CTGTGCTCGG ACGGCGATCT GATGGAAGGC GTGAGCCACG AGGCGATCGC GCTGGCCGGC CATCTCAGGC TGTCGAAGCT GATCTTCCTC TACGACGACA ACGGCATCTC GATCGACGGC CCGCTGACGC TGACCGACAA TGTCGATCAG GTCGCGCGCT TCCAGGCGCA TGGCTGGAAT GCGATGCGGA TCGACGGCCA CGATCACAAG GCGATCGCCG AGGCGATCAA GGCCGCGAAA GCCTCCGACC GGCCGACCAT GATCGCCTGC AAGACCACGA TCGGCTTCGG CGCCCCCACC CGGGCCGGGA CCTCCAAGGC GCATGGCGAG CCGCTCGGCG CCGAAGAACT GGCCGGCGCC AAGAAGGCGC TGGGCTGGGA CTACGGCCCG TTCGAAATTC CGGACGACGT GCTCTCCGCT TGGCGCGCGG TCGGCGCCAA GGGCGCCAAG GCCCACGCGG AATGGCAGTC GAAGTTCGAC GCGATGGACA AGGAGCTGCG CGCCGAATTC CAGCGCCGGG TGATCGACCG CAAGCGGCCG GCGGCGCTCG ACGGCGCGAT CCGCAAGCTC AAGGACAGGC TCGTCGCGGA GCCGCAGACC ATCGCCACCC GCAAGGCCAG CGAGCTGGCG CTGGAGGCCA TCGTCGAGGT GGTGCCGGAA ATGCTGCTCG GCTCGGCGGA CCTGACGCCG TCCAACAACA CCCGCACCAA ACACGCGAAA GACGTCACCC CGGACGACTT CTCGGGTCGC TACATCCATT ACGGCATCCG CGAAATGGGC ATGGCGGCGG CGATGAACGG TATCGCGATG CATGGCGGTT TCGCGCCGGC CGGCGGCACC TTCATGTGCT TCGCCGACTA CGCCCGCCCG TCGATGCGGA TCGCGGCGCT GTCGCATGTC CCGGTGGTCT ACATCATGAC CCATGATTCG ATCGGGCTCG GCGAAGACGG CCCGACGCAT CAGCCGGTCG AGCACCTCGC TTCGCTGCGG GCGATGCCCA ATATGCGGGT GTTCCGCCCG GCCGACCCGG TCGAGACCGC CGAATGCTGG CAGCTCGCGC TGGAGAACAC CAAGGGCCCG ACGGTGCTGG CGCTGTCGCG GCAGAATCTG ACGCCGGTGC GCACCAGCAA ATCCGACGAC AACCGCTGCG CCCGGGGCGC CTATGAGCTG ATCGCCGCCG ACGGCAAGGC TCAGGTGACG ATCTTCGCCA CCGGCTCCGA GGTCGAGATC GCGGTCGCGG CGCACAAGCT GCTCGCCGCC AAGGGCATCG CTGCGCGCGT GGTGTCGGTG CCGTCGCTGG ATCTCTTGCT GCAGCAGGAC GACGCCACCC GCAAGGCGAT CATCGGCGAC 
GCCCCGGTCA AGGTCGCGGT CGAGGCCGCG GTGCGGTTCG GCTGGGACGC GGTGATCGGC CCGGAGGGCG GCTTCATCGG CATGTCGAGC TTCGGCGCCA GCGCGCCCGC GAAGGATCTG TACAAGCATT TCGGAATTAC CGCCGAGGCG GTCGCAGAGG CTGCGGCAAG CCGTCTCGGC GGCAAGTAA
|
Protein sequence | MAKVDHSRMA NAIRALAMDA VEKAKSGHPG LPMGAADVAT VLFTEFLKFD AADAHWPDRD RFILSAGHGS MLLYALLYLT GNSELTLDQI KAFRQLDSKT PGHPENCITD SVETTTGPLG QGVASSVGTA LAERLLAAEF GEIVDHTTYV LCSDGDLMEG VSHEAIALAG HLRLSKLIFL YDDNGISIDG PLTLTDNVDQ VARFQAHGWN AMRIDGHDHK AIAEAIKAAK ASDRPTMIAC KTTIGFGAPT RAGTSKAHGE PLGAEELAGA KKALGWDYGP FEIPDDVLSA WRAVGAKGAK AHAEWQSKFD AMDKELRAEF QRRVIDRKRP AALDGAIRKL KDRLVAEPQT IATRKASELA LEAIVEVVPE MLLGSADLTP SNNTRTKHAK DVTPDDFSGR YIHYGIREMG MAAAMNGIAM HGGFAPAGGT FMCFADYARP SMRIAALSHV PVVYIMTHDS IGLGEDGPTH QPVEHLASLR AMPNMRVFRP ADPVETAECW QLALENTKGP TVLALSRQNL TPVRTSKSDD NRCARGAYEL IAADGKAQVT IFATGSEVEI AVAAHKLLAA KGIAARVVSV PSLDLLLQQD DATRKAIIGD APVKVAVEAA VRFGWDAVIG PEGGFIGMSS FGASAPAKDL YKHFGITAEA VAEAAASRLG GK
|
| |
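The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a hedged illustration, not the database's own pipeline: it builds a codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted), strips the 10-character grouping spaces used in the record, and translates the first 30 bases and the final 9 bases of the gene. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the grouping spaces and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
gene_prefix = "ATGGCGAAGG TCGATCATTC CCGTATGGCG"
gene_suffix = "GGCAAGTAA"

protein_prefix = translate(gene_prefix)
protein_suffix = translate(gene_suffix)
print(protein_prefix)
print(protein_suffix)
```

The translated prefix corresponds to the first ten residues of the listed protein sequence, and the suffix ends at the TAA stop codon. To check the whole record, the full gene sequence would be passed to `translate` after joining its two lines.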