Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_2745 |
Symbol | |
ID | 8754417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 2856346 |
End bp | 2858295 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | thiamine pyrophosphate protein central region |
Protein accession | YP_003409748 |
Protein GI | 284991194 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGACT ACCCGCCGCC GGCACGGACC GACACCGTGC GGCTCACCGT GTCCCAGGCC GTCGTGCGCT TCCTCGCGCA GCAGCACACC GAGCGCGACG GGCAGCGCCA GCGGCTGTTC GCCGGCTGCT TCGGCATCTT CGGGCACGGC AACGTCGCCG GGCTCGGCCA GGCGCTGCTG CAGGCCGAGA TCGACGAGCC CGGCGCGCTG CCCTACGTGC TGGGCCGCAA CGAGCAGGCC GTCGTGCACA CCGCCGTCGC CTATGCCCGG GCCAAGGACC GCCTGCAGAC CTGGGCGGTC TCCACCAGCG TCGGCCCCGG CTCGACGAAC ACGCTGACCG GCGCGGCGCT GGCGACCATC AACCGGCTGC CGGTGCTGCT GCTGCCCGCC GACACCTTCG CCACCCGCTC GGCCGGCCCG CTGCTGCAGG AGCTGGAGCA GCCCCACAGC GGCGACGTGA CCGTCAACGA CGCGTTCCGC CCGGTGTCGC GCTACTTCGA CCGCATCTGG CGGCCCGAGC AGCTGCCCGG CGCGCTGCTC GGCGCGATGC GGGTGCTCAC CGACCCGGCC GAGACCGGCG CGGTCACCCT CTGCTTCCCG CAGGACGTGC AGGCCGAGGC GTTCGACTGG CCGGTCGAGC TGTTCGCCGA GCGCACCTGG CACGTGGGCC GGCCGCTGCC CGAGCCGGCC GCCCTGCAGC GCGCGGTCGA GGTGATCCGG TCGGCCCGCC GGCCGCTGGT CGTCGCCGGC GGCGGCGTCA TCTACTCGCA GGCCACCGAG GCGCTGGACG CCTTCGTGTC GGCCACCGGC ATCCCGGTCG GGCAGACCCA GGCCGGCAAG GGCGCGCTGC CCTACGACCA CCCGCAGTCG GTGGGGGCGA TCGGCTCCAC CGGGACGACG GCGGCCAACG CCCTGGCTCG CGAGGCCGAC GTCGTCATCG GGATCGGCAC CCGCTACCAG GACTTCACCA CCGCGTCGCA CACCGCGTTC AACGACCCCG GCGTGCGGTT CGTCAACGTC AACGTCGCCT CGATGGACGC CGTCAAGCAC GCCGGGGTGG GCGTGCAGGC CGACGCCCGC GAGACGCTCA CCGCGCTGAC CGACGCCCTG GCCGGCTGGT CGGTCGACGA CGCCCACCGG CAGCGCACCG CCGAGCTCGC CGCGCAGTGG GAGGCCACCG TCGAGGCCGC CTACCACCCG GAGGTGCCCA GCGCGGGCGA GGGCGGCCGG CTCACCCAGA ACGAGGTCAT CGGCATCGTC AACGAGGTCT CCGCCCCGCG GGACGTCGTC GTCTGCGCCG CGGGCTCGAT GCCCGGTGAC CTGCACAAGA TGTGGCGGGT CCGCGACCGC AAGGGCTACC ACGTCGAGTA CGGCTACTCG ACCATGGGGT ACGAGATCCC CGGCGGCATC GGCATCCGGA TGGCCAGCCC CGACCGGGAC GTCTTCGTGA TGATCGGCGA CGGGTCCTAC CTGATGATGC CGACCGAGCT GGTCACCGCC GTCCAGGAGG GCGTCAAGAT CGTCGTCGTC CTGGTGCAGA ACCACGGCTT CGCCTCGATC GGCGCGCTGT CGGAGCAGCT GGGGTCGCAG CGGTTCGGCA CCCGCTACCG CTACCGCTCG GAGTCCGGCC GGCTCGACGG CGACGTGCTG CCGGTGGACC TCGCCGCCAA CGCCGCGAGC CTCGGCGTCG ACGTCCTGGT CGCGCAGGAC GGGCCGAGCT TCGAGGCGGC GCTGCGCAAG GCCAAGGCCA GCGACCGGAC GACGGTCGTG CACGTCGAGA CCGACCCGCT GATCGACGCC CCCTCCAGCG AGTCCTGGTG GGACGTGCCG GTCAGCGAGG TCTCCACGCT GTCCAGCACC CAGGAGGCCC GCGCGGTCTA CGACCGCTGG AAGGCCGTGC AGCGGCCCGC CTACCTCGCG CCGTCCGAGC GCGACACCCC GCAGTCCTGA
|
Protein sequence | MSDYPPPART DTVRLTVSQA VVRFLAQQHT ERDGQRQRLF AGCFGIFGHG NVAGLGQALL QAEIDEPGAL PYVLGRNEQA VVHTAVAYAR AKDRLQTWAV STSVGPGSTN TLTGAALATI NRLPVLLLPA DTFATRSAGP LLQELEQPHS GDVTVNDAFR PVSRYFDRIW RPEQLPGALL GAMRVLTDPA ETGAVTLCFP QDVQAEAFDW PVELFAERTW HVGRPLPEPA ALQRAVEVIR SARRPLVVAG GGVIYSQATE ALDAFVSATG IPVGQTQAGK GALPYDHPQS VGAIGSTGTT AANALAREAD VVIGIGTRYQ DFTTASHTAF NDPGVRFVNV NVASMDAVKH AGVGVQADAR ETLTALTDAL AGWSVDDAHR QRTAELAAQW EATVEAAYHP EVPSAGEGGR LTQNEVIGIV NEVSAPRDVV VCAAGSMPGD LHKMWRVRDR KGYHVEYGYS TMGYEIPGGI GIRMASPDRD VFVMIGDGSY LMMPTELVTA VQEGVKIVVV LVQNHGFASI GALSEQLGSQ RFGTRYRYRS ESGRLDGDVL PVDLAANAAS LGVDVLVAQD GPSFEAALRK AKASDRTTVV HVETDPLIDA PSSESWWDVP VSEVSTLSST QEARAVYDRW KAVQRPAYLA PSERDTPQS
|
| |