Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3118 |
Symbol | |
ID | 7976763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 3139422 |
End bp | 3140594 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644799904 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002951043 |
Protein GI | 239828419 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.353961 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAT TATTAGCTAC CGTTTTCGAT TATCCTCACG CTGGAGGATT ATCGACACAT GTTACAACAT TAAAAGCAGG GTTGGAAGCA AGGGGACACG AAGTCGACGT TCTCTCATTC AGCGATATTA GTCCTGTAAG CCAAAATCTA CTTGCGAAAG GGCCAAGCTT TATTTTGAAC AAACTATCGA AAGGCAGTGG AATTATTTGG AGCCACCACG TCCGGCAAAA AATGCTTTAT CGCTTGATCA AACAGCATAA ATCGAAAGGA TACGACATCA TTAACGCTCA AGAAGTATTC GCAACGCTTG CAGCGGTAGA AACGGGGATT CCGACCGTCA CGACCGTGCA TGGATATATG ACATACGAAG CAATTAGTCG CGGCTCTGTT TTGGAAGGAA GCCGGCAAGC CCATTATTTA TTACAAAAGG AAGTTGAAGC CTATACAAAA ACGAGGAAAA TTGTTACGGT CGACCAGCGC ATTAAAAATT ATGTTTTTGA GAAAGCAGGC GTAGAGGCGA CAGCGATTCG CAACTTTATT GACATTAACA GTTTTAAGCC AGATAAAGAA AATCGGCTCG CTTACCGCCG CAAACATGGC TTTGCCGAGG ATACGAATAT TATTTTCGTT CCAAGGCGCC TGACAAAGAA AAACGGCGTC ATTTATCCTG TACTCGCGCT TCCACAAGTG CTCGAAAAAT ATCCGAATAC GATGCTCGTC TATGCCGGCA TGGGGGAAGC ATTCCAAGAA TTAAAATCAC TGATCCATGA AAAAGGATTA GAAGAGAAAA CAAAATTGCT AGGAGCGATT CCGCACGAAG CGATTAAAGA GTATTATGCG CTATCGGATA TCGTCCTTGT GCCAAGCGTT CATTCAGCGG GCGTAGAGGA AGCGACATCC ATTTCCGCTC TCGAAGCGAT GGGATCCGGC TCTCCGCTCA TCGCTTCCGC CGTCGGCGGT CTAAAAGAAA TTGTCCGCCA CGAACAAGAT GGTCTTCTGG TGGAAGAGAA AAACGTCGAT CAGCTTGCCC AAGCAATCAT TTACTTGCTT GATCATCCGG AAATGGGACA AAAGTTCGCG AAAGAAGCAA GACGAAAAAT TGAAGAAGAA TATTCCCACT TGGCTGCAGC GAAAAAGTAT GAAGAAATTT ATGCAGCAGC GTTGCAGCAA TAA
|
Protein sequence | MKILLATVFD YPHAGGLSTH VTTLKAGLEA RGHEVDVLSF SDISPVSQNL LAKGPSFILN KLSKGSGIIW SHHVRQKMLY RLIKQHKSKG YDIINAQEVF ATLAAVETGI PTVTTVHGYM TYEAISRGSV LEGSRQAHYL LQKEVEAYTK TRKIVTVDQR IKNYVFEKAG VEATAIRNFI DINSFKPDKE NRLAYRRKHG FAEDTNIIFV PRRLTKKNGV IYPVLALPQV LEKYPNTMLV YAGMGEAFQE LKSLIHEKGL EEKTKLLGAI PHEAIKEYYA LSDIVLVPSV HSAGVEEATS ISALEAMGSG SPLIASAVGG LKEIVRHEQD GLLVEEKNVD QLAQAIIYLL DHPEMGQKFA KEARRKIEEE YSHLAAAKKY EEIYAAALQQ
|
| |