Gene Information |
Locus tag | GWCH70_3114 |
Symbol | |
ID | 7976759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3134851 |
End bp | 3135960 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644799900 |
Product | glycosyl transferase group 1 |
Protein accession | YP_002951039 |
Protein GI | 239828415 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
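The coordinate and length fields above are mutually consistent, which a minimal sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the Protein Length excludes the stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3134851
end_bp = 3135960

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1110 bp) and Protein Length (369 aa).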
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000549793 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAGTAT TGCACGTCAT TAGCGGCGGG GAGACGGGAG GATCGCGCAA ACACGTCGTC ACGCTTTTAT CTAAATTCCC GAAGGAGACA GCAACGCTCG TGGTGTTTCA AGAAGGGCCG CTCGCGAAAG AAGCGCGGGA ACATCATATT GATGTCCGCC TTCTTCCGCA ATCGTCCCGC TATGATTTAT CGGTATTGCG CAAACTCGTC GAATTAATCC AAAACGGGCG TTACGACTTG CTGCATACAC ACGGCCCGAG AGCTAATTTA TATGGTGCGC TGATTAAGCG GAAAATCGGG ATTCCGTGGA TGACGACGAT TCATAGCGAT CCGCGTCTTG ATTTTATGAA ATCGGGATTA AAAGGGTTCC TTTTTACCCG CTTAAATTTA TGGGCGCTAA AAAAAATCGA TTATTTCTTT GCCGTGTCAG AGCGGTTTAA AGACAATCTC GCCGCTTTCG GAATTCCTAA AGAGCGGATC AAGACGATTT ACAACGGCAT CGACTTCAAC GAAACATCTC CCAGCTGTCT TCTGCAACGA GCAGATGTTG GTGTGAACGC CGATGATTTT GTCATCGCCA TGGTCGCAAG ATTGCATCCT ATTAAAGGTC ATGCGCTCGT ATTTGAAGCG CTTCAATCGC TTCCGTATCG TGACATTAAG CTTTTAGTCG TTGGGGATGG TCCGCTTGAG CAGGAGCTAA AAGAAAAGGC GTCGGAGCTG CAAATTGAAG ATCGAGTGAA GTTTCTTGGC TTTCGTCGCG ATATCGCCGC GATTTACTCC CTCTCCGACG TCGCGCTCAT GGCTTCATAC AGCGAAAGCT TTCCGTTAGC CCTGCTCGAA GCTGCCAACG AACGTATTCC CGTCATTTCC ACCGATGTTG GCGGAGTTAG ACAGCTTATT GCCTCTAAGG AGATGGGATG GATCGTTCCT GTCGGCGACA GTGCGGCGCT GACTGAGGCA ATCAAAGAAG CGCGTGAAAA AAAGCAGCAA TTAAAGCAAA TGGGGCAGAC GTTATACGAA TACGCCTCTT CCCATTTTTC ATTAGATCGC TTGTACGAAG AAACAATCGC TACTTACAAA CATGTGCTGG AGAAATATCA TCAAAAGTAA
|
Protein sequence | MKVLHVISGG ETGGSRKHVV TLLSKFPKET ATLVVFQEGP LAKEAREHHI DVRLLPQSSR YDLSVLRKLV ELIQNGRYDL LHTHGPRANL YGALIKRKIG IPWMTTIHSD PRLDFMKSGL KGFLFTRLNL WALKKIDYFF AVSERFKDNL AAFGIPKERI KTIYNGIDFN ETSPSCLLQR ADVGVNADDF VIAMVARLHP IKGHALVFEA LQSLPYRDIK LLVVGDGPLE QELKEKASEL QIEDRVKFLG FRRDIAAIYS LSDVALMASY SESFPLALLE AANERIPVIS TDVGGVRQLI ASKEMGWIVP VGDSAALTEA IKEAREKKQQ LKQMGQTLYE YASSHFSLDR LYEETIATYK HVLEKYHQK
|
| |
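The relationship between the Gene sequence and Protein sequence rows can be illustrated with a short translation sketch. It builds the codon table from the standard base ordering used for NCBI genetic codes; the amino-acid assignments of translation table 11 are assumed here to match the standard code, with alternative initiation codons such as TTG read as methionine at the first position. The `translate` helper and the `COMMON_STARTS` set (a partial list, not the full set of table 11 initiators) are hypothetical names for this example, not part of the record.

```python
# Standard codon ordering: first, second, third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a partial set of initiation codons accepted in bacteria;
# an initiator at position 0 is translated as methionine.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in COMMON_STARTS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the Gene sequence row.
first_blocks = "TTGAAAGTAT TGCACGTCAT TAGCGGCGGG"
print(translate(first_blocks))  # MKVLHVISGG
```

The output matches the first block of the Protein sequence row. The Gene sequence is listed in coding orientation even though the gene lies on the minus strand, so no reverse complement is needed in this sketch.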