Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1936 |
Symbol | |
ID | 7978760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1994408 |
End bp | 1995403 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 644798765 |
Product | acyltransferase 3 |
Protein accession | YP_002949935 |
Protein GI | 239827311 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3594] Fucose 4-O-acetylase and related acetyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00021741 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAC GCGATTATTA CTTTGACAAT GCAAAATGCG CACTGATGCT TCTCGTCGTG TTCGGTCATT TTCTCCGCCC GTATATCGAT AGTGTACTAT GGGTGCATAG CTTATATATT TGGATTTTCT TTTTCCACAT GCCGGCGTTT ATTTTTATCT CAGGTTATTT TGCTAAAAAA TTTCATGAAC AAGGATATTT AAAAAAGATA ACAAAAAAAT TATTGCTTCC CTACCTTATT TTTCAACTGT TGTATTCGGT TTACTATTAC TTTTTATATC AAAAAGATTC CATTGAATTC GATTTGCTCA CTCCCCAATG GAGTCTATGG TTTTTAATTA GCTTATTTAG TTGGAATTTG TTATTATGGA TCTTTGCCAA AATCCCAAAA CACGTTTCAC TATTGCTTGC CCTTCTCTTA GGAGTAGGAG CGGGCATGAT CGAAGCAGAG AAATGGCTAA GTATATCGCG TACGTTTACG TTTTTCCCAT TCTTTTTATT AGGATTTTTC CTTCAAAAAA AGAATATTGA ACGATTATTT ACATGGCGCG TACGTTTATT TTCACTATTT GCACTAGTCG GGATGTTTAT CATGATTCAT TTCGGATTTC CTGATTTGCC GCAAGAATGG CTATACGGTT CCAAATCATA CGATACGCTC GGCGTACCAG AGGAAAAAGG AGTTTTCGTG CGCCTGGTTA TTTATGGCGC AAGCACTCTG ATGATGGTCA GTTTTCTCTC GCTTATTCCA AACCATCGTT TTTCTTTCTC GGTGTTAGGC GCAAGAACAT TTTATATTTA TATTTTGCAC GGGTTTATTT TAAAATACTT GCATGAAACA AAACTTCCAG ATTTTATTAT GAGCATTCAC GGTTACCCGC TATTACTAGG ATTGTCCGTC ATCGTTGTGC TTGTTCTTGG AAGCAAACCG ATTGTACGCA TTATGCGCCC ACTTTTGGAA TGGCGATTCC CAAAAAAGAA ACAGGCTGTT TCTTAA
|
Protein sequence | MSERDYYFDN AKCALMLLVV FGHFLRPYID SVLWVHSLYI WIFFFHMPAF IFISGYFAKK FHEQGYLKKI TKKLLLPYLI FQLLYSVYYY FLYQKDSIEF DLLTPQWSLW FLISLFSWNL LLWIFAKIPK HVSLLLALLL GVGAGMIEAE KWLSISRTFT FFPFFLLGFF LQKKNIERLF TWRVRLFSLF ALVGMFIMIH FGFPDLPQEW LYGSKSYDTL GVPEEKGVFV RLVIYGASTL MMVSFLSLIP NHRFSFSVLG ARTFYIYILH GFILKYLHET KLPDFIMSIH GYPLLLGLSV IVVLVLGSKP IVRIMRPLLE WRFPKKKQAV S
|
| |