Gene GWCH70_1936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1936 
Symbol 
ID7978760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1994408 
End bp1995403 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content37% 
IMG OID644798765 
Productacyltransferase 3 
Protein accessionYP_002949935 
Protein GI239827311 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3594] Fucose 4-O-acetylase and related acetyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00021741 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC GCGATTATTA CTTTGACAAT GCAAAATGCG CACTGATGCT TCTCGTCGTG 
TTCGGTCATT TTCTCCGCCC GTATATCGAT AGTGTACTAT GGGTGCATAG CTTATATATT
TGGATTTTCT TTTTCCACAT GCCGGCGTTT ATTTTTATCT CAGGTTATTT TGCTAAAAAA
TTTCATGAAC AAGGATATTT AAAAAAGATA ACAAAAAAAT TATTGCTTCC CTACCTTATT
TTTCAACTGT TGTATTCGGT TTACTATTAC TTTTTATATC AAAAAGATTC CATTGAATTC
GATTTGCTCA CTCCCCAATG GAGTCTATGG TTTTTAATTA GCTTATTTAG TTGGAATTTG
TTATTATGGA TCTTTGCCAA AATCCCAAAA CACGTTTCAC TATTGCTTGC CCTTCTCTTA
GGAGTAGGAG CGGGCATGAT CGAAGCAGAG AAATGGCTAA GTATATCGCG TACGTTTACG
TTTTTCCCAT TCTTTTTATT AGGATTTTTC CTTCAAAAAA AGAATATTGA ACGATTATTT
ACATGGCGCG TACGTTTATT TTCACTATTT GCACTAGTCG GGATGTTTAT CATGATTCAT
TTCGGATTTC CTGATTTGCC GCAAGAATGG CTATACGGTT CCAAATCATA CGATACGCTC
GGCGTACCAG AGGAAAAAGG AGTTTTCGTG CGCCTGGTTA TTTATGGCGC AAGCACTCTG
ATGATGGTCA GTTTTCTCTC GCTTATTCCA AACCATCGTT TTTCTTTCTC GGTGTTAGGC
GCAAGAACAT TTTATATTTA TATTTTGCAC GGGTTTATTT TAAAATACTT GCATGAAACA
AAACTTCCAG ATTTTATTAT GAGCATTCAC GGTTACCCGC TATTACTAGG ATTGTCCGTC
ATCGTTGTGC TTGTTCTTGG AAGCAAACCG ATTGTACGCA TTATGCGCCC ACTTTTGGAA
TGGCGATTCC CAAAAAAGAA ACAGGCTGTT TCTTAA
 
Protein sequence
MSERDYYFDN AKCALMLLVV FGHFLRPYID SVLWVHSLYI WIFFFHMPAF IFISGYFAKK 
FHEQGYLKKI TKKLLLPYLI FQLLYSVYYY FLYQKDSIEF DLLTPQWSLW FLISLFSWNL
LLWIFAKIPK HVSLLLALLL GVGAGMIEAE KWLSISRTFT FFPFFLLGFF LQKKNIERLF
TWRVRLFSLF ALVGMFIMIH FGFPDLPQEW LYGSKSYDTL GVPEEKGVFV RLVIYGASTL
MMVSFLSLIP NHRFSFSVLG ARTFYIYILH GFILKYLHET KLPDFIMSIH GYPLLLGLSV
IVVLVLGSKP IVRIMRPLLE WRFPKKKQAV S