Gene GWCH70_3056 details

Gene Information

Locus tag: GWCH70_3056
Symbol:
ID: 7977418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 3076784
End bp: 3077989
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 42%
IMG OID: 644799849
Product: polysaccharide deacetylase
Protein accession: YP_002950988
Protein GI: 239828364
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
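
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3076784
end_bp = 3077989

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1206, matching "Gene Length"
print(protein_length)  # 401, matching "Protein Length"
```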


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGTAACA AGGTACTAGG CAGCGTGTTA CTTGTTGCAC TTCTTCTTCT ATTGCCTGAT 
GGAAAGCATG CGAAAACAAA TCAAAACGTC TTTATTCAAA CCATCGGTCC ATCGGCACCC
ATTTATGCGT ATGTCGCAGA TGAGCTAAAA GAAGTAGGAA AAATTTATCG AGGTCGAGTA
TTTGAAGTAG TTGGGAAGAA TGAAACGTAT TATTTCATTC AATACGGTTT TGTGAAAGGA
ATGGTGAAAA AGCGAGACGT AAAACTTGGG AAGCCGTCGG AGTTTACGAA ACTAGCAGAT
CGCTTAAAAA GAAGTGTGAT TGGAACGAAA GATTTGCAAG TATATATTTT CCAAAATGGA
AATAAACAGT CCATTGGGAA GATTAATGCA GGTGTTCGTT ATCCAGTGAA GGCGGAAACG
CATCTTTTTT ATGTTGTTGA GTTTGGAGGA AGAGACGGGT ATATTTATAA ACATGTAACA
GAGAAAGATC CAGGAGTTCC AGTTTTGTTA TACCATCATA TCTTGGAAGA TAAAGATGAG
CATTTTTTCA AAGGCACGAC AACCGTCCCG TTAAGCCAGT TTAGGGATGA AATGAATTAC
TTAAAGAAGC AACATTATAC GACGATTACT CCGCAGCAAC TGTTGGCGTA CGTCAAGAAG
GAGCAGCTGC TTCCGCCAAA GTCGGTATTG ATTACATTTG ACGATGGGTT GAAATCGAAT
TATGTGTACG CGTATCCGTT ATTGAAACGG TTGAATTTTA AAGCGACCAT CTTTATGATT
ACAGGGCGGA TGCGACCGAG TCCGCAGCCG TTTGACCCGC GCGGCTTGCA GTTTTTAAGC
AAGCAGGAAG TAGCGGAAAT GAAGGATGTG TTTACGTTTG AAAGCCATAC GAGCCATTTT
CATTTATTCG ATAAAAAGCA CGGGCCTTAT TTGTTGTTTA AACCGTATGA CGAAATTATG
GCCGATTTAA AAGCAAGCAT CGCTAAAGTA AATGCGACGG CTATCGCGTA TCCGTTCGGC
GCGTATAACG GACGGGTGAT TCAAATCGTG AAAGACGCGG GATTTACGAT GGCGTTTACA
ACGAAAAAAG GCACGGTTTA TCCAGGTGAT CCAGTTTTTG AGTTAAAACG GCAATGGGTA
TATCCTCACA TTACCTTGCG GCAGTTTGAG CAGTTATTGA CTCCATCATC CAAAACCTCC
CCTTGA
 
Protein sequence
MSNKVLGSVL LVALLLLLPD GKHAKTNQNV FIQTIGPSAP IYAYVADELK EVGKIYRGRV 
FEVVGKNETY YFIQYGFVKG MVKKRDVKLG KPSEFTKLAD RLKRSVIGTK DLQVYIFQNG
NKQSIGKINA GVRYPVKAET HLFYVVEFGG RDGYIYKHVT EKDPGVPVLL YHHILEDKDE
HFFKGTTTVP LSQFRDEMNY LKKQHYTTIT PQQLLAYVKK EQLLPPKSVL ITFDDGLKSN
YVYAYPLLKR LNFKATIFMI TGRMRPSPQP FDPRGLQFLS KQEVAEMKDV FTFESHTSHF
HLFDKKHGPY LLFKPYDEIM ADLKASIAKV NATAIAYPFG AYNGRVIQIV KDAGFTMAFT
TKKGTVYPGD PVFELKRQWV YPHITLRQFE QLLTPSSKTS P
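
The gene starts with TTG while the protein starts with M. That is consistent with translation table 11, under which TTG is among the codons that can initiate translation and is then read as methionine. The sketch below shows how the two sequences relate and how a GC percentage can be computed. It is an illustration under stated assumptions, not the database's own pipeline: the helper names `translate` and `gc_content` are hypothetical, and the start codon set follows the NCBI definition of table 11 as I understand it.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
# Table 11 assigns the same amino acids; it differs in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Assumption: initiation codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, reading an initiating codon as M."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = ("TTGAGTAACA AGGTACTAGG CAGCGTGTTA "
              "CTTGTTGCAC TTCTTCTTCT ATTGCCTGAT")
print(translate(first_line))  # MSNKVLGSVLLVALLLLLPD
```

Passing the full gene sequence to `translate` and `gc_content` in the same way would allow a comparison against the Protein sequence and GC content fields of this record.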