Gene GWCH70_3120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_3120 
Symbol 
ID7976765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp3142217 
End bp3144100 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content45% 
IMG OID644799906 
ProductS-layer domain protein 
Protein accessionYP_002951045 
Protein GI239828421 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4193] Beta- N-acetylglucosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGGTTAC TTGTCAGTCT TTTTCTAGTT TTTTGCTTAT TTTTTGCCAA TATTTCCGTT 
TCATTCGCAG ATGATATTAC AGGAATTGCG CTTGAAAAAG AAATGCGCGC GATGGTGAAT
CAAGGAGTCG TTCAAGGATA TCCTGATGGC AAATATCGTC CGAACGAATC TGTCACTCGC
GGCCAGTTTG CTACGTTTGT CGCCAGAGCA TTGCAGCTTC CGAAAGGATC GGGGAACTTT
TCCGACGTGT CACCATCTTC CGAACTAGCC GACGGCATTT ACAAAGCAAG CGCGGCAGGG
ATCGTTCAAG GGTATTCGAA CGGCACGTTT GGCGTGAATA ATAAAATTAC AAGAGAAGAA
ATGGCAATAA TGATCGACCG AACGCTAGAT TATTTAGGAA TAGAGAAAAA GCAGGCATCG
CTTGACCATT TTACCGATGT GGATGGGCTC TATTCGGCTT CGAAAATCGC AATCTCTCAT
AATGTGTATT ATGGCATTAT TCGCGGAATT CCAAACACGG ATGGAACCAC GTTCCGATTT
GCTCCGAAAG CGTACGCGAC AAGAGCGCAC GCCGCTGCCT TTCTTTACCG CCTGCTTGAA
GTGTGGGCGG AGCAGACGCC GGAAATGGCG TATCAAATTG CTGAGATTCA AAACGGACAA
TTAACGCTGC TGCCAAAGCG CTATGCCACA TTTGCACAGG CGGAAGCGTC TGTTACGAAT
TGGTCTTCTC AAGTGGTGGT GCAAGGCACC AAAATTGTTA AGATGGCTAG CGGAAACGTT
ATTGCCCAGC CGTCTCCTGG AAAATCAACC ACAATAATCT ATCAATCTAA TTTAAGTACG
AGTCTTACGT ATGTCGCGCC AAATACAGAA ATGAAATATT TAGATGCCGA TGAAGAAAAA
GTAAAAGTGC AAATCGCCAA TACGATCGGG TATGTGAAAC AGTCGGAAGT GATGCTAGTG
CCAACGGCGT TGCTGCAAGG CCGGTCGTAT TACATTGCGC AAAAAGGGGA GTTGTATCAT
TACATTTACA ATACGACAAG CAACAAATAC GTAAGCTATC TGTACGGCAA AGCGCCGTCG
TTTATGCAAG ATGGGCAAAA GTATTACAGC TGGGATGGCG AAACGTTTTA TAACACAGCA
GGCAAGCCCA TCGGCACGGC GTATCAATAC TTTAACGTCC TGCCAATCCG TACGAAAACA
AATTATACGG CCGAAGAATT GAATCGGTTT GTCGCGGCAA ACCGTTCTGA CAGCCCATTA
AAAACATTAG GAGAAGCCTT TAAAAAGACG GAAGAAAAAT ATAACGTAAA CGCGCTTTAT
TTGTTGGCGC ACGCGATTCA TGAAAGCGAT TGGGGAACTA GCGAAATCGC GAAAGAGAAG
AAAAACTTAT TCGGTATTCG AGCGGTTGAT AGCGATCCAC TCAACAGTGC GGTGAAGTTT
AACACCTTCG AAGATTGCAT CAATTATATG GCACAGACGA TAGTATCGAA CAGATATGCC
AATCCAAAGG ACTGGCGGTA TAGTGGAGCG GTTCTTGGTG ATAAGACCAT TGGCTTTAAC
GTCCGTTATG CGTCTGACCC ATATTGGGGA CAAAAAATCG CTGGAATTAT GTATCGGGCT
GACAAATTTT TAGGCTGGAA AGACTGGGGC AAATATACGA TTATGGGAAC AACAGTAGAC
GGCGTGAACG TCCGATTGCA GCCAGCGGCG GTTCCGCCTG TATTCTATGA ATACCGTCTT
GCAAACACTC CTGTCGTTAA AATCGGGGAA ACAGCGAAAC AGCCGGACGG CTATGTATGG
TATCAAATTC TTTCCGATTT ACCGACGGAA GAGAATGCTT ATATCCGCAG CGACTTATTA
AAACCGCTGT CAATCGCAAA ATAA
 
Protein sequence
MRLLVSLFLV FCLFFANISV SFADDITGIA LEKEMRAMVN QGVVQGYPDG KYRPNESVTR 
GQFATFVARA LQLPKGSGNF SDVSPSSELA DGIYKASAAG IVQGYSNGTF GVNNKITREE
MAIMIDRTLD YLGIEKKQAS LDHFTDVDGL YSASKIAISH NVYYGIIRGI PNTDGTTFRF
APKAYATRAH AAAFLYRLLE VWAEQTPEMA YQIAEIQNGQ LTLLPKRYAT FAQAEASVTN
WSSQVVVQGT KIVKMASGNV IAQPSPGKST TIIYQSNLST SLTYVAPNTE MKYLDADEEK
VKVQIANTIG YVKQSEVMLV PTALLQGRSY YIAQKGELYH YIYNTTSNKY VSYLYGKAPS
FMQDGQKYYS WDGETFYNTA GKPIGTAYQY FNVLPIRTKT NYTAEELNRF VAANRSDSPL
KTLGEAFKKT EEKYNVNALY LLAHAIHESD WGTSEIAKEK KNLFGIRAVD SDPLNSAVKF
NTFEDCINYM AQTIVSNRYA NPKDWRYSGA VLGDKTIGFN VRYASDPYWG QKIAGIMYRA
DKFLGWKDWG KYTIMGTTVD GVNVRLQPAA VPPVFYEYRL ANTPVVKIGE TAKQPDGYVW
YQILSDLPTE ENAYIRSDLL KPLSIAK