Gene GWCH70_2121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2121 
Symbol 
ID7976932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2190532 
End bp2191659 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content43% 
IMG OID644798937 
Productglycosyl transferase group 1 
Protein accessionYP_002950097 
Protein GI239827473 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTAA AAATAGGGAT TGTTTGTTAT CCAACAGTAG GCGGCTCTGG TGTCGTTGCG 
ACCGAATTAG GGAAATTGCT GGCAGAAAAA GGGCATGAAA TTCATTTTAT TTCTTCGAGC
ATGCCGTTTC GTCTCAATAA AGTGTATGGC AACATTTATT ATCATGAAGT GGGCGTCAAT
CAATATTCCG TATTTCAATA TCCGCCATAT GATTTAGCTC TGGCAAGCAA AATTGCTGAA
GTGGCGAAGC GGGAGCGGCT CGATGTGTTG CATGCCCACT ATGCCGTTCC GCATGCGGTT
TGCGCTGTAT TGGCGAAGCA AATGGTAGGC GGGAAATTAA AGATTGTTAC GACATTGCAC
GGAACGGATA TTACGGTGCT TGGATATGAT CCATCGTTGA GCGATATGAT TAAATTTGGC
ATTGAACAAT CAGATGTTGT CACCGCCGTT TCGAATGCGC TTGTCAAGCA GACGTATGAG
CTTCTTGACG TACAAAAACC GATCCAAACC GTCTATAACT TTGTGGACGA GCGTGTATAT
CACAAAAAAA ATGCCAATCA TTTAAAGAAA GAATATGGGA TTGATGAGAA CGAAAAAGTC
ATCATTCATG TATCCAACTT TCGAAAAGTC AAGCGGGTTC CTGATGTTGT GCGCGCTTTT
TCTCTCATTC GCAAGCATCT GCCTGCGAAA CTGCTGCTTG TCGGCGATGG ACCGGAAATG
ACTGTCGTCA GCCGCCTTGT GACAGAGCTT GGACTTAGTG ATGATGTACG CTTTTTAGGA
AAACAAGACA AGCTCGATGA ATTATATTCG ATTAGCGATG TGAAGATGCT ATTATCAGAA
AAAGAAAGCT TTGGTCTTGT GCTATTAGAA GCGATGGCCT GCGGCGTTCC TTGCATTGGT
ACGACGATCG GCGGCATTCC TGAAGTGATT GAAGACGGTA AAACAGGGTT TTTATGTGAG
CTTGGAAATG TGGAAGAAGT GGCAAATAAA GCGCTTCGCA TTTTAACAGA CAAACATCTT
CACATGTATA TGGCCAAGCA GGCGGTTCAA ACGGTATATC AAAAATTTTA TTCGGAACAA
ATTGTGGAAC AATATGAAGA TATTTATTTT TCATTGGCAA AGGGGTGA
 
Protein sequence
MKLKIGIVCY PTVGGSGVVA TELGKLLAEK GHEIHFISSS MPFRLNKVYG NIYYHEVGVN 
QYSVFQYPPY DLALASKIAE VAKRERLDVL HAHYAVPHAV CAVLAKQMVG GKLKIVTTLH
GTDITVLGYD PSLSDMIKFG IEQSDVVTAV SNALVKQTYE LLDVQKPIQT VYNFVDERVY
HKKNANHLKK EYGIDENEKV IIHVSNFRKV KRVPDVVRAF SLIRKHLPAK LLLVGDGPEM
TVVSRLVTEL GLSDDVRFLG KQDKLDELYS ISDVKMLLSE KESFGLVLLE AMACGVPCIG
TTIGGIPEVI EDGKTGFLCE LGNVEEVANK ALRILTDKHL HMYMAKQAVQ TVYQKFYSEQ
IVEQYEDIYF SLAKG