Gene GWCH70_1770 details


Gene Information

Locus tag: GWCH70_1770
Symbol:
ID: 7978686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1843267
End bp: 1844601
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 39%
IMG OID: 644798610
Product: protein of unknown function DUF21
Protein accession: YP_002949782
Protein GI: 239827158
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
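
The start, end, gene length and protein length fields above are mutually consistent. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. Both are assumptions that fit the listed values; the record does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1843267, 1844601)
print(length)                  # 1335
print(protein_length(length))  # 444
```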


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAGATAG TTAATTTACT GATGGTTGGT GTTTTAATTG CTTTAACGGC ATTCTTTGTT 
GCATCCGAAT TTGCCATTGT TAAAGTACGC AGTACACGTA TTGATCAACT GATAGCTGAA
GGAAATCGAA ACGCTAGGGC TGCAAAACGA GTCATTAGCA ATTTAGATGA ATATTTATCG
GCTTGCCAGC TTGGTATTAC TATTACCGCA TTGGGGCTTG GTTGGCTTGG TGAACCAACG
GTGGAGCATT TGCTTCACCC TGTATTTGAA CGAATGAATT TAAGTGAGTC CGTAGCGTCT
TTCCTTTCTT TTGCGATTGC ATTTGCAACG ATTACCTTTT TGCATGTGGT CGTTGGAGAA
CTGGCTCCGA AAACATTGGC AATTCAAAAA GCAGAAACCA TTACGCTGTT ATGTTCAAGA
CCTTTAATCT TTTTCTACAA AATAATGTAT CCGTTTATTT GGGCATTAAA CGGCTCTGCT
CGCGTCATTA CCGGATTATT TGGGTTAAAG CCAGCATCCG AACATGAAGT GGCCCATTCA
GAGGAAGAAT TGCGCTTGAT TTTATCTGAA AGTTATAAAA GTGGAGAAAT TAATCAGTCG
GAATATAAAT ACGTAAACAA TATTTTTGAG TTTGACGATC GGATTGCCAA AGAAATCATG
GTACCGCGTA CAGAAATTGT CGCACTGGAT AAAAACCGTT CCATTGCGGA ATATTTTGAA
ACGATAAAGC AAGAAAAATA TACGCGCTAT CCTGTTATAG ACGGAGATAA AGACCATATC
ATTGGGATGG TCAACATAAA AGAAATATTA ACGGATTGTA TTCAAAATCC AAAGGCCACT
GAAAAGAAGT TAGGTGATTA TATCCGCCCG ATCATACAAG TAATCGAATC CATCCCTATT
CATGACTTGC TCGTAAAAAT GCAGCGTGAA CGAGTTCATA TGGCGATCTT AGTTGATGAA
TACGGCGGAA CAGCAGGGCT TGTGACGGTT GAGGATATAT TAGAGGAAAT TGTCGGTGAG
ATTCAAGATG AGTTTGATAT TGACGAAGTT CCGATGATTC GCAAAGTGAA CGAACACACC
ACCATTGTTG ATGGAAAAGT ATTGATTGAA GATGTAAATG ACTTACTTGG AACGGATATC
GATGATACGG ATGTTGATAC GATTGGTGGT TGGATTTTAA CAGAAAAATT TGATATTCAA
CAAGGCGGCA TTATTTCCTA CGGCGATTAT GAATTTAAGG TATTAAAAAT GGAAGGGCAT
CACGTCCAAT TAGTTGAAAT CACAAAACGC GTGAAGGCTC CTTCTCTTAT TGTACAAGAA
GCGGCAATAG AATAG
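
The gene sequence is printed in blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the fields in the record. The following is a minimal sketch of that cleanup with hypothetical helper names. Only the first line of the sequence is used here; paste the full sequence to check the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C, between 0.0 and 1.0."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as listed above.
first_line = "TTGGAGATAG TTAATTTACT GATGGTTGGT GTTTTAATTG CTTTAACGGC ATTCTTTGTT"
seq = clean_sequence(first_line)
print(len(seq))                        # 60
print(round(100 * gc_content(seq)))    # GC percentage of this line only
```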
 
Protein sequence
MEIVNLLMVG VLIALTAFFV ASEFAIVKVR STRIDQLIAE GNRNARAAKR VISNLDEYLS 
ACQLGITITA LGLGWLGEPT VEHLLHPVFE RMNLSESVAS FLSFAIAFAT ITFLHVVVGE
LAPKTLAIQK AETITLLCSR PLIFFYKIMY PFIWALNGSA RVITGLFGLK PASEHEVAHS
EEELRLILSE SYKSGEINQS EYKYVNNIFE FDDRIAKEIM VPRTEIVALD KNRSIAEYFE
TIKQEKYTRY PVIDGDKDHI IGMVNIKEIL TDCIQNPKAT EKKLGDYIRP IIQVIESIPI
HDLLVKMQRE RVHMAILVDE YGGTAGLVTV EDILEEIVGE IQDEFDIDEV PMIRKVNEHT
TIVDGKVLIE DVNDLLGTDI DDTDVDTIGG WILTEKFDIQ QGGIISYGDY EFKVLKMEGH
HVQLVEITKR VKAPSLIVQE AAIE
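
The gene sequence begins with TTG, yet the protein begins with M. This is expected under translation table 11, where TTG is one of the permitted initiation codons and an initiation codon is read as methionine. The sketch below illustrates this on the first and last few codons of the gene. It builds the codon table by hand rather than using a bioinformatics library, and the names `CODON_TABLE`, `START_CODONS` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI TCAG order; table 11 uses the same amino acid
# assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(text: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiation codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("TTGGAGATAG TTAATTTACT G"))  # MEIVNLL (start of the protein)
print(translate("GCGGCAATAG AATAG"))         # AAIE (end of the protein)
```

Because the first-codon rule applies to whatever fragment is passed in, a fragment taken from the middle of the gene should only be translated this way if its first codon is not in `START_CODONS`, as is the case for the final line used above.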