Gene GWCH70_0616 details

Gene Information

Locus tag: GWCH70_0616
Symbol:
ID: 7978805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 681160
End bp: 682428
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 40%
IMG OID: 644797603
Product: protein of unknown function DUF21
Protein accession: YP_002948777
Protein GI: 239826153
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
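
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; both are assumptions about the record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 681160
end_bp = 682428

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1269 0 422
```

Under these assumptions the result agrees with the listed gene length (1269 bp) and protein length (422 aa).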


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000839014
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGAGAGT TGCCTCTGAG TCTGTTAGGG TGGTTTTTTC TTTGTATCGT GTTAGTTGCC 
TTCTTTTCTT CAGTGGAAGC CGCATTTTCT TCAGCGAATA AAATTCGCTT GAAAAATTAT
GTGGAAGAAA ATCACCGTGG CAGCAAGCGA GTAAATTACA TTATGGAAAA CTTAGATCGC
GTCTTATTGA CGGTGCTTGT TGCAAACAGA GTCACAAGTA TTGTTGCCGT TGCGTTTTTA
GCAGATATTG CGACAACGAT GCTTGGGGAG CGTGCAGGTC TTATTGCTGC GATTATTGTC
ATGACCGTGC TTCTTTTAAT CTTCGGTGAA ATTTTGCCAA AATCGATTGC GAAAGAGCAT
GCGGAATCGC TATCGATTCG TTATGCCGGA ATTGTTTACG CATTGATGAA ACTGCTTTCA
CCCATCACGA CATTATTTAA CGCTGTAAAA GAGAGCGTAG CGAAACGGTT TACGAATGGA
ACGGTTGTTC CAGCAGTAAC GGAAGAGGAA ATTAAAGTGA TGATTGATTT AAGTGAAGAA
GAAGGCATTA TTGACAACAA AGAAAAAGAA TTAATTCACC GTTCGCTCGA TTTTGATGAA
ATTTTAGTTG GAGAAATTTT TACGCCACGG TCAGATATGG TCGCTGTGGA AGTAAATCAG
CCGATCGAAG CAATTCGCGA TGTTTTTCTT GAAGAAAAGT ACTCCCGTAT ACCGGTGTAT
GAGGAAGATA TTGATAATGT GATTGGTATT TTATCTGAAA GGGACTTTTT TAGTGAGCTT
GTACAACAAA AAGATATAAA CATTCGCGCG TTATTACGCA AGCCGCTGTT TGTCGTGGAA
TCAATGAAAA TTTCTGATTT GTTGCCGGAA CTTCAAAAAA GCAGAGTGCA TATGGCAATT
GTCATTGATG AGTTTGGAGG AACAGCAGGA TTAATTACGC TTGAAGACAT TATTGAACAA
ATTGTTGGAG AAATATGGGA TGAGCATGAT GAAGCGGTAA AAAATATTCA ACAAATCGAT
GAAAACAGTT ATGAATTTAA CGCTGAACTT CCGCTTGATG AATTTTGCGA AATAATGAAA
ATTGATGCAC CGGAAAGCTC TTCCCATACG TTAGGCGGTT GGATATTTGA AATGTTTGAA
CGCGTGCCGA ATGTCGGCGA AACGCTGCAT TATGGTCCGC TTACTTTAAC CGTACGACAA
GTCGAAAATC GCAGAATTAG GAAAGTACTT GTTTCATTAA ATGAGCCGCC GTTAGTGGAA
AATATGTAA
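
The record reports a GC content of 40% for this gene. As a minimal sketch of how such a figure could be recomputed from the sequence as listed (in space-separated blocks of ten bases), the hypothetical helper `gc_percent` below strips whitespace and counts G and C bases. How the source database rounds or computes its own value is not stated, so treat this as an illustration rather than the database's method.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence as listed in this record (60 bases).
first_line = "TTGGGAGAGT TGCCTCTGAG TCTGTTAGGG TGGTTTTTTC TTTGTATCGT GTTAGTTGCC"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence (all lines joined) instead of `first_line` gives the value to compare against the reported 40%.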
 
Protein sequence
MGELPLSLLG WFFLCIVLVA FFSSVEAAFS SANKIRLKNY VEENHRGSKR VNYIMENLDR 
VLLTVLVANR VTSIVAVAFL ADIATTMLGE RAGLIAAIIV MTVLLLIFGE ILPKSIAKEH
AESLSIRYAG IVYALMKLLS PITTLFNAVK ESVAKRFTNG TVVPAVTEEE IKVMIDLSEE
EGIIDNKEKE LIHRSLDFDE ILVGEIFTPR SDMVAVEVNQ PIEAIRDVFL EEKYSRIPVY
EEDIDNVIGI LSERDFFSEL VQQKDINIRA LLRKPLFVVE SMKISDLLPE LQKSRVHMAI
VIDEFGGTAG LITLEDIIEQ IVGEIWDEHD EAVKNIQQID ENSYEFNAEL PLDEFCEIMK
IDAPESSSHT LGGWIFEMFE RVPNVGETLH YGPLTLTVRQ VENRRIRKVL VSLNEPPLVE
NM
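
The protein begins with M although the gene sequence starts with TTG, which normally encodes leucine. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), where TTG is one of the alternative initiation codons and is read as methionine at the first position. The sketch below is a small, dependency-free illustration of that rule; the function name `translate_cds` is hypothetical, and the code assumes an unambiguous A/C/G/T sequence in the correct reading frame.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest,
# base order T, C, A, G). Table 11 assigns the same amino acids as the
# standard code; it differs in the set of allowed start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # alternative start codons initiate with Met
        else:
            amino_acid = CODON_TABLE[codon]  # KeyError on non-ACGT symbols
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGGGAGAGT TGCCTCTGAG TCTGTTAGGG"))  # MGELPLSLLG
# Last three codons of the gene sequence (treated here as a fragment).
print(translate_cds("AATATGTAA"))  # NM
```

Both outputs match the corresponding ends of the protein sequence listed above.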