Gene GWCH70_0521 details

Gene Information

Locus tag: GWCH70_0521
Symbol:
ID: 7978236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 591286
End bp: 592632
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 41%
IMG OID: 644797522
Product: protein of unknown function DUF21
Protein accession: YP_002948696
Protein GI: 239826072
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
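
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this), and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 591286
end_bp = 592632
gene_length_bp = 1347
protein_length_aa = 448

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# One codon per residue, plus one stop codon that is not translated.
codons = span // 3

print(span)        # 1347
print(span % 3)    # 0
print(codons - 1)  # 448
```

Under those assumptions the span equals the reported gene length, divides evenly into codons, and yields the reported protein length once the stop codon is subtracted.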


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGACATAG TTAACTTGTT AGTGGTAGCA TTGCTTATTG CATGTACCGC TTTTTTCGTA 
GCTTCGGAAT TTGCGATTGT CAAAGTTCGC AGCTCGCGCA TTGACCAATT AGTTAGCGAA
GGAAATAAAC GGGCGATTGC TGCCAAAAAG GTGATCTCCA ACCTTGATGG CTATTTGTCG
GCGAACCAAT TAGGCATTAC GATTACATCG TTAGGACTTG GTTGGCTTGG TGAACCGACG
GTTGAACGGA TGCTAACACC GCTTTTCGAA CGCATCCATT TATTAGAATC GGTTTCGCAC
GTTTTATCCT TCGTTATTGC GTTTTCGACG ATTACATTTC TTCACGTCGT TGTCGGCGAG
CTGGCTCCAA AAACGTTTGC CATCCACAAA GCGGAGGCGA TTACGCTGCT TACAGCCCAA
CCGCTTATTT TATTTTATAA AGTGATGTAT CCGTTTATTT GGGCGCTCAA CAATTCCGCG
CGCCTCGTCG CCAGAATGTT TGGGTTAAAG CCAGCGGCAG AACATGAAAT TGCCCATTCT
GAAGAAGAGT TGCGCCTTAT TTTATCAGAA AGTTACAAAA GTGGAGAGAT TAACCAATCA
GAATATCGAT ATGTGAACAA TATTTTCCGA TTTGATGATC GGGTTGCAAA AGAAATTATG
GTACCGCGCA AAGAAATTGT TGCGCTCGAT ATTAATCGAA GCGTGAAAGA GAATTTGGAA
ATTATTAAAG AGGAAAAATA TACTCGTTAT CCAGTTATTG ATGGCGATAA AGATCACGTT
CTTGGACTTA TTAATGTGAA AGAAGTGTTT ACCGATCTTG TGACAAATCC ATCCGAAGAA
AAACAAATGA AAGATTATAT CCGCCCAATC ATTCAAGTGA TTGAATCGAT CGCTATTCAT
GATTTGCTTG TGAAAATGCA GAAAGAACGC ATCCACATGG CCATTTTAGT CGACGAATAT
GGCGGAACAT CGGGGCTTGT TACCGTCGAG GATATTTTAG AAGAAATCGT TGGAGAAATT
CAAGACGAGT TTGACGTAGA TGAAATCCCG TTGATTCAAA AAGTTGATGA AACACGTACA
ATTATCGACG GGAAAGTGCT GATTAGCGAA GTGAACGATT TGTTTGGCCT TTCCATTGAT
GATGAGGATG TCGATACGAT TGGAGGATGG ATTTTAACGA AGCATTATGA TATTAAAGTC
GGCGATAGCG TCGAAATCGA TAATTACTTG TTTACGGTGA AGGAGATGGA TGGTCACCAC
GTGAAGACGA TAGAAGTAGT AAAACAGGAG AAAGAAGAGA AAGCGGCAGA TCATGAACTC
GGTGAAAAAG AGGAATTGCA TTTATGA
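
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC fraction of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first twenty bases, so its result is not expected to match the 41% reported for the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence above.
fragment = clean_sequence("TTGGACATAG TTAACTTGTT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.3
```

Passing the full cleaned gene sequence to `gc_fraction` gives the value to compare against the reported GC content.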
 
Protein sequence
MDIVNLLVVA LLIACTAFFV ASEFAIVKVR SSRIDQLVSE GNKRAIAAKK VISNLDGYLS 
ANQLGITITS LGLGWLGEPT VERMLTPLFE RIHLLESVSH VLSFVIAFST ITFLHVVVGE
LAPKTFAIHK AEAITLLTAQ PLILFYKVMY PFIWALNNSA RLVARMFGLK PAAEHEIAHS
EEELRLILSE SYKSGEINQS EYRYVNNIFR FDDRVAKEIM VPRKEIVALD INRSVKENLE
IIKEEKYTRY PVIDGDKDHV LGLINVKEVF TDLVTNPSEE KQMKDYIRPI IQVIESIAIH
DLLVKMQKER IHMAILVDEY GGTSGLVTVE DILEEIVGEI QDEFDVDEIP LIQKVDETRT
IIDGKVLISE VNDLFGLSID DEDVDTIGGW ILTKHYDIKV GDSVEIDNYL FTVKEMDGHH
VKTIEVVKQE KEEKAADHEL GEKEELHL
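
The protein sequence begins with M although the gene sequence begins with TTG. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the alternative initiation codons and is read as methionine at the start of a CDS, while it encodes leucine elsewhere. The following is a minimal sketch of such a translation. The function name `translate_cds` is hypothetical and this is not the database's own pipeline.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 11 in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon in first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGACATAG TTAACTTGTT AGTGGTAGCA"))  # MDIVNLLVVA
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate_cds("GAATTGCATTTATGA"))  # ELHL
```

Both fragments agree with the corresponding ends of the protein sequence listed above. Note that the second TTG in the first fragment is translated as L because it is not in the first position.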