Gene GWCH70_0079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0079 
Symbol 
ID7978518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp91475 
End bp92662 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content53% 
IMG OID644797039 
Producttransposase IS4 family protein 
Protein accessionYP_002948285 
Protein GI239825661 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000349204 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGAT TAGCACATCA CCAAGGAATC CACAAGTTTT TCTTCACGCT GGGGTTGACG 
CTGCAGCTTT CCAAACCGGT CATCAAGCAT CTCATTCATA TTGTCGATGC CTTGACCACC
AAGGGATTCT CGGGAACATT GACTGATATT CATTACTGGA GCTTTCATCC GAATCATCGA
ACGACGCTCA GTCACTTTTT CACGAAAAGC CCTTGGAACG AGGAAAGGCT GCTTGGGAAG
CTTCAAGAGT GGATCCTTTC CCAGGTCGAA CGACTGGCCA AACGGAAGAA TCAACCCCTT
TTTGTTTCGA TTGATGATAC GATTTGCCAA AAAACAAAGC CTTCGTCACG GGCTGTGCAC
GCCATTCAAG GGTGCGACTG GCACTACTCG CATAAAGATC ATCAATCGGT CTGGGGGCAT
TCGCTCGTTT GGCTGATGGT GCACACCTTC ACGCAGGCGT TCCCATTTGC GTTCCGCCTG
TATGACAAGA AAGCGGGAAA AAGCAAGATC GACCTGGCGA TCGAGATGCT TTCCTCGCTC
AAGGTGAAGC GGGCTCAGCC GGTGTATGTG CTCATGGATT CGTGGTATCC GTCCAAAAAG
CTCATCGAAG CCTGTCTGAA ACAGGGATTC CATGTCATCG CGATGCTCAA GACGAACCGG
ATTCTCTACC CGAAAGGCAT CGCCATCCAA GCCAAGCAGT TTGCCCGCTA TATCGAGTCC
AAAGACACCC GCCTCGTCAC GGTGGGGCAG GAGCGTTATC GCGTGTATCG CTATGAGGGG
GCCATCCATG GCCTCGATGA CGCGGTGGTG CTGCTGGCTT GGAAGGCGGA TCAGCCGATG
GCGCCGGAAC ATCTTCATTG CATCTTGAGC ACCGACCGGG AACTCGGGGA CGAAGACATC
TTGCGTTACT ACGCCCAGCG CTGGACGATC GAGTGCTTTT TCCGGCAGGC GAAAGATCAA
CTGAAGCTGG ATGGATACCG CGTTCGCCAC ATTCGGGCGG TGAAACGGTA TTGGGCGGTG
GTGCTGTTGG CCTGCGTGTA CAGCATCGCC GAATCCCGAC AAAACCTCTC CACCGGGCTG
GAGCTTCTTC GGTCGCGGAA AGACCACAGC GTCGTCGAGT TCATTTATGA CGCTGCAAAG
CAAGATATTC CCATTGATGT GATCAAAAAA CAGCTCCGTA TCGCGTAA
 
Protein sequence
MNRLAHHQGI HKFFFTLGLT LQLSKPVIKH LIHIVDALTT KGFSGTLTDI HYWSFHPNHR 
TTLSHFFTKS PWNEERLLGK LQEWILSQVE RLAKRKNQPL FVSIDDTICQ KTKPSSRAVH
AIQGCDWHYS HKDHQSVWGH SLVWLMVHTF TQAFPFAFRL YDKKAGKSKI DLAIEMLSSL
KVKRAQPVYV LMDSWYPSKK LIEACLKQGF HVIAMLKTNR ILYPKGIAIQ AKQFARYIES
KDTRLVTVGQ ERYRVYRYEG AIHGLDDAVV LLAWKADQPM APEHLHCILS TDRELGDEDI
LRYYAQRWTI ECFFRQAKDQ LKLDGYRVRH IRAVKRYWAV VLLACVYSIA ESRQNLSTGL
ELLRSRKDHS VVEFIYDAAK QDIPIDVIKK QLRIA