Gene GWCH70_2180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2180 
Symbol 
ID7976984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2241885 
End bp2243072 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content53% 
IMG OID644798995 
Producttransposase IS4 family protein 
Protein accessionYP_002950155 
Protein GI239827531 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000421211 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGAT TAGCACATCA CCAAGGAATC CACAAGTTTT TCTTCACGCT GGGGTTGACG 
CTGCAGCTTT CCAAACCGGT CATCAAGCAT CTCATTCATA TTGTCGATGC CTTGACCACC
AAGGGATTCT CGGGAACATT GACTGATATT CATTACTGGA GCTTTCATCC GAATCATCGA
ACGACGCTCA GTCACTTTTT CACGAAAAGC CCTTGGAACG AGGAAAGGCT GCTTGGGAAG
CTTCAAGAGT GGATCCTTTC CCAGGTCGAA CGACTGGCCA AACGGAAGAA TCAACCCCTT
TTTGTTTCGA TTGATGATAC GATTTGCCAA AAAACAAAGC CTTCGTCACG GGCTGTGCAC
GCCATTCAAG GGTGCGACTG GCACTACTCG CATAAAGATC ATCAATCGGT CTGGGGGCAT
TCGCTCGTTT GGCTGATGGT GCACACCTTC ACGCAGGCGT TCCCATTTGC GTTCCGCCTG
TATGACAAGA AAGCGGGAAA AAGCAAGATC GACCTGGCGA TCGAGATGCT TTCCTCGCTC
AAGGTGAAGC GGGCTCAGCC GGTGTATGTG CTCATGGATT CGTGGTATCC GTCCAAAAAG
CTCATCGAAG CCTGTCTGAA ACAGGGATTC CATGTCATCG CGATGCTCAA GACGAACCGG
ATTCTCTACC CGAAAGGCAT CGCCATCCAA GCCAAGCAGT TTGCCCGCTA TATCGAGTCC
AAAGACACCC GCCTCGTCAC GGTGGGGCAG GAGCGTTATC GCGTGTATCG CTATGAGGGG
GCCATCCATG GCCTCGATGA CGCGGTGGTG CTGCTGGCTT GGAAGGCGGA TCAGCCGATG
GCGCCGGAAC ATCTTCATTG CATCTTGAGC ACCGACCGGG AACTCGGGGA CGAAGACATC
TTGCGTTACT ACGCCCAGCG CTGGACGATC GAGTGCTTTT TCCGGCAGGC GAAAGATCAA
CTGAAGCTGG ATGGATACCG CGTTCGCCAC ATTCGGGCGG TGAAACGGTA TTGGGCGGTG
GTGCTGTTGG CCTGCGTGTA CAGCATCGCC GAATCCCGAC AAAACCTCTC CACCGGGCTG
GAGCTTCTTC GGTCGCGGAA AGACCACAGC GTCGTCGAGT TCATTTATGA CGCTGCGAAG
CAAGATATTC CCATTGATGT GATCAAAAAA CAGCTCCGTA TCGCGTAA
 
Protein sequence
MNRLAHHQGI HKFFFTLGLT LQLSKPVIKH LIHIVDALTT KGFSGTLTDI HYWSFHPNHR 
TTLSHFFTKS PWNEERLLGK LQEWILSQVE RLAKRKNQPL FVSIDDTICQ KTKPSSRAVH
AIQGCDWHYS HKDHQSVWGH SLVWLMVHTF TQAFPFAFRL YDKKAGKSKI DLAIEMLSSL
KVKRAQPVYV LMDSWYPSKK LIEACLKQGF HVIAMLKTNR ILYPKGIAIQ AKQFARYIES
KDTRLVTVGQ ERYRVYRYEG AIHGLDDAVV LLAWKADQPM APEHLHCILS TDRELGDEDI
LRYYAQRWTI ECFFRQAKDQ LKLDGYRVRH IRAVKRYWAV VLLACVYSIA ESRQNLSTGL
ELLRSRKDHS VVEFIYDAAK QDIPIDVIKK QLRIA