Gene GWCH70_2824 details


Gene Information

Locus tag: GWCH70_2824
Symbol:
ID: 7978040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 2855155
End bp: 2856342
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 53%
IMG OID: 644799614
Product: transposase IS4 family protein
Protein accession: YP_002950773
Protein GI: 239828149
COG category:
COG ID:
TIGRFAM ID:
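
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


# Values copied from the record above.
start_bp, end_bp = 2855155, 2856342
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1188 395"
```

Both results match the Gene Length and Protein Length fields of the record.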


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAGAT TAGCACATCA CCAAGGAATC CACAAGTTTT TCTTCACGCT GGGGTTGACG 
CTGCAGCTTT CCAAACCGGT CATCAAGCAT CTCATTCATA TTGTCGATGC CTTGACCACC
AAGGGATTCT CGGGAACATT GACTGATATT CATTACTGGA GCTTTCATCC GAATCATCGA
ACGACGCTCA GTCACTTTTT CACGAAAAGC CCTTGGAACG AGGAAAGGCT GCTTGGGAAG
CTTCAAGAGT GGATCCTTTC CCAGGTCGAA CGACTGGCCA AACGGAAGAA TCAACCCCTT
TTTGTTTCGA TTGATGATAC GATTTGCCAA AAAACGAAGC CTTCGTCACG GGCTGCGCAC
GCCATTCAAG GGTGCGACTG GCACTACTCG CATAAAGATC ATCAATCGGT CTGGGGGCAT
TCGCTCGTTT GGCTGATGGT GCACACCTTC ACGCAGGCGT TCCCATTTGC GTTCCGCCTG
TATGACAAGA AAGCGGGAAA AAGCAAGATC GACCTGGCGA TCGAGATGCT TTCCTCGCTC
AAGGTGAAGC GGGCTCAGCC GGTGTATGTG CTCATGGATT CGTGGTATCC GTCCAAAAAG
CTCATTGAAG CCTGCTTGAA ACAGGGATTC CATGTCATCG CGATGCTCAA GACGAACCGG
ATTCTCTACC CGAAAGGCAT CGCCATCCAA GCCAAGCAGT TCGCCCGCTA TATCGAGTCC
AAAGACACCC GCCTCGTCAC GGTGGGGCAG GAGCGTTATC GCGTGTATCG CTATGAGGGG
GCCATCCATG GCCTCGATGA CGCGGTGGTG CTGCTGGCTT GGAAGGCGGA TCAGCCGATG
GCGCCGGAAC ATCTTCATTG CATCTTGAGC ACCGACCGGG AACTCGGGGA CGAAGACATC
TTGCGTTACT ACGCCCAGCG CTGGACGATC GAGTGCTTTT TCCGGCAGGC GAAAGATCAA
CTGAAGCTGG ATGGATACCG CGTTCGCCAC ATTCGGGCGG TGAAACGGTA TTGGGCGGTG
GTGCTGTTGG CCTGCGTGTA CAGCATCGCC GAATCCCGAC AAAACCTCTC CACCGGGCTG
GAGCTTCTTC GGTCGCGGAA AGACCACAGC GTCGTCGAGT TCATTTATGA CGCTGCGAAG
CAAGATATTC CCATTGATGT GATCAAAAAA CAGCTCCGTA TCGCGTAA
 
Protein sequence
MNRLAHHQGI HKFFFTLGLT LQLSKPVIKH LIHIVDALTT KGFSGTLTDI HYWSFHPNHR 
TTLSHFFTKS PWNEERLLGK LQEWILSQVE RLAKRKNQPL FVSIDDTICQ KTKPSSRAAH
AIQGCDWHYS HKDHQSVWGH SLVWLMVHTF TQAFPFAFRL YDKKAGKSKI DLAIEMLSSL
KVKRAQPVYV LMDSWYPSKK LIEACLKQGF HVIAMLKTNR ILYPKGIAIQ AKQFARYIES
KDTRLVTVGQ ERYRVYRYEG AIHGLDDAVV LLAWKADQPM APEHLHCILS TDRELGDEDI
LRYYAQRWTI ECFFRQAKDQ LKLDGYRVRH IRAVKRYWAV VLLACVYSIA ESRQNLSTGL
ELLRSRKDHS VVEFIYDAAK QDIPIDVIKK QLRIA
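
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC percentage. This is a minimal sketch, not the database's own method. It assumes the standard elongation codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, and this gene begins with ATG, so no start-codon handling is needed). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, dropping one trailing stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAATAGAT TAGCACATCA CCAAGGAATC"))  # prints "MNRLAHHQGI"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field of the record.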