Gene GWCH70_1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1993 
Symbol 
ID7979491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2050753 
End bp2051943 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content45% 
IMG OID644798820 
Producttoxic anion resistance family protein 
Protein accessionYP_002949990 
Protein GI239827366 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3853] Uncharacterized protein involved in tellurite resistance 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00104364 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCGT CTGATCACAT GCCGGCAAAC GATCTTCGCG AGCAGTGGAC AAGCTCGCTG 
GATTCGCTTT TGGAAAATCC GTTTTCCTTA CCGAATGAAC AAGAGGAGAC GGATATTCAT
AAACAAACGC AGCCGACAAG ACTGATCGAT ACGTTAAAGC CGGAACATCG CGACAAAGCA
CTGCAGCTTG CGAAACAAAT CGATCCGCGC AACCAGCAAG CCATCATTCA ATACGGGGTT
GCGGCGCAGG CAGAGCTGTC GAAGTTTTCC CATACGATTT TACATCATGT GCAAACAAAA
GATGCAGGGC CCGTTGGGGA AGTGATCAGT GATTTAATGA CGAAAATTAA AGAAGTGAAT
CCGGATGACT TGCTTCCGGC GAAAAAAGGA TTGTTTGCGC GGCTGTTTGG CTCTGTATCA
AAATCGCTGC AAGGCATGAT CGCCAAATAC CAAAAAATTG GCGTAGAGAT TGATAAAATC
GCCGATCAAC TGGAAAAGCA CCGTCAATTG CTGTTTCGCG ACATTATGAT GTTAGAAACG
TTGTACGAAA AAAATAAGGA ATACTTTGAT GCGCTTAACA TTTATATTGC GGCGGCGGAG
TATAAACTAG AAGAATTGCG GACGAAAGTG ATTCCAGAAA AACGCGCCCA AGCAGAACGG
TCAGGAAACC AAATGGAAAT GCAAGAAGTC AACGATTTGT TGCAGTTTGC CGATCGCCTG
GAAAAACGCA TTCACGATTT AAAATTAAGC CGGCAAGTAA CGATTCAAAC CGCACCGCAA
ATCCGCATGA TTCAACATAT GAACCAGACA CTGGTCGAGC GCATCCAATC ATCGATTTTA
ACCGCGATTC CGCTATGGAA AAACCAAGTT GTTATCGCTC TGACGCTATT CCGCCAGCAA
AAAGCGGTCG AGGCGCAAAA ACAAGTAACG GAAACGACGA ACAATTTGCT GCTTCGCAAT
TCGGAAATGC TGAAAACAAA CAGCATCGAA GTCGCGAAAG AAAACGAGCG CGGTCTCATT
GATATTGAAA CATTGAAAAA AACGCAGGAA AATTTAGTGA CGACATTAGA AGAAACGTTA
AAAATCCAGC AAGAAGGCCG CCTCAAACGC CAGCAAGTAG AACGAGAACT CGTTACGATG
GAAGAACAAC TGAAACAAAC GTTGTTGTCA TTAAAACGAA ACGATGGATG A
 
Protein sequence
MKPSDHMPAN DLREQWTSSL DSLLENPFSL PNEQEETDIH KQTQPTRLID TLKPEHRDKA 
LQLAKQIDPR NQQAIIQYGV AAQAELSKFS HTILHHVQTK DAGPVGEVIS DLMTKIKEVN
PDDLLPAKKG LFARLFGSVS KSLQGMIAKY QKIGVEIDKI ADQLEKHRQL LFRDIMMLET
LYEKNKEYFD ALNIYIAAAE YKLEELRTKV IPEKRAQAER SGNQMEMQEV NDLLQFADRL
EKRIHDLKLS RQVTIQTAPQ IRMIQHMNQT LVERIQSSIL TAIPLWKNQV VIALTLFRQQ
KAVEAQKQVT ETTNNLLLRN SEMLKTNSIE VAKENERGLI DIETLKKTQE NLVTTLEETL
KIQQEGRLKR QQVERELVTM EEQLKQTLLS LKRNDG