Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1993 |
Symbol | |
ID | 7979491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2050753 |
End bp | 2051943 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644798820 |
Product | toxic anion resistance family protein |
Protein accession | YP_002949990 |
Protein GI | 239827366 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3853] Uncharacterized protein involved in tellurite resistance |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00104364 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCGT CTGATCACAT GCCGGCAAAC GATCTTCGCG AGCAGTGGAC AAGCTCGCTG GATTCGCTTT TGGAAAATCC GTTTTCCTTA CCGAATGAAC AAGAGGAGAC GGATATTCAT AAACAAACGC AGCCGACAAG ACTGATCGAT ACGTTAAAGC CGGAACATCG CGACAAAGCA CTGCAGCTTG CGAAACAAAT CGATCCGCGC AACCAGCAAG CCATCATTCA ATACGGGGTT GCGGCGCAGG CAGAGCTGTC GAAGTTTTCC CATACGATTT TACATCATGT GCAAACAAAA GATGCAGGGC CCGTTGGGGA AGTGATCAGT GATTTAATGA CGAAAATTAA AGAAGTGAAT CCGGATGACT TGCTTCCGGC GAAAAAAGGA TTGTTTGCGC GGCTGTTTGG CTCTGTATCA AAATCGCTGC AAGGCATGAT CGCCAAATAC CAAAAAATTG GCGTAGAGAT TGATAAAATC GCCGATCAAC TGGAAAAGCA CCGTCAATTG CTGTTTCGCG ACATTATGAT GTTAGAAACG TTGTACGAAA AAAATAAGGA ATACTTTGAT GCGCTTAACA TTTATATTGC GGCGGCGGAG TATAAACTAG AAGAATTGCG GACGAAAGTG ATTCCAGAAA AACGCGCCCA AGCAGAACGG TCAGGAAACC AAATGGAAAT GCAAGAAGTC AACGATTTGT TGCAGTTTGC CGATCGCCTG GAAAAACGCA TTCACGATTT AAAATTAAGC CGGCAAGTAA CGATTCAAAC CGCACCGCAA ATCCGCATGA TTCAACATAT GAACCAGACA CTGGTCGAGC GCATCCAATC ATCGATTTTA ACCGCGATTC CGCTATGGAA AAACCAAGTT GTTATCGCTC TGACGCTATT CCGCCAGCAA AAAGCGGTCG AGGCGCAAAA ACAAGTAACG GAAACGACGA ACAATTTGCT GCTTCGCAAT TCGGAAATGC TGAAAACAAA CAGCATCGAA GTCGCGAAAG AAAACGAGCG CGGTCTCATT GATATTGAAA CATTGAAAAA AACGCAGGAA AATTTAGTGA CGACATTAGA AGAAACGTTA AAAATCCAGC AAGAAGGCCG CCTCAAACGC CAGCAAGTAG AACGAGAACT CGTTACGATG GAAGAACAAC TGAAACAAAC GTTGTTGTCA TTAAAACGAA ACGATGGATG A
|
Protein sequence | MKPSDHMPAN DLREQWTSSL DSLLENPFSL PNEQEETDIH KQTQPTRLID TLKPEHRDKA LQLAKQIDPR NQQAIIQYGV AAQAELSKFS HTILHHVQTK DAGPVGEVIS DLMTKIKEVN PDDLLPAKKG LFARLFGSVS KSLQGMIAKY QKIGVEIDKI ADQLEKHRQL LFRDIMMLET LYEKNKEYFD ALNIYIAAAE YKLEELRTKV IPEKRAQAER SGNQMEMQEV NDLLQFADRL EKRIHDLKLS RQVTIQTAPQ IRMIQHMNQT LVERIQSSIL TAIPLWKNQV VIALTLFRQQ KAVEAQKQVT ETTNNLLLRN SEMLKTNSIE VAKENERGLI DIETLKKTQE NLVTTLEETL KIQQEGRLKR QQVERELVTM EEQLKQTLLS LKRNDG
|
| |