Gene Information |
Locus tag | GWCH70_1374 |
Symbol | |
ID | 7978173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1446793 |
End bp | 1447695 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644798303 |
Product | protein of unknown function DUF6 transmembrane |
Protein accession | YP_002949476 |
Protein GI | 239826852 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
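The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no amino acid.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(1446793, 1447695)
print(length_bp, protein_length(length_bp))  # 903 300
```

The result matches the Gene Length (903 bp) and Protein Length (300 aa) rows. The strand does not affect this calculation, since the coordinates describe the span on the replicon regardless of orientation.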
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000124016 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGA AAATCTTTTT AGTGTTAGCC AACTTATTTT GGGCAGGTAA TTATCTATTT GGAAAATATG TCATCGCAGA AATTAGCCCT CTATGGCTTA CATTCATCCG CTGGAGCGTT GCGTTTTTAT TTCTTCTTCC CATCTCCTAT TTTTTGGAAC GACCGCGCTA TGGAGAAATC ATGAAGCAAT TTTGGCTGCC ATTAAGCATC GCCGGTATTC TTGGCATTAT TGGCTATAAC CTGCTTCTTT ACGGAGCATT GGAATATACC TCACCGATGA ATGCGGCGAT TGTCAACGCA TTGAACCCGG CGATTATCGT GATTATGTCG TATTTTCTTT TAAAAGAAAA AGAACGTATG ACATTTATCA ATGTGATCGG TTTTGTCATT TCGCTAGTCG GCGTATTGTT TATTTTAACA AACGGACATC TGCAGTGGAT TTTCCAAACG AGCTATAACC GAGGCGATTT GATGATGCTC ATCGCTGGTG TAGTGTGGGC GCTTTATTCG ATTATCGGAA AAAAATTAGC CGTTCCCCCG ATTACTGCGA CTACTTGTTC CGTATTTTTC AGCATTATTT TGTTGTTTCC GTTTCTTTTC TTTCAGCCGA TTCCGCTTGC CGAGTTAAGC GGGAAAGGAT GGATTGGCAT CAGCTATATT TGTTTGTTTC CATCCGTTTT TTCGTTCTTA TTTTGGAACA TGTCTGTCAA AAAGGTTGGA CCAAGCTACG CAGGCATTTA TTTGAACTTA ATCGCCGTAT TTACAGCGTT GTTCACATTT TTATTAGGGG GAAAAATTTC CGCGTCACAA CTGATTGGCG GTGCATTTGT GTTGATTGGG GTATACTTGG CAACAAACGC CCGAAAAGCA GAAGCAGCCC AGGCGAAAGA CGCGGCCATA TAA
|
Protein sequence | MKEKIFLVLA NLFWAGNYLF GKYVIAEISP LWLTFIRWSV AFLFLLPISY FLERPRYGEI MKQFWLPLSI AGILGIIGYN LLLYGALEYT SPMNAAIVNA LNPAIIVIMS YFLLKEKERM TFINVIGFVI SLVGVLFILT NGHLQWIFQT SYNRGDLMML IAGVVWALYS IIGKKLAVPP ITATTCSVFF SIILLFPFLF FQPIPLAELS GKGWIGISYI CLFPSVFSFL FWNMSVKKVG PSYAGIYLNL IAVFTALFTF LLGGKISASQ LIGGAFVLIG VYLATNARKA EAAQAKDAAI
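The gene sequence is listed in coding orientation (it begins with ATG although the gene lies on the minus strand), so it can be translated directly. Below is a minimal sketch for stripping the 10-base block spacing, computing GC content, and translating up to the first stop codon. It assumes plain A/C/G/T input and uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons). All names in the block are illustrative.

```python
# Codon table in NCBI "TCAG" order. Translation table 11 assigns the same
# amino acids as the standard code; only the allowed start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGAAAGAGA AAATCTTTTT AGTGTTAGCC"
print(translate(first_blocks))  # MKEKIFLVLA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` should allow the same comparison against the complete protein sequence and the GC content row.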