Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1853 |
Symbol | |
ID | 7976474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1918340 |
End bp | 1919503 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644798687 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002949857 |
Protein GI | 239827233 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000000358317 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG CTAATTTTCT TTCTCTTTTA TTTGCCGTTA TGTTTTTGGT GATGTCTGGT TTTGGCATTA TCATCCCGGT TCTTCCCTTT TTAGCAGAAG AAGTGGGTGC CACTCCTACG CAATTAGGAC TGCTAATGGC AACGTACTCT TTGATGCAAC TATTATTCGC TCCATTTTGG GGCCAGATGT CCGACCGTTA CGGGCGAAAA CCAATTTTAT TCATCGGCAT CGCCGGATTG TCGCTCTCGT TCTTTTTGTT TGCCGTGTCA AAAACACTTA CGATGTTATT TATCGCCCGT ATTATCGGCG GGATGCTATC AGCAGCTACG ATTCCTACCG CAATGGCTTA CGTTGCTGAC GTCACGACAC CGCAAGAACG TGGAAAAGCA ATGGGGGCTA TCGGCGCCGC TACCGGTCTT GGATTTATTT TTGGTCCTGC GATTGGCGGC ATTTTTTCGA AAATAAATTT GCATATTCCT TTCTTTATCT CAGGAACACT ATCCGCTGTT ACCGCATGTC TCGTGCTTCT TTTTTTGAAA GAATCATTGA CAAAAGAAAA ACAACCGGCA ACTTTAAAAA CAAAAGAACC GATATGGTAC ATACTAAAAG GGCCGCTTTT ATTTCTGTAC CTTCTTCAAT GGCTTATTAC TTTTTCATTG GCTGGGCTAG AAGCAACATT CGCTTACTAC GCAGCAAAAC GCGCAGAATT ATATTCCTCG CAATTAGGCT ATATTTTTAT GATTATGGGG CTTGCAAGTG CGATTGTCCA AGGAGGGCTC ATTGGAAAAC TAATTCAAAA ATTTGGTGAA GGCCGCGTCA TTCAAGGCGG AATCATTGTA TCAGCCGTTG GATTTGCCCT CATTTTATTC GTTCACAACT TCCTGACCGC CGCTATATTT TTATCGATTT TTGGCATCGG AAACGGTGTC ATCCGCCCTT GTGTATCCTC GCTAGTCACT AAACATATAT CAAGCGGCCA AGGGCGTGCT ACCGGATTGC TTTCTTCGTT TGACTCCCTT GGGCGCATCA TCGGACCTCC AATTGCCGGA CAGATGTTTA CAACAATGAT CGAACTTCCA TACTTAGCAG GTATTGTATT ATCCTGTTTC GCGTTGATCC TTTATCATGT TTTTGCAAAA CAGTCACAGC AAGTTTCGTC TTAA
|
Protein sequence | MKKANFLSLL FAVMFLVMSG FGIIIPVLPF LAEEVGATPT QLGLLMATYS LMQLLFAPFW GQMSDRYGRK PILFIGIAGL SLSFFLFAVS KTLTMLFIAR IIGGMLSAAT IPTAMAYVAD VTTPQERGKA MGAIGAATGL GFIFGPAIGG IFSKINLHIP FFISGTLSAV TACLVLLFLK ESLTKEKQPA TLKTKEPIWY ILKGPLLFLY LLQWLITFSL AGLEATFAYY AAKRAELYSS QLGYIFMIMG LASAIVQGGL IGKLIQKFGE GRVIQGGIIV SAVGFALILF VHNFLTAAIF LSIFGIGNGV IRPCVSSLVT KHISSGQGRA TGLLSSFDSL GRIIGPPIAG QMFTTMIELP YLAGIVLSCF ALILYHVFAK QSQQVSS
|
| |