Gene Information |
Locus tag | GWCH70_1955 |
Symbol | |
ID | 7978777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 2015953 |
End bp | 2017134 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644798783 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002949953 |
Protein GI | 239827329 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
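The length fields in this record can be cross-checked against its coordinates. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon. The record does not state either convention. The helper names `span_length` and `coding_length` are hypothetical.

```python
# Values copied from the Gene Information rows above.
start_bp = 2015953
end_bp = 2017134
gene_length_bp = 1182
protein_length_aa = 393


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(gene_bp: int) -> int:
    """Residues encoded by a CDS of gene_bp bases, assuming the span
    ends with a stop codon that is not counted as a residue."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = coding_length(computed_gene_length)

# Compare the computed values with the fields listed in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions both comparisons hold for this record: the span covers 1182 bp, which is 394 codons, or 393 residues plus one stop codon.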
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.135687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACGTACA TCCAAAAAGG CAGCGCCGAA TACATAAAAG CAAGTTTGGC GCTATTTTTC GGCGGGTTTG TCACCTTTGC GATTTTATAC ACGACGCAGC CGCTGTTGCC GATTTTCGCA AAAGAGTTTC ATGTTTCTGC GGCGTCCGCC AGCTTAACGG TTTCCGCGTC TACCGGAACG TTGGCGGTAA TGATGCTCAT AGCGGCCAGT TTATCCGACC GCATCGGCAA AAAGAATGTG ATGATGATGT CCATGCTGTT GACATCGATG CTGGCAATGG CCATGGCGTT TAGCCCTAAT TTTCTTTCCC TTGTCCTTGC GCGCACGTTT CTTGGCATGA CGGCCGCGGG CATCCCGTCA CTAGCCATGG CTTACGTCGC TGAGGAATTT CATCCGGCTG GGATTGGAAA AGTGATGGGT CTCTATATTA GTGGAACGAG CATCGGCGGT ATGGCTGGAC GTATTGTAAC AGGCGTGCTG ACCGATTTGT TTTCGTGGCG GACGGCGCTA TTTTCCATTG GAGCAATTTC TCTTCTATTA AGCATGATTT TTGCTTTCAT TCTCCCAACG CCTCGACATT CAATAAAAAA ACCGCTCAAT GGCAAAACCG CACTGCAGGC GTATGCCGTT CATTTGCGCA ATAAACCGCT AATGGCTCTC ATTGCACTTG GCTTTTTATT TATGGGCGGA TTTGTGACGC TGTATAACTA TATCGGCTTT TTATTGAGCG AACCGCCATA TTCGTTCAGC CAGTCCGTAC TTGGATTTCT GTTTATCGTG TACATATTCG GCAGCTTTAG CTCGGTTTAT ATGGGAAGAA AAGCCGATTT ACACGGCCAT GCGCTGGTGT TAAGCATTTC GGTGGCATTG ACCGTGCTCG GTGCGGTCGT TACACTCGTT CCGTCTATTG TTGTCAACAT CATCGGGCTT TCGTTATTTA CTTTTGGCTT TTTCGGCTGC CATTCCATCG CGAGCACCTG GATTGGCGAA CGTGCAAACG TAAATAAAGC GCAAGCCTCT TCTCTTTATC TCTTATTCTA TTATTTAGGC TCCAGCCTCG CTGGCACAGC CGGCGGCTAT TTTTGGACCC ATTTTCATTG GCTTGGCGTC ATCACATTTA TCGTGGCTTT ATTGCTATTA AGCTTTCCAC TTATTGGCTA TGCTCAAAAG CAGCTCGAGT AA
Protein sequence | MTYIQKGSAE YIKASLALFF GGFVTFAILY TTQPLLPIFA KEFHVSAASA SLTVSASTGT LAVMMLIAAS LSDRIGKKNV MMMSMLLTSM LAMAMAFSPN FLSLVLARTF LGMTAAGIPS LAMAYVAEEF HPAGIGKVMG LYISGTSIGG MAGRIVTGVL TDLFSWRTAL FSIGAISLLL SMIFAFILPT PRHSIKKPLN GKTALQAYAV HLRNKPLMAL IALGFLFMGG FVTLYNYIGF LLSEPPYSFS QSVLGFLFIV YIFGSFSSVY MGRKADLHGH ALVLSISVAL TVLGAVVTLV PSIVVNIIGL SLFTFGFFGC HSIASTWIGE RANVNKAQAS SLYLLFYYLG SSLAGTAGGY FWTHFHWLGV ITFIVALLLL SFPLIGYAQK QLE