Gene Information |
Locus tag | GWCH70_2765 |
Symbol | |
ID | 7977988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2801942 |
End bp | 2803135 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644799561 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002950720 |
Protein GI | 239828096 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
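
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 2801942, 2803135

length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1194 397"
```

Under these assumptions the result agrees with the Gene Length (1194 bp) and Protein Length (397 aa) fields.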
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000543377 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCGTG CGCTTTGGAT ATTAATGATC GGCATGGCCA TCAACGTGAC AGGGGCTTCT TTTTTATGGC CGATGAATAC GATTTACGTT CATGAGCAGC TTGGCAAATC ACTCTCTGTC GCTGGAATGG TGCTCATGCT CAATTCTGGA GCAAGCGTGG TTGGAAACTT AGCCGGAGGG CTATTGTTTG ATAAAATTGG CGGTTTTAAG TCGATGATGA TCGGAATTGT CGCGACGATG TCCGCGCTTG TTGGCCTTGT TTTCTTTCAT GGCTGGCCGC ATTATGCCAT CTTTTTAACG ATCATCGGGA TGGGAAGCGG GATTGTATTT CCAGTAGCGG CTGCGTATGC GGGAGCAATT TGGCCAGAGG GCGGAAGACG GGCATTTAAC GGGCTTTATG TCGCGCAAAA TATTGGCGTG GCAGTTGGTT CGGCGCTTGG CGGTGTAGTC GCATCGTATT CGTTTACGCT TATTTTCCTT GCCAATTTAT TGCTATACGC CGTCTTTTGC TTGCTTATCG TGTTTGGATT GCGTCATGTC CGCGCTGTCA AAGCGGCAAA ATCAAAGGAG GCGGAAGCAG TGAAAGGAGA GGCGCGGGCC AATTGGTATG CGCTCGTGAC GCTTTGTACC GGTTATTTTT TATGTTGGAT TAGCTATGTG CAATGGGCAA CGACAATTGC TGCATATACG CAAGAGCTCC ACATTACGCT GAAACAATAT AGCCTTCTTT GGACGATTAA TGGTGCGCTT ATTGTATTCG CGCAACCTTT CTTGTCAGCA GTTGTGCAAC GATGGATGCG GCATGTAAAA CGGCAAATGC TTGTTGGATT TGCTATTTTT GTCGTTTCTT TTTCTCTTTT GCTGCAAGCG AATTCGTTTT TTGAATTTGC GCTGGCAATG GTTGTATTAA CGATTGCGGA AATGCTTGTA TGGCCGGCGA TTCCAACGGT GGCAAGCGAG CTCGCTCCTG CAGGGAAAGA AGGGTTTTAC CAAGGATTTG TCAACAGCAC GGCAACGGCA GGGAGAATGA TCGGACCTGT GCTAGGCGGC TTTATCGCTG ATTACGCTGG CATGAAACCG CTGTTTGCCG CGCTTGTTTT CTTTTGCGCG CTTTCGCTTG TCACGACGTT CTTTTATGAT CGGAACTTGC CAAAGAAGCA AAACAAGGGG AAACAAACGG TTACAGTAGG GTAG
|
Protein sequence | MPRALWILMI GMAINVTGAS FLWPMNTIYV HEQLGKSLSV AGMVLMLNSG ASVVGNLAGG LLFDKIGGFK SMMIGIVATM SALVGLVFFH GWPHYAIFLT IIGMGSGIVF PVAAAYAGAI WPEGGRRAFN GLYVAQNIGV AVGSALGGVV ASYSFTLIFL ANLLLYAVFC LLIVFGLRHV RAVKAAKSKE AEAVKGEARA NWYALVTLCT GYFLCWISYV QWATTIAAYT QELHITLKQY SLLWTINGAL IVFAQPFLSA VVQRWMRHVK RQMLVGFAIF VVSFSLLLQA NSFFEFALAM VVLTIAEMLV WPAIPTVASE LAPAGKEGFY QGFVNSTATA GRMIGPVLGG FIADYAGMKP LFAALVFFCA LSLVTTFFYD RNLPKKQNKG KQTVTVG
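
The gene sequence above is listed in coding orientation (it starts with ATG even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch, not the database's own method: it builds the codon table by hand, strips the 10-base spacing used in the record, and stops at the first stop codon. For internal codons, translation table 11 makes the same assignments as the standard code; its extra start codons do not matter here because this gene begins with ATG. All function names are illustrative.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon (stop not included).

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the Gene sequence field.
first_blocks = "ATGCCCCGTG CGCTTTGGAT ATTAATGATC"
print(translate(first_blocks))  # prints "MPRALWILMI"
```

The printed residues match the first block of the Protein sequence field. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.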