Gene Information |
Locus tag | Moth_1860 |
Symbol | |
ID | 3831491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1921593 |
End bp | 1922804 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637829792 |
Product | major facilitator transporter |
Protein accession | YP_430703 |
Protein GI | 83590694 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
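The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is an illustrative check only: the function names `gene_length` and `protein_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates and that the gene length includes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1921593, 1922804)  # Start bp and End bp from the record
print(length_bp, protein_length(length_bp))
```

With the values in this record, the sketch gives 1212 bp and 403 aa, matching the Gene Length and Protein Length fields.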
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000002484 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAACAGTA AACTCAAGGA AAAACAATTT ATTATGACAA GATCGTTTAT ATTGTTGATG GCGGTTGTCT GTGGCGTTTC CGTCGCAAAC CTTTATTACA TACAACCGCT GGAGGGGCAG ATTTCGACTA CTTTTCATGT CTCACAGAGC GCGGCAGGTA TTGCAGCCAT GCTCACACAG GTGGGTTATG CGTTTGGCCT GTTGTTATTT GTTCCACTGG GAGACATGTG TGAACGCCGC TCCCTTATTC TGCATATGCT GCTTTTGGTT GCTATATCAC TGCTCACAGC TGGTTTATCA CCGTGCTATC CTGTGCTGCT AATTGCGATG TTTGCCGTTG GGATTACAAC AATCGTGCCA CAACTTATCG TTCCCTATGC AGCCCATCTT TCACGTCCGG AAGAGCAGGG GGAAATTATT GGCTATGTCA TGAGCGGTCT GCTTATCGGA ATTTTGCTGT CCCGGACATT CAGTGGCCTT GTGGGCGCGG CTTTGAATTG GCGAGCAGTT TACCTTTTTG CAGCCGGATT TATCATTATT TTATTGGTTC TAATCAGGTG TTTTTTTCCG GAAAGCCAGC CGTCTTCAAA GATTTCATAT CAAGAGCTAC TCAAATCAAT ACCTGGTCTC GTTAAGAGAG AACGCCCTCT GCGTGAAGCG GCCCTCAATG GTTTTTTCAT GTTTGGTTCG TTCAGCGCGT TCTGGACTTC TCTGATTTTC CTTCTTGAAA CACCGATCTA TCGTATGAGT ACAAGAGAAG CAGGTTTGTT CGGGTTAGCA GGAGTAGCCG GTGCGCTCGC AGCACCTCTG ATTGGGAAAG CAGCTGACAC AAAAAGCCCG CGTTTTACAG TAGGTATTGG CGTCATCCTA TCGACTCTTG CCTATCTATG CTTTAGCCTG TTCGGGTATA ATATTTGGGG CCTTATCATC GGCGTTATCG TGCTTGATCT TGGCAATCAG TGTGGACAAG TTTCCAATCA GGCAAGGGTC CAAGCACTTG GTGACTCAAC ACGGAGTCGC AATAACACCG TGTTCATGTT TTCATATTTT ATCGGTGGAG CAGCAGGCTC TTTCCTTGGC ACCTTTTGTT GGCAGCATTA TGGATGGTAC GGTGTTTGCA TGGTAGGGCT TGCGTTCCAA TTTGCTGCGT TAATTACTCA TTTTTTGATT TACAGAAAGC AAAAATTTGA TAATGCGCTG CTGAGTCGTT AG
|
Protein sequence | MNSKLKEKQF IMTRSFILLM AVVCGVSVAN LYYIQPLEGQ ISTTFHVSQS AAGIAAMLTQ VGYAFGLLLF VPLGDMCERR SLILHMLLLV AISLLTAGLS PCYPVLLIAM FAVGITTIVP QLIVPYAAHL SRPEEQGEII GYVMSGLLIG ILLSRTFSGL VGAALNWRAV YLFAAGFIII LLVLIRCFFP ESQPSSKISY QELLKSIPGL VKRERPLREA ALNGFFMFGS FSAFWTSLIF LLETPIYRMS TREAGLFGLA GVAGALAAPL IGKAADTKSP RFTVGIGVIL STLAYLCFSL FGYNIWGLII GVIVLDLGNQ CGQVSNQARV QALGDSTRSR NNTVFMFSYF IGGAAGSFLG TFCWQHYGWY GVCMVGLAFQ FAALITHFLI YRKQKFDNAL LSR
|
| |
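The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates this with a minimal translator; the function name `translate_cds` is hypothetical, and the code assumes the input is a complete coding sequence on the forward strand, with spaces allowed as in the record.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-residue assignments as the standard code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("TTGAACAGTA AACTCAAGGA AAAACAATTT"))
```

For the first 30 bases of the record this yields MNSKLKEKQF, the first ten residues of the listed protein sequence.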