Gene Moth_1860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1860 
Symbol 
ID3831491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1921593 
End bp1922804 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content46% 
IMG OID637829792 
Productmajor facilitator transporter 
Protein accessionYP_430703 
Protein GI83590694 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000002484 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAACAGTA AACTCAAGGA AAAACAATTT ATTATGACAA GATCGTTTAT ATTGTTGATG 
GCGGTTGTCT GTGGCGTTTC CGTCGCAAAC CTTTATTACA TACAACCGCT GGAGGGGCAG
ATTTCGACTA CTTTTCATGT CTCACAGAGC GCGGCAGGTA TTGCAGCCAT GCTCACACAG
GTGGGTTATG CGTTTGGCCT GTTGTTATTT GTTCCACTGG GAGACATGTG TGAACGCCGC
TCCCTTATTC TGCATATGCT GCTTTTGGTT GCTATATCAC TGCTCACAGC TGGTTTATCA
CCGTGCTATC CTGTGCTGCT AATTGCGATG TTTGCCGTTG GGATTACAAC AATCGTGCCA
CAACTTATCG TTCCCTATGC AGCCCATCTT TCACGTCCGG AAGAGCAGGG GGAAATTATT
GGCTATGTCA TGAGCGGTCT GCTTATCGGA ATTTTGCTGT CCCGGACATT CAGTGGCCTT
GTGGGCGCGG CTTTGAATTG GCGAGCAGTT TACCTTTTTG CAGCCGGATT TATCATTATT
TTATTGGTTC TAATCAGGTG TTTTTTTCCG GAAAGCCAGC CGTCTTCAAA GATTTCATAT
CAAGAGCTAC TCAAATCAAT ACCTGGTCTC GTTAAGAGAG AACGCCCTCT GCGTGAAGCG
GCCCTCAATG GTTTTTTCAT GTTTGGTTCG TTCAGCGCGT TCTGGACTTC TCTGATTTTC
CTTCTTGAAA CACCGATCTA TCGTATGAGT ACAAGAGAAG CAGGTTTGTT CGGGTTAGCA
GGAGTAGCCG GTGCGCTCGC AGCACCTCTG ATTGGGAAAG CAGCTGACAC AAAAAGCCCG
CGTTTTACAG TAGGTATTGG CGTCATCCTA TCGACTCTTG CCTATCTATG CTTTAGCCTG
TTCGGGTATA ATATTTGGGG CCTTATCATC GGCGTTATCG TGCTTGATCT TGGCAATCAG
TGTGGACAAG TTTCCAATCA GGCAAGGGTC CAAGCACTTG GTGACTCAAC ACGGAGTCGC
AATAACACCG TGTTCATGTT TTCATATTTT ATCGGTGGAG CAGCAGGCTC TTTCCTTGGC
ACCTTTTGTT GGCAGCATTA TGGATGGTAC GGTGTTTGCA TGGTAGGGCT TGCGTTCCAA
TTTGCTGCGT TAATTACTCA TTTTTTGATT TACAGAAAGC AAAAATTTGA TAATGCGCTG
CTGAGTCGTT AG
 
Protein sequence
MNSKLKEKQF IMTRSFILLM AVVCGVSVAN LYYIQPLEGQ ISTTFHVSQS AAGIAAMLTQ 
VGYAFGLLLF VPLGDMCERR SLILHMLLLV AISLLTAGLS PCYPVLLIAM FAVGITTIVP
QLIVPYAAHL SRPEEQGEII GYVMSGLLIG ILLSRTFSGL VGAALNWRAV YLFAAGFIII
LLVLIRCFFP ESQPSSKISY QELLKSIPGL VKRERPLREA ALNGFFMFGS FSAFWTSLIF
LLETPIYRMS TREAGLFGLA GVAGALAAPL IGKAADTKSP RFTVGIGVIL STLAYLCFSL
FGYNIWGLII GVIVLDLGNQ CGQVSNQARV QALGDSTRSR NNTVFMFSYF IGGAAGSFLG
TFCWQHYGWY GVCMVGLAFQ FAALITHFLI YRKQKFDNAL LSR