Gene Moth_0116 details

Gene Information

Locus tag: Moth_0116
Symbol:
ID: 3832006
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 114256
End bp: 115476
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 56%
IMG OID: 637828050
Product: major facilitator transporter
Protein accession: YP_428998
Protein GI: 83588989
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR01131] ATP synthase subunit 6 (eukaryotes), also subunit A (prokaryotes)
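
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(114256, 115476)
    # Prints "1221 406", matching the Gene Length and Protein Length fields.
    print(length, protein_length_aa(length))
```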


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGAAAGCAT CTATGAAATT ACCCTTTGCT ATCCTCTGTA CTGTTCCCTT CATTATGGTC 
CTGGGTAACT CGATGCTGGT TCCCCTGTTA CCCCTGATGC GTTCGAGCCT CAATGTTACC
CTGGTCCAGG TGAGCCTTTT CATCACGGCC TTCTCCCTGC CGGCCGGGAT AGTAATCCCT
TTTGCCGGCT TTCTGTCCGA CTGCTACGGC CGTAAAACCA TCATGGCCCC GGCCCTCATC
ATCTACGGAG CCGGCGGGAT CCTGGCCGGC CTGGCCGCCT GGCTGGTAGC CTCCCCCTAT
TACCTGATCC TGGGTAGCCG CATCCTGCAA GGTATCGGCG CCGGTGGCAC CTACCAGCTG
GCTATGGCCC TGGCCAGTGA TATCTTCCAG AGCAACGAGC GAACCAAAGC CCTGGGCCTG
CTGGAGGCGG CCAATGGCCT GGGTAAGGTC GTCAGCCCCA TTGCCGGCGC AGCCCTGGGC
CTGCTGCTGT GGTTCGCCCC CTTTTTTGTC TATGGAATTC TGGCCATTCC CATCGGCCTG
GCAGTATGGT TTCTGGTCCG GGAACCTGAA CAGGGGAATA AAAACAAGAC CGATTTTCAT
AGTTATTTCC ATAACCTGGG GCAGGTCCTC CAGGAGAAAG GTCTTTCCCT CATGGCCAGT
ATCCTGGCCG GGATGATTGT TCTTTTTATT TTGTTCGGGG TCTTGAGTTA TGTATCCGAC
CTGCTGGAAG GGAGGTACGG CGTCACAGGC ATCCGCATCG GCCTGCTCAT TGCCATCCCT
GTAGGAACCA TGGCCCTGAC TTCCTATTTA TCCGGCAGTT ACTTACAGAA AACAGCCGGC
AACTTTCTCA AAATCGTAAT CATTGCCGGC CTGGTGCTGG AAGCCGCCGC TCTGGTGATC
ATGGGCTTCT TTGCCAATAT TTATATCTTT TTCCTGGCTA TGATCCTCAT GGGTTTCGGA
ACGGGAATTG TCCTACCGGC GGTCAATACC CTGATTACCA GCGCTTCCGC CAGGGAACGG
GGTGGTATTA CCTGTCTTTA TGGCTCGGCC CGGTTTTTTG GCGTCGCCCT TGGGCCCCCG
GCCTTCGGCC TGGCCATGAC CCTGGGAAAG TTACCCCTGT TCCTCGGAGC CGCCGTGCTG
GTGGGGATTA TTGCCTTTTT GACTATTGCC TTTATCCAGA CCGAAAAGAT GCTCCCGCCG
GAATTACTGC CGGGACATTA A
 
Protein sequence
MKASMKLPFA ILCTVPFIMV LGNSMLVPLL PLMRSSLNVT LVQVSLFITA FSLPAGIVIP 
FAGFLSDCYG RKTIMAPALI IYGAGGILAG LAAWLVASPY YLILGSRILQ GIGAGGTYQL
AMALASDIFQ SNERTKALGL LEAANGLGKV VSPIAGAALG LLLWFAPFFV YGILAIPIGL
AVWFLVREPE QGNKNKTDFH SYFHNLGQVL QEKGLSLMAS ILAGMIVLFI LFGVLSYVSD
LLEGRYGVTG IRIGLLIAIP VGTMALTSYL SGSYLQKTAG NFLKIVIIAG LVLEAAALVI
MGFFANIYIF FLAMILMGFG TGIVLPAVNT LITSASARER GGITCLYGSA RFFGVALGPP
AFGLAMTLGK LPLFLGAAVL VGIIAFLTIA FIQTEKMLPP ELLPGH
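
The protein sequence can be re-derived from the gene sequence as a consistency check. The following is a minimal sketch, not the method used to produce this record. It assumes the gene sequence is given 5' to 3' on the coding strand, uses the codon assignments of NCBI translation table 11, and reads an alternative initiation codon in first position (such as the TTG that opens this gene) as methionine. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, listed in the
# conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Raises KeyError on bases other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases of the gene sequence above.
    print(translate("TTGAAAGCAT CTATGAAA"))  # prints "MKASMK"
```

Passing the full gene sequence block to `translate` and comparing the result with the protein sequence (after removing its spacing with `clean`) gives an end-to-end check. `gc_percent` on the same input can be compared with the GC content field.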