Gene Moth_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2020 
Symbol 
ID3831395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2106914 
End bp2108107 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content42% 
IMG OID637829949 
Productinner-membrane translocator 
Protein accessionYP_430859 
Protein GI83590850 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4214] ABC-type xylose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.232129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTAATC TTATTAAAAC TATTTCCCCT GCAGAAACGG TACAATCTAA ACCTAAAGCG 
AATGGGTATT TAAGTAGAAT AGATATACGC GCTTATACAA TGATATTAGC CTTATTGGGT
ATATGGGCTA TTTTCACTTA TACTACCCAG GGTGCTTTTT TGACTTCCCG TAATCTATCA
AACCTCTTCA GGCAGATGTC AATTACCTCT ATTTTAGCGA TAGGTATGGT CTTTGTAATA
GTAGCCGGTC ATATTGACCT TTCCGTAGGT TCTCTCATGG GACTTACTGG AGGGGTAGCG
GCAATTTTAC AGGTCTGGTA TGGTTGGCAG ACCATTCCTG CTATTTTTAT AAGCTTTTTA
ATTGGTCTGC TGGCCGGCTT ATGGCAGGGC TGGTGGGTTG CCTATAAAAA GGTGCCTGCT
TTCATTGTCA CCCTGGGCGG TATGATGGTA TTTCGGGGAA TTCTAATAGG AATTAGTCAT
GGCGAAACAG TTTCGCCTCT CATGGATAGT TTTAAACAAA TAGGCCAATC CTATGTACCT
GAAAGTACAG GCTTCTTATT AGCATTCCTG GGTATTATTT ATGTGATCTA TGTTACTGTA
AAGCAACGGT ATACTAGAAT TAAGTATGGT TTTACTGTGC CTTCTTTAGC TCTGGAAATA
ATGCGTACCA TTTTTTACGC CTTTCTCATT GGCCTTTTTG TCTATCTAAT GAATGATTAC
CAGGGTATAC CTGTACCTGT CCTAATCGTA GTGGCGATGG CATTTATTTT TACGGGTTTA
GCAACGAAAA CTCGCTTCGG GCGTTATGTC TACGCAATTG GTGGTAACAG TGAAGCAGCA
CGTTTATCCG GTATTAATAT TCGATATAAC ATCCTGGCCG TTTTTGTTAT CAGTGGGTTA
ATGGCTGCCT TAAGCGGTAT CCTGTTAACT GCAAGATTAA ACGGTGCTTC AGTAGCTGCA
GGGCAAAATG CTGAGCTTGA TGCCATTGCA GCGTGCGTTA TAGGTGGTAC AAGTCTTATG
GGTGGTACAG GTAGTATTGG TGGGGCGATG ATAGGAGCAC TTGTTATGGC CAGTTTAGAT
AATGGCCTGA GCATGATGAA TACCCCGACC TTCTGGCAGT TTATAGTTAA AGGTTTGATT
CTTGTGCTGG CGGTATGGAT CGATATCGCA ACTAAAACAA GGGCTCAAAA TTGA
 
Protein sequence
MFNLIKTISP AETVQSKPKA NGYLSRIDIR AYTMILALLG IWAIFTYTTQ GAFLTSRNLS 
NLFRQMSITS ILAIGMVFVI VAGHIDLSVG SLMGLTGGVA AILQVWYGWQ TIPAIFISFL
IGLLAGLWQG WWVAYKKVPA FIVTLGGMMV FRGILIGISH GETVSPLMDS FKQIGQSYVP
ESTGFLLAFL GIIYVIYVTV KQRYTRIKYG FTVPSLALEI MRTIFYAFLI GLFVYLMNDY
QGIPVPVLIV VAMAFIFTGL ATKTRFGRYV YAIGGNSEAA RLSGINIRYN ILAVFVISGL
MAALSGILLT ARLNGASVAA GQNAELDAIA ACVIGGTSLM GGTGSIGGAM IGALVMASLD
NGLSMMNTPT FWQFIVKGLI LVLAVWIDIA TKTRAQN