Gene Moth_0614 details


Gene Information

Locus tag: Moth_0614
Symbol:
ID: 3832589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 639974
End bp: 640969
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 46%
IMG OID: 637828555
Product: inner-membrane translocator
Protein accession: YP_429487
Protein GI: 83589478
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components
TIGRFAM ID:
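
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the Start bp and End bp coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def coordinate_span(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(gene_length_bp: int) -> int:
    """Amino acid count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields.
print(coordinate_span(639974, 640969))  # compare with Gene Length
print(protein_length_from_cds(996))     # compare with Protein Length
```

Under these assumptions, both results should agree with the Gene Length and Protein Length fields listed for Moth_0614.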


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGGAG CGACCAATAT TTCGAATAGC GCTAAGTTGA AGTATAGCCA TCTGAATAGC 
ATTGTTAAGG AATATAGTTT CATCTTTGTA TTCCTTGCTT TATGTATACT ACTCTCTATA
CTGGTTCCGA CTTTCCTCTT GCCCCAAAAT TTGTTAAATG TATTGATTCA GATCTCTATC
AACGCCCTGT TAGCTATAGG TATGACCTTT GTCATTATTT CCGGTGGCAT TGACCTTTCA
GTGGGTTCAG TAGCAGCACT GGCAGGTATT GTCGTTACGG CTTTGCTTAA ACAGTACCCG
TCCAGTACGC CGATGATGTA TGTAATAATT ATTTTTAGTG TCCTGGCTGT GGGTATAGTC
TGTGGCGGTA TCTCCGGCCT GGCAATTGCG AAACTTAATG TCGAACCCTT TATAGCCACC
CTCGCTATGT TGAGCATCGC CAGGGGGTTT GCCTTCGTTT ACACCCAGAG CAAACCAATT
TTCGGCTTGC CCCCGGCCTT TAGTTGGATT GGTCAGGGGT ATATTGGCCC TATCCCGGTT
ATCGTGTTGA TTATGATTTT TTGCCTGGTT ATCGCCCACA TTGTCTTATC AAAAACCTGT
TTTGGACGTT ATATTTACGC CATCGGGAGC AACGAAGAAG TGGCTAAATT ATGCGGTATT
AACGTTGCCC GGGTGAAGCT TATTATTTAT GTAATCAGCG GCGTCCTTTC TGCTCTGGGG
GGAGTCGCTC TGGCGTCCCG TTTAGCAACA GGGCAACCGG CTGCCGCCAG CGGTTACGAG
CTCAATGCGA TCGCAGCGGT TGTGCTGGGG GGTACCAGCC TTTCCGGAGG TAAGGGCAGT
ATTGGTAAAA CCATTATCGG CATTATGACC ATTGGCGTTA TTAACAACGG TTTAAGCCTG
TTGCAAATCT CCTCTTACTG GCAGTCCATT ACCATGGGTT TAATCATTAT GATTGCCGTA
ATACTGGATA AAATCAACAC CCGTAAGAAA GCCTGA
 
Protein sequence
MNGATNISNS AKLKYSHLNS IVKEYSFIFV FLALCILLSI LVPTFLLPQN LLNVLIQISI 
NALLAIGMTF VIISGGIDLS VGSVAALAGI VVTALLKQYP SSTPMMYVII IFSVLAVGIV
CGGISGLAIA KLNVEPFIAT LAMLSIARGF AFVYTQSKPI FGLPPAFSWI GQGYIGPIPV
IVLIMIFCLV IAHIVLSKTC FGRYIYAIGS NEEVAKLCGI NVARVKLIIY VISGVLSALG
GVALASRLAT GQPAAASGYE LNAIAAVVLG GTSLSGGKGS IGKTIIGIMT IGVINNGLSL
LQISSYWQSI TMGLIIMIAV ILDKINTRKK A
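
The sequences above are displayed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a GC content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes the sequence contains only unambiguous bases (A, C, G, T).

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGAATGGAG CGACCAATAT TTCGAATAGC GCTAAGTTGA AGTATAGCCA TCTGAATAGC"
seq = clean_sequence(first_line)
print(len(seq))          # number of bases in this line
print(gc_percent(seq))   # GC percentage of this line only
```

Passing the full gene sequence block through `clean_sequence` gives a string whose length and GC percentage can be compared with the Gene Length and GC content fields of the record.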