Gene Moth_0632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0632 
Symbol 
ID3832530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp655898 
End bp657313 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content44% 
IMG OID637828574 
Productgeneral substrate transporter 
Protein accessionYP_429504 
Protein GI83589495 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000142658 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTTAG GTGAATTATG GGGGAATTTT TATATGGACC AAGTAAGAAA TAAAATTTTG 
GATTCCGGCA TGCGCGGAAG TTCTATAACC AATCAATCCT CACAACTCAT TACACGAATA
GAAAGCGTTC CTTTTTCGCG CTGGCACATC AAGCCGCGTG TGATTATGGG CAGCGCTACC
TTTTTTGATG CATTTGACAC CTTATCCCTC TCATATGCTA TGCCGGTATT AATAGGGCTA
TGGCATTTGA ATCCAGGCCA AATCGGAATA CTTATTGGCA TTGGATATCT TGGACAGGCT
ATAGGTGCGC TATTGTTCGG ATCGATTGCC GAGCGTTTTG GCCGCGTTTT TAGTGCGAAA
TGGGCCACTT TAGTGATGTC TATAATGGCT ATTGCTTGCG CCTTTGCAGG AAATTACAAT
GAGTTGGTGG CACTGCGTTT TATACAGGGA ATCGGTGTTG GTGGCGAAGT TCCTGTAGCA
GCCGCCTATA TTAATGAAAT TTCTCGTGCT TCTGGTCGGG GTCGTTTCTT CATGCTTTAT
GAGATGGTTT ATCCTATTGG ATTGATGGTA ACTGCCCAGC TTGGGACCAT TATTGTACCA
AGCCTGGGGT GGAAATGGAT GTTCTTCATA GGCGGTGGAA CAGGCATAAT CATTGTTCTA
CTTATGAATT TGCTGAAGGA ATCACCTCGC TGGCTTATTT CCAAAGGACG GTTCGAGGAG
GCCGAGCGCA TAATTGAAGA GATTGAGGCA AGCACCGACC AACGCATACC TGTCAATATT
AAGGGAACTC AGGAGGCAGT TAAAGGTAAC TGGAAGGAGT TATTCTCACC ATTCTACCGG
GGGAGGACAA TAGTCGTTTG GATGTTATGG TTTTCAACAT ATTTTGTTTC AAACGGCCTG
AATAACTGGT TACCCAGTCT GTACAAGACA GTCTATAAAC TTCCCCTACA GACTTCTTTG
CGGGTAGCAT CGCTTACAAA CCTTATCCAA ATAGTTGCTG TATTTGCATG TGCGATGCTA
ATTGATAAGG TAGGCCGTAA ATTATGGGCA ACTATAGCAT TTCTCGTGGC TAGTTTGCTT
CTTGGAATAC TTTGGATAAA CGGTGCAGCG ACTGCCTACA GCGTCATGTA CCTTGGGTCG
TTAGCTTATG GCGTCATTGG CACGGTAACG GTTCTGCTTT ATTTGTATAC TCCGGAAATT
TATCCAACCA GGATGCGAGC AGTTGGAACA GCATTTGCTA CTACATGGTT GCGTCTCGCA
TCAGCAATTG CTCCTACCAT AGTAGGATTT ATTTTAGGGA CTAGAGGGAT TTCCAAGGTT
TTTGCACTAT TTGCATGTGT TAGCGTTATT GGTGCTTTTA TGGCTATCCG GATGGTTGAA
ACGAGGGAAA AGATGTTAGA AGAGATTGCA CCCTAA
 
Protein sequence
MNLGELWGNF YMDQVRNKIL DSGMRGSSIT NQSSQLITRI ESVPFSRWHI KPRVIMGSAT 
FFDAFDTLSL SYAMPVLIGL WHLNPGQIGI LIGIGYLGQA IGALLFGSIA ERFGRVFSAK
WATLVMSIMA IACAFAGNYN ELVALRFIQG IGVGGEVPVA AAYINEISRA SGRGRFFMLY
EMVYPIGLMV TAQLGTIIVP SLGWKWMFFI GGGTGIIIVL LMNLLKESPR WLISKGRFEE
AERIIEEIEA STDQRIPVNI KGTQEAVKGN WKELFSPFYR GRTIVVWMLW FSTYFVSNGL
NNWLPSLYKT VYKLPLQTSL RVASLTNLIQ IVAVFACAML IDKVGRKLWA TIAFLVASLL
LGILWINGAA TAYSVMYLGS LAYGVIGTVT VLLYLYTPEI YPTRMRAVGT AFATTWLRLA
SAIAPTIVGF ILGTRGISKV FALFACVSVI GAFMAIRMVE TREKMLEEIA P