Gene Moth_2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2025 
Symbol 
ID3831400 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2113215 
End bp2114744 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content61% 
IMG OID637829954 
Productxylulokinase 
Protein accessionYP_430864 
Protein GI83590855 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID[TIGR01312] D-xylulose kinase 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTATC TTCTCGGGAT TGATATCGGT ACTTCGGGCA CCAAAGCCCT CCTGGTGGAG 
GAAACCGGCA GGGTTGTAGC TTCTGCCTAT AAAGAGTATC CTTTGAGCCA GCCCCGGCCG
GGGTGGGCCG AGCAGGATCC GGAGGAGTGG TGGCGGGCCG TGGTGGAGGC GGCCCGGGAA
GTCCTCGCCC GGAGCGGTCT GGCAGGCGGC GATGTTGCCG GCGTGGGTCT TTCCGGCCAG
ATGCATGGGG CGGTAGTTCT TGACGCTAAT TACAGGGTTT TACGGCCGGC TATCCTCTGG
TGCGACCAGC GAACGGGAGC AGAGTGCGCC TGGATGTACG AAGAAATAGG GCAAGAAAAA
CTCTACCGGT GGACGGGAAA CCCGGTCCTG CCAGGCTTTA CGGCGCCCAA GCTGGTCTGG
CTTAAGCGCC ATGAACCGGA AACCTATAGC AGGATACGCC ATGTCCTGTT GCCCAAGGAT
TATATTCGCT TCCGCCTGAC AGGGGAACTG GCTACCGAGG TATCTGACGC TTCCGGAACC
CTGCTCCTGG ATGTGACCCA CCGGTGCTGG TCCGGTGAGA TACTGGCCGC CATGGGCCTC
CCGGAGGAAT GGCTGCCCCG GGTATATGAA TCGCCGGAAG TAACCGGACG CATAACCCCT
GAGGCCGCTG CCCTCACCGG TCTAATGGCG GGAACGCCGG TGGTAGGCGG AGGCGGCGAC
CAGGCCGCCG GGGCCATAGG TACAGGAGTC GTAGTAGAAG GAATTATTTC CACTGCCCTG
GGTACGTCGG GTGTAGTTTT CGCCATGACC GATCGGCCCC ACACCCAGCC GGGGAGCGCC
CTGCATTCCT TCTGCCATGC CGTGCCGGGC AAGTGGCACC TGATGGGTGT GATGCTGGCC
GCCGGGGGAT CCTTGCAGTG GTTCCGTAAT CAACTCGGTA GCGAGGAGGT GCGGGAAGCC
GCCGTAAAGA ATATCGATCC CTACGAAATG TTGACAGAAC AGGCGGCTGA GGTCGGCACC
GGCGCAGGAG GATTGCTGTT CCTCCCTTAC CTTCTGGGAG AAAGGACGCC CTACCCTGAT
CCGGCGGCTA GGGGCGCTTT TATCGGCCTG ACCATGCGTC ACCGGAAAGG CCATCTGGTA
CGGGCCGTCA TGGAGGGGGT TGCTTTCGGC CTGAGGGACT CCTTGGAGCT GCTCCGGCAG
GCCGGGGTGA AAGTAGAGGA AATCCGGGTT TCAGGCGGCG GCGCCAGAAG CCCCCTGTGG
CGCCATATCC TGGCCAGTAC CTTTAAGTAT CCAATAACCA CGGTGAACAG CACCGACGGT
CCGGCCTTCG GCGCCGCCCT GCTGGCTGGC GTGGGGGCGG GCATTTATCC TTCCGTCGAG
GAAGCCTGCA GGTTGACCAT TAAAGTGACC AGCCGGGTGG AACCGATAGC ACCCGAAACA
GATGCCTACG ATAGGCTGTA TGAAATATAT AAATCCCTTT ATCCTCTCCT GCGCGACACG
ATGCACGATC TGACAAAATT TACGGAGTGA
 
Protein sequence
MPYLLGIDIG TSGTKALLVE ETGRVVASAY KEYPLSQPRP GWAEQDPEEW WRAVVEAARE 
VLARSGLAGG DVAGVGLSGQ MHGAVVLDAN YRVLRPAILW CDQRTGAECA WMYEEIGQEK
LYRWTGNPVL PGFTAPKLVW LKRHEPETYS RIRHVLLPKD YIRFRLTGEL ATEVSDASGT
LLLDVTHRCW SGEILAAMGL PEEWLPRVYE SPEVTGRITP EAAALTGLMA GTPVVGGGGD
QAAGAIGTGV VVEGIISTAL GTSGVVFAMT DRPHTQPGSA LHSFCHAVPG KWHLMGVMLA
AGGSLQWFRN QLGSEEVREA AVKNIDPYEM LTEQAAEVGT GAGGLLFLPY LLGERTPYPD
PAARGAFIGL TMRHRKGHLV RAVMEGVAFG LRDSLELLRQ AGVKVEEIRV SGGGARSPLW
RHILASTFKY PITTVNSTDG PAFGAALLAG VGAGIYPSVE EACRLTIKVT SRVEPIAPET
DAYDRLYEIY KSLYPLLRDT MHDLTKFTE