Gene Moth_1157 details


Gene Information

Locus tag: Moth_1157
Symbol:
ID: 3833125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1189739
End bp: 1190752
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 60%
IMG OID: 637829088
Product: thiamine-monophosphate kinase
Protein accession: YP_430014
Protein GI: 83590005
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
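
The coordinates and lengths listed above are mutually consistent, which a short check can show. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Moth_1157.
length = gene_length_bp(1189739, 1190752)
print(length, "bp")                      # gene length in base pairs
print(protein_length_aa(length), "aa")   # protein length in residues
```

With the values from this record, the gene length comes out to 1014 bp and the protein length to 337 aa, matching the listed fields.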


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000470435
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGAACTTA AAGAAGTAGG GGAATTTGGA CTTATAGAAC GCTTAAAGGC CGGGGCCATA 
GTGGACCCGG CCAGGGTAGT TATGGGGATC GGTGATGACG CCGCCGTATT AAAGGGAGAA
CCTGGTTTCC TGACCCTGGC TTCGACGGAT ATGCTGGTGG AGGATATCCA CTTTACCTTG
GCCACAGCTA CCCCCCGGGA AATAGGTTAT AAAACCATGG CTGTTAATGT CAGCGATATT
GCCGCCATGG GGGGGATACC AGAACAGGCC CTGGTATCCC TGGCTTTAAG GCCGGAGCAG
CAGGTCGAGT TTGTGGACGA GCTCTACGCC GGCCTGCGGG AATGTGGCCA GCGTTTTGGA
GTAAATATAA TTGGTGGCGA CACGGTCTCT TCCCCCAGGG CCATGGTAAT CAACCTGGCC
ATATTGGGAC GGGTGGAAAA TGATGCCTGT CTCTACCGCC ACGGTGCCCA GCCCGGGGAT
ATTCTGCTAG TAACCGGCGA CCTGGGGGGC TCTGCCGCAG GTCTGGATAC CCTGCTCAGT
CCCCGCCCGG CCCCGGCGGA GGTCATCGCC TGGGCCCGGG CACGCCACTT CCGGCCTACC
CCGCGGGTAG TGGAAATCCG TGCCGCCCTG AAGGCCGGTG GGCTCACGGC GGCCGATGAC
ATCAGCGACG GCCTGGTAGC GGAGGTCTAT ACCCTGGCGA CGGCTTCCAG GGTGGGAATA
GTCCTGGAGG CCGGGGCCAT CCCCATCGCT CCGGCCACCC GGCAATTGGC GGCTATATAC
CATAAGGAGC CCCTGGATTA CGCCCTTTAT GGCGGCGAAG ACTTTGAGCT CCTGCTGGCC
TGCCGGCCGG ATAAAGTAGA CGCCGTTCGG GAAGCTGTGA ACCGGGCCTG CAGGACGCCG
GTGACGGTCA TCGGCAGGGT CGTCCCGGCA GAAGAGGGTA TTACCATAAA CCACAGCGGC
CGAGTGCTAC CCTTGACGCC AGGGGGTTAT AACCACTTCC AGCTGGATAG ATAA
 
Protein sequence
MELKEVGEFG LIERLKAGAI VDPARVVMGI GDDAAVLKGE PGFLTLASTD MLVEDIHFTL 
ATATPREIGY KTMAVNVSDI AAMGGIPEQA LVSLALRPEQ QVEFVDELYA GLRECGQRFG
VNIIGGDTVS SPRAMVINLA ILGRVENDAC LYRHGAQPGD ILLVTGDLGG SAAGLDTLLS
PRPAPAEVIA WARARHFRPT PRVVEIRAAL KAGGLTAADD ISDGLVAEVY TLATASRVGI
VLEAGAIPIA PATRQLAAIY HKEPLDYALY GGEDFELLLA CRPDKVDAVR EAVNRACRTP
VTVIGRVVPA EEGITINHSG RVLPLTPGGY NHFQLDR
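
The gene sequence begins with TTG while the protein begins with M. Under the bacterial genetic code (translation table 11), TTG can serve as an initiation codon, and an initiation codon is read as methionine. The sketch below illustrates this along with a GC-fraction calculation. It makes several assumptions: the sequences are pasted in the space-separated 10-character blocks shown above, the first codon is always an initiation codon and is translated as M, and the sense-codon assignments are those of the standard code (which table 11 shares). All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as M (assumed initiation codon)."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    residues = ["M"] + [CODON_TABLE[c] for c in codons[1:]]
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein  # drop terminal stop


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGAACTTA AAGAAGTAGG GGAATTTGGA"))
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.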