Gene Moth_0697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0697 
Symbol 
ID3832698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp726414 
End bp728000 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content63% 
IMG OID637828629 
ProductAIR synthase related protein-like 
Protein accessionYP_429559 
Protein GI83589550 
COG category[O] Posttranslational modification, protein turnover, chaperones
[S] Function unknown 
COG ID[COG0309] Hydrogenase maturation factor
[COG1992] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000561591 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000203748 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGGCA AAGTAGATGA CGCTTTCTTC CGGCAGGCTA TCCTGCCCCA TACGGGAGCA 
GGGGATCCTG AGGTGGTAGT CGGGCCGCGC ATGGGAGTGG ACGCGGCGGT ACTTAAAATA
GGGGAGGAGT ACCTGGCCGT CGCAGAGGAC CCCATTTTTC CTGGACCGAC GACTTCCCCC
GATGACTTCG GCTGGATCAC CGTCCATATC GGCGCCAGCG ACGTCGCCGT CATGGGTATC
AAGCCCCGTT TTATGACCTA TTCCCTCTTG CTGCCGCCGG GGACGCCGGA GGACTACATC
GCCGGGCTGG TCCGCAGCAT CAGCACCTAT GCCCGGGAGC TGGGCATTAC TATCGTCGGC
GGTCATACCG GTTTCTACGG GGCCGTGACC ATACCTACCA TTGGCGGAAT TACCGTCTGG
GGCCGGGGTC GGGAAGTAGT CACCCCCGCT GGGGCCCGGG TCGGCGACGC CGTAATTATT
ACTAAAGGGG CGGCCATTGA AGCGGCAGCC CTGGTGGCCT GCGAGCTGGG TGAGAAGCTC
CTGGCTGCCG GGATCTCCCC GGACCTGGTG GCAAGGGCTA AAAAGCGGTT GCGGGAGATG
TCCGTGGTGG CTGAAGCCGG TATTGCCGTA GAAGTCGGCG GTGTACATGC CATGCACGAT
GCCACGGAAG GGGGCCTGGC GCGGGGCCTC TGGGAGGTGG CCGAAGCTTC CGGTGTGGGT
TTAAGGATCG AACGCGCCCG GGTACCGGTA CCTGCCGATA TCCGAGCAGT TTGTGACTAT
ATTGGCCTTA ACCCTTACGA AGTAATCAGC GAGGGCACCC TGGTGCTCAC CTGCGCGCCG
GAAAAGGCTG ACGCCATGCT GGCGGCCTTT AAAGAAGCCG GCATCGAAGC GGCGGTTATC
GGCCGGGTAG TACCAGCGGG CGCAGGCCGC GCCTGGCTGG AGGATGACGG CCGGGAAGAG
CAGCTCCTGC CGCCGGCGGT GGACCGCTTC TGGGAGGTCT TTTTTAACGC CCTGGCCCTA
AAAAACGATA CCCGTACTCC GGCGGAAGTG GCCCTGTGCC GGGAACTGGG ACAGGCCGTC
AGGGAGCTCG AGGAAGCTAA CGTTGCCGCC CTCATCCCCG AGATCGGCGC CAACCTGGCC
TATTGCTTGC CGGAGGCAAA AGAACTCCGG GACATCGCTG CTATACCCGG CCGCCTGCTG
CGTTTTAAGG GGCGAGTGGC AACCCTGGGT GAGCCGGAGA TGGGCTGTTC CCACCACATG
GGCGGCACCA TCCTGGTGGT GCGGGAGTTC TTCCCGCAGG CACGCTGCGT CATCAACCTC
CGCAACAACG CCCGGGTGCG TCAGGCCTGC GCCGATCTGG GTTATAAGGT TGTCAGCATG
CCCGTGCCCC CGGACTACCG CCAGACGGAC GATGATTTCT ATACCGACCT GCGCCGGACC
ATGGCGGCCT GCCGGGAACT TCCTGACGTA ATTGAAATAC CCGATCGCAT CAACCTGGAG
CGCCTCATCC TGGTCCTGGG CCGGAACCCC GGTGAAATCG TCAGCAAGGT AACCTCCCTG
GCCACCAGGG TGGCGGAATT GGAGTAG
 
Protein sequence
MIGKVDDAFF RQAILPHTGA GDPEVVVGPR MGVDAAVLKI GEEYLAVAED PIFPGPTTSP 
DDFGWITVHI GASDVAVMGI KPRFMTYSLL LPPGTPEDYI AGLVRSISTY ARELGITIVG
GHTGFYGAVT IPTIGGITVW GRGREVVTPA GARVGDAVII TKGAAIEAAA LVACELGEKL
LAAGISPDLV ARAKKRLREM SVVAEAGIAV EVGGVHAMHD ATEGGLARGL WEVAEASGVG
LRIERARVPV PADIRAVCDY IGLNPYEVIS EGTLVLTCAP EKADAMLAAF KEAGIEAAVI
GRVVPAGAGR AWLEDDGREE QLLPPAVDRF WEVFFNALAL KNDTRTPAEV ALCRELGQAV
RELEEANVAA LIPEIGANLA YCLPEAKELR DIAAIPGRLL RFKGRVATLG EPEMGCSHHM
GGTILVVREF FPQARCVINL RNNARVRQAC ADLGYKVVSM PVPPDYRQTD DDFYTDLRRT
MAACRELPDV IEIPDRINLE RLILVLGRNP GEIVSKVTSL ATRVAELE