Gene Moth_0656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0656 
Symbol 
ID3832143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp686504 
End bp688486 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content53% 
IMG OID637828597 
Producthypothetical protein 
Protein accessionYP_429527 
Protein GI83589518 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000140636 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000360107 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAA GAATAGTTTC CCTTTTGATG GCAGCAATTC TACTTTTGTT GTTGCCAGTC 
ACCCCTTCGG CCGCAACCCC CCAATTCACC GACATCCAGA ACCACTGGGC GAAAGACTAC
ATCCTTAGTT TTGCCAATAA AGGTTTTGTC AAGGGCTACC CGGACCAGAC CTTCAAACCC
GACAGGCCAA TCAGCCGGGC AGAGTTCACC TGCATACTGC TCAACTGCCT GGGTATCACC
CCGGCGTCAG ACGTTAACAC ACCGACCTTT AGCGATACCA CCAACCACTG GACCAGGGCC
CAGATCGCCG AAGCGGTGCG GAGGGGGATC CTGGTAGTGA GCGAATACCC CGGTGGGCTC
AAACCTGATG ATCCCATATA TCGCAGTGAA GCTGCCGCTA TGATGATTAG AGCCCTGGGG
AAGAGTCCCG ACATGACCCC AACCTCTTTT AAAGACAGCA ACCAGATTGC AAAGAGCATG
TACCGGGGCT ACATCAAGGC AGCCTCCAGC GAAGGGCTGA TGCACGGCTA CCCGGATGGA
ACGTTCCGTC CCTTCCAGGG CGTGAAAAGG GGTGAGGCCT GTGCCATGCT GGTCAATCTG
CTAGGCAAGA TCGGTACCGC CTCTCCTCCT GCCGTCCAGG TCAACCCATC AAGCAACAGT
GCCCTGTCCG CCCTGGTAAT CCAGGGCAAC CGCTACAAAC TGGGGGACAC TGTCGTTTAC
CTCAAGCGGG ACTCGACTAA CATACCCATA TACTCCCTCA GCGTGGCGGG GGGCTTAGTC
TTCATCAACA ACACCTTCAC TTACCCTCTA AACAGCACCG ACAATAACCC GGATCTGGTC
GTCAACAATA CCCGCTATGT CCAATGCCGG CTCAGCGTAA GCGGCAGCGA TTTACAGGTC
ACCCCGGGCG CCGTGAAGCT GGATTCCATC TCATATAACG GCTACAAGTA TAACGCCGAT
TACGTGAAGC TCTATATCGG CAACAAAAAC GGCAGCTACT ACCTGTCCGA CGCCGAACTG
GTAGATCGGC AAACCATCAG GGTCGGCGGT AACAGCTATG ACATCGGCAG CACCCCGGTA
TCCATCGCCC TGGGGGATAA TTTCTATGCC ATCAACGGGA TCAATTACGA CAGCAGCAGT
ATCAGCCTGG ACCTGGCAGC CACCACCCCG GTGGTGACGA ACGGCCTTGA TATCTCGGAT
ATCTCTGCCA TCTTCGTGGA TACCAGGTCC CTGGACCTCA ATACCATCAG CAGCCTCTTT
TTCATTATTG ACGGCAGCAG GTATGACCGC TCGGAAGTGG TCATCGACGC CTCCGGCAAT
TTTACGGCTA ATAACAAGTA TTATACCCCC GATCAGGTTA CCATGGTTAT AAACAACAGC
TTCTATAAGC TAACCGACGT CAAGTCCTTC GGCGGCAAGT TTATATTCTA CTGCACCGCC
AGCAATGTGA CCACCTGGGC AATAGTTAAT GATAAATACC AGGATGCCAG CACGATCCAG
ATCCTGATGG GGAACAACAT TTACACCCTG GACAAGATCC TGGTGGTGCA GCATAACGTG
ATCCGCATCG GGGGGCGCCA ATATAAACTC GGCGACATAT TCGGCTGCCG GATTAACGGA
ACATTGTACG ACATCGAAGA TATCGATTAC GACAACAGCC TGGATCTGGT AACGATGGAC
GTTACCGAAT CCACCGGCAG TTGGACCGGC TACCTCCCCG GCCAGCCCCA AAAGTACTTA
TTCTACGTCG ATAACTCAAT CTATCAAGAC GGCGCCACCG GCAATGTCAC CATTTACGCC
GGCGGAGGCT GGAGGACATT TGACAGCATC ACCTTCTCCG ATCAGTCCCA TTTCGTATAC
GACAACACGA CTTATAACCT GTTGGGAGCA GAGATAAAGA TTGGAGACAC CGTTTTTACG
GTCGTCGATT CCGCCTGGCG GGTCAGCTCT CAGGTTATGG AGGTGTACCT GCAGAAGGCC
TGA
 
Protein sequence
MKKRIVSLLM AAILLLLLPV TPSAATPQFT DIQNHWAKDY ILSFANKGFV KGYPDQTFKP 
DRPISRAEFT CILLNCLGIT PASDVNTPTF SDTTNHWTRA QIAEAVRRGI LVVSEYPGGL
KPDDPIYRSE AAAMMIRALG KSPDMTPTSF KDSNQIAKSM YRGYIKAASS EGLMHGYPDG
TFRPFQGVKR GEACAMLVNL LGKIGTASPP AVQVNPSSNS ALSALVIQGN RYKLGDTVVY
LKRDSTNIPI YSLSVAGGLV FINNTFTYPL NSTDNNPDLV VNNTRYVQCR LSVSGSDLQV
TPGAVKLDSI SYNGYKYNAD YVKLYIGNKN GSYYLSDAEL VDRQTIRVGG NSYDIGSTPV
SIALGDNFYA INGINYDSSS ISLDLAATTP VVTNGLDISD ISAIFVDTRS LDLNTISSLF
FIIDGSRYDR SEVVIDASGN FTANNKYYTP DQVTMVINNS FYKLTDVKSF GGKFIFYCTA
SNVTTWAIVN DKYQDASTIQ ILMGNNIYTL DKILVVQHNV IRIGGRQYKL GDIFGCRING
TLYDIEDIDY DNSLDLVTMD VTESTGSWTG YLPGQPQKYL FYVDNSIYQD GATGNVTIYA
GGGWRTFDSI TFSDQSHFVY DNTTYNLLGA EIKIGDTVFT VVDSAWRVSS QVMEVYLQKA