Gene Moth_2242 details


Gene Information

Locus tag: Moth_2242
Symbol:
ID: 3831288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2342228
End bp: 2345674
Gene Length: 3447 bp
Protein Length: 1148 aa
Translation table: 11
GC content: 53%
IMG OID: 637830162
Product: hypothetical protein
Protein accession: YP_431072
Protein GI: 83591063
COG category:
COG ID:
TIGRFAM ID:
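
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against one another. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2342228
end_bp = 2345674
gene_length_bp = 3447
protein_length_aa = 1148

# Assumption: coordinates are 1-based and inclusive, so the span
# covered by the gene is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so the protein has
# one residue fewer than the number of codons.
codon_count = gene_length_bp // 3

print("span matches gene length:", span_bp == gene_length_bp)
print("gene length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop matches protein length:",
      codon_count - 1 == protein_length_aa)
```

Under these assumptions all three checks print `True` for the values listed in this record.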


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.297499
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAACC GGGATATTTT CCAGCGTGAC CCTGTGGTTT CAAAACTGCT TAATGACGGC 
GTGGCTACAG TTACTGAAGC GGCCACATCC AAAGAGATTG AAACCCTTCG CTATGAACTG
GAGCATTTTG TTTGCGAGGG GCAGTATAAA GACGGACTTA TACGCATTCT TGAGTCTTAC
TTGAGCAATG CGGATGCTAC AAGCCAGCCG GCGGTCTGGG TAAGCGGTTT TTTTGGCAGC
GGTAAATCAC ATCTTCTGAA GATGCTCCGT CACTTATGGG TTAACACCAA ATTTGAATCG
GATGGAGCGA CTGCCCGTGA ACTGGCAAGG CTACCTGTGG AAGTAAAAGA CCTGTTAAAA
GAATTGGATA TATTAGGGAA AAGATGTGGC GGGCTGCATG CTGCGGGGGG AACGCTGCCA
TCCGGGGGAA GGAAAAGTCT TCGTCTGGCA GTGCTCAGCA TTATTTTTCG CTCGAAAGGT
CTGCCCGAAT CTCTTCCCCA AGCACAATTT TGCCTTTGGC TTCAAAAAAA TGGCATCTAT
ACTCGGGTTA AAAAGGTTGT GGAAGATTCA GGCAAGGTTT TTCAACAGGA ACTCCGTCAT
TTGTATGCCA GTCCGGTATT AGCCAGGGCC CTGTTGGAGG CTGACCCTGA TTTTGCCTCC
GATCTCAAAC AGGTGCGGGC CACCATCCAG GCCCAGTTTC CTGACGTAGA CGACGTCTCG
ACATCCGAAT TTATCCAGAT CATTTGTGAT GTCCTTTCTG TAAACAGGCA ACTTCCCTGC
ACGGCGATTG TACTCGACGA AGTCCAGCTC TTTATAGGTG ACAGTACGGC TCGTTCCTAT
GAAGTACAAG AAGTAGCGGA AGCCCTTTGC AAGCAGCTTG ATAGCCGGAT ACTGCTTATA
GGGGCTGGCC AGACTGCTCT GAGCGGCAAT CTTCCGCTGC TCCAGCGCTT GAAAGATCGT
TTTACCATAC CTGTGGAACT CTCTGATACG GATGTTGAAA CTGTAACCCG CCGGGTGGTG
CTGGCTAAAA GGGCAGATAA GCGCAAGGCC ATCGAAGAAA TATTAACTGC TTACGCTGGT
GAAATTGATC GACAGCTTGC CGGTACCCGC ATTGCACCCA GAAGTGAAGA CCGGGCGATT
ATCGTCGAAG ACTATCCCCT TTTACCTGTC CGCCGCCGGT TCTGGGAGCA TGTCCTGCGA
GCTGTTGATA TTCCTGGAAC CAGTAGCCAG CTGCGCACGC AGCTCCGTAT TGTTCATGAT
GCTGTCCGGG AGATTGCTGA AAAACCCCTG GGTACAGTCG TGCCCGCTGA TTTCATATTT
GACCAGCTAC AGCCGGACTT GCTTCGCACC GGGGTTCTTT TACGGGAAAT CGATGAAACC
ATTCGCAATC TGGACGATGG GACTCCGGAA GGTTTATTGG CCAGGCGTAT TTGTGCCCTG
GTATTCCTCA TCCGCAAACT CCCCCGTGAA GCCGTCGTTG ATATTGGTAT CCGTGCTACC
GCGGAGACGC TGGCCGATCT TCTGGTAAGC GACCTGGCCG GCGATGGTCC CGCCCTGCGC
AGGGAGATTC CCCGGGTTTT AGAGAAACTG GTTAAGGAAG GAAAGCTTAT CAAGGTGGAC
GAGGAATACA GCCTTCAGAC CCGGGAGAGC AGCGAATGGG ACCGGGAATT TCGTAACCGG
CAAACCCGGT TGAACAATGA TCTTGCGGCT TTGGCCAGTA AACGTTCGGC CCTGTTGAAT
TCGGCCTGTA TGACCGCCCT TGGCAACATT AAACTCATCC AGGGAAAGGA CAAAGTGCCG
CGTAAGTTGG CAATTCACTT TGGCAGCGAG CCGCCCGAAA CTAAAGGCCA CGAAATTCCA
GTTTGGGTCC GCGACGGCTG GGGAGAAAAC GAGAATACCG TGGTGGCCGA TGCCCGGGCC
GCCGGCAACG ATAGCCCGAT TATTTTCGTC TATATCCCCA AAGCAAATGC CGAGGACCTG
CAAAGAACAA TTATTGAATA TGAGGCGGCT AAAGCGACCC TCGAATTCAA GGGAACCCCT
ACAAATCCAG AGGGGCAGGA AGCCCGCGAC GCGATGTCCA CACGTATGAA GACGGCCGAG
GCTACCCGGG ATGAGATTAT TAAAGAGGTT ATTAATGCCG CCAGGGTGTT CCAGGGTGGG
GGCCAGGAGC GTTTTGAGCA TTCCATGAAA GAAAAAGTGG AGGCAGCCGC CGAAGCGTCC
CTGGACCGTC TCTTTCCCCA CTTCCGGGAT GGTGATGATG ATCAATGGCC GGCTGTTATC
AACCGGGCCA AGAGCGGCGA CGAAGCCGCC CTTCAGGCCA TAGGCTGGAA TGATGCTCCC
GAAAAACACC CTGTCTGTGC GGCCATTCTT GCTGAAATCG GATCCGGTAA GACAGGCAAG
GAAATCAGGG ATATCTTTAT GAAAACACCT TATGGTTGGG GGCAGGATGC TGTCGACGCC
GCTTTGATTA CGCTTTTTGC CACCGGGCAT CTTCGCGCTG TATATAAAGG GGTCCAGCTT
GACCGTGGCC AATTAGATCA AGCTAAAATC CCGGCTACCG ACTTCCGGGT GGAAACGGTA
ACCATAGATG TCCATAGCCG CATGAAGCTG CGCAAGCTTT TTCAAACAGC TGGCTTTAAC
TGCAAGGCGG GGGAGGAATC TTCCGTAGCC GGGCAATTTC TTGCCAAGCT TATGGACCTG
GCCGATCGTG CCGGCGGCGA TCCACCAATG CCTGAGCGCC CATCTAAGAC CCATTTGGAA
ACAATGCGGG GACTAGCTGG TAACGAACAA CTTGCAGCGA TACTGGAACA GTTTGACACC
CTGGCCCAAC AACTGCAGAA TTGGTCAGCA TTGGCGGACT TGGCGGCAAA GCGCAGACCT
GCGTGGGATA GACTTCAGAT CCTGTTGAGA CACGCCCATG GCCTTCCTGA AGCGGAAGAT
CTACAGAGTC AGGCCCATGC TGTGCGCGAC GAACGGCGCC TGTTAGCAGA GCCCGATCCG
GTTCCGGACA TTTACCAGGC AGTCGCCAGG GTGCTTCGTA CCGCTGTCAG GCAGGCTTAC
GCCAACTTTG AAATGGTATA TAACCGGGAG AAAGCGGCAC TGGAGGCAAA TGCGAACTGG
CAGAAACTAT CTCCGGAACA GCAGCAAAAA ATTCTTACGG CTGAAGGGAT TGCCAGCGTT
CCCCGGCTTT CCATAGAAGA TGACGACGCC TTGCTTCAAA GCCTCCAGGA AACCCCTCTG
TCAGGCTGGA AAACCAGGAC AGATGCCCTG CCCCAGCAGT TCAGTAATGC GGCCATGGCA
GCTGCGAAGC TACTGGAGCC AAAAACGCAG AAGGTAAAAC TGACCAGCAG CACCTTGAGA
ACGAAAGACG AAGTCAGGGC CTGGGTAGCC GGAGTTGAGA GAGAGCTTTT AGAAAGGATT
AAAAACGGCC CCGTCGTGAT CTCGTAG
 
Protein sequence
MKNRDIFQRD PVVSKLLNDG VATVTEAATS KEIETLRYEL EHFVCEGQYK DGLIRILESY 
LSNADATSQP AVWVSGFFGS GKSHLLKMLR HLWVNTKFES DGATARELAR LPVEVKDLLK
ELDILGKRCG GLHAAGGTLP SGGRKSLRLA VLSIIFRSKG LPESLPQAQF CLWLQKNGIY
TRVKKVVEDS GKVFQQELRH LYASPVLARA LLEADPDFAS DLKQVRATIQ AQFPDVDDVS
TSEFIQIICD VLSVNRQLPC TAIVLDEVQL FIGDSTARSY EVQEVAEALC KQLDSRILLI
GAGQTALSGN LPLLQRLKDR FTIPVELSDT DVETVTRRVV LAKRADKRKA IEEILTAYAG
EIDRQLAGTR IAPRSEDRAI IVEDYPLLPV RRRFWEHVLR AVDIPGTSSQ LRTQLRIVHD
AVREIAEKPL GTVVPADFIF DQLQPDLLRT GVLLREIDET IRNLDDGTPE GLLARRICAL
VFLIRKLPRE AVVDIGIRAT AETLADLLVS DLAGDGPALR REIPRVLEKL VKEGKLIKVD
EEYSLQTRES SEWDREFRNR QTRLNNDLAA LASKRSALLN SACMTALGNI KLIQGKDKVP
RKLAIHFGSE PPETKGHEIP VWVRDGWGEN ENTVVADARA AGNDSPIIFV YIPKANAEDL
QRTIIEYEAA KATLEFKGTP TNPEGQEARD AMSTRMKTAE ATRDEIIKEV INAARVFQGG
GQERFEHSMK EKVEAAAEAS LDRLFPHFRD GDDDQWPAVI NRAKSGDEAA LQAIGWNDAP
EKHPVCAAIL AEIGSGKTGK EIRDIFMKTP YGWGQDAVDA ALITLFATGH LRAVYKGVQL
DRGQLDQAKI PATDFRVETV TIDVHSRMKL RKLFQTAGFN CKAGEESSVA GQFLAKLMDL
ADRAGGDPPM PERPSKTHLE TMRGLAGNEQ LAAILEQFDT LAQQLQNWSA LADLAAKRRP
AWDRLQILLR HAHGLPEAED LQSQAHAVRD ERRLLAEPDP VPDIYQAVAR VLRTAVRQAY
ANFEMVYNRE KAALEANANW QKLSPEQQQK ILTAEGIASV PRLSIEDDDA LLQSLQETPL
SGWKTRTDAL PQQFSNAAMA AAKLLEPKTQ KVKLTSSTLR TKDEVRAWVA GVERELLERI
KNGPVVIS
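
The two sequences above can be spot-checked against each other by translating the first and last lines of the gene sequence and comparing the result with the start and end of the protein sequence. This is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (the tables differ in which start codons are permitted). `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing between blocks
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First and last lines of the gene sequence, copied from this record.
first_line = ("ATGAAGAACC GGGATATTTT CCAGCGTGAC "
              "CCTGTGGTTT CAAAACTGCT TAATGACGGC")
last_line = "AAAAACGGCC CCGTCGTGAT CTCGTAG"

print(translate(first_line))  # MKNRDIFQRDPVVSKLLNDG
print(translate(last_line))   # KNGPVVIS
```

Both outputs agree with the corresponding ends of the protein sequence listed above. The last line can be translated on its own because every preceding line holds 60 nucleotides, which keeps it in frame.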