Gene Moth_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2237 
Symbol 
ID3831283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2334040 
End bp2336394 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content57% 
IMG OID637830157 
Producthypothetical protein 
Protein accessionYP_431067 
Protein GI83591058 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00266274 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTAGCA GGACGAGTGA AATGTTGGCC AATACTCTGC TGGAGGCTGT GCGCGATTCA 
TTGCTTCGAG CCGGCCGGTA TGACTCTGCG ACCATTGTCC CGCCGGCAGC GATTCTCTGG
ACCGATGCCG ACGGCCAGTG GCAGCCCCTG GTATCCCAGT TGCGCCCGCT TATGCCGGAG
CTTTTAACCC TGGGAGATTA CAACCCGGAA GAAAAAACCG GGCCGGCAAT CTGGCTGCGG
TGTGTTATTG AACGAATGCT ACCTGATGTT GAACTACCGG ATAAAGCCAT TCCTATCATT
TACCTGCCCA ACGTTAGCCG GCAGGTGCTC CGGGCCGGTG AGGAGTGCCC GGAAAGCCTC
AAACCGCTGG TGGAACTGCA GTACCGCGGG ACGGTTTGGA CCCAGCGCAA CGGTAAAGAC
TGGACAGTTG AGGCGTTTCT TGTCTCTGAA GATGGCCTGG GCCTGGATGT GGCTAAAGAT
AAACAGACCC GCCAGGCGAT GCTTCGGGCA CTACCCCAAC TGGCCACCGC TCCTGTCGCA
CGCCTGCGGG GCAAACGCCT GGAGGCCGAG GATTTTGACA GGTTGATGGT GGCAGATACC
CAGCGGGACT TGCTGTCATG GCTCAGCGAC CCGGCTCGTA CCCGCGAAAA ATGGGGTGAA
GAGAAGTGGA TGGCCTTTTG CTCCCGCTGT AAGGTTGAAT ATGAAATCGA CCCGGATAGG
GATGGCGAAA TTGCTGTCGC CGAGAAAATG GGATTGCAGG ATAATGAAGC CTGGCAGGGA
TTGTGGCGCA GGTATGCAGA AGCCCCGGCC CTTTACCCCG GCATCCCCGC AATTTTGCGG
CGCGCCAAAC CATCAAGATT GTTCGTTAAC CGGGAGCCCT GGCCGGATGA AAACGAAGCT
GAAGAGGAGG CCTTAAGGCA AAGCCTTTTG GAGCTGGAAA AATTATCTTC CCCTGATGCT
AGGCAAAAAA TCGAAGAACT CGAGAAGGAA CACGGCGAGC GGAGGGAGTG GATCTGGGCC
CAGCTGGGAC AGAGCCCCCT GGCCGGGGCT TTAAAGCACC TGGTGACCCT GGCGCGAAAG
ACTGCCCGAG GCCTGGGAGG TGACATGCCC CAGGCAATGG CCGAACTATA CATCGAGGGC
GGGTATCTGG CCGATGATGC CGTCTTACAG GCAACTGGCA GCGTAAAATC ATTAGAGGAT
GCGCAGGCAA TACAGGCCGC TGTCCGCAGT ATCTACCTTC CCTGGCTGGA GGATGTAGCC
CGGCATTTCC AGGATTTGAT TAAAACCTTC CCTTTGCCGA ACGCTGATGA TAAAGACCGT
ACTTTAATTG CCGCCAATCC CGGGCAGTGT TTGCTCTTTA TAGACGGGCT CCGCTTTGAC
ATTGCCCGGC GACTGGTTGC CATGGCTGAA GCAAGACAGC TCCGGGTAAA TATAAATTGG
CGCTGGGCCG GACTGCCCAC GGTAACGGCA ACGGCAAAAC CGGCAGCCTC CCCCATAGCA
GGAAAGCTTT CCGGCCATTT ACCCGGCGAA TACTTTATTC CAGAGATTGC CGGGGCTAAT
CTCCCCCTGA CTCCCGACCG GTTTCGCAAG CTCTTAGCGG AAGCAGGCTA TCAGGTGTTC
AATTCTCCGG AGACGGGGCA CCCGGGTGAA CCTGGAGCCC GGGGGTGGAC GGAATTTGGT
GAATTTGACC GATTGGGACA TACGCTGCAA AGCAGGCTTG CTGCCCGCAT CGATGAACAG
CTTGAGCTTG TCCTGGATCG AATCCAGGGT TTGCTGGAGG CCGGCTGGCA GCAGGTGCGC
GTAGTAACAG ACCATGGGTG GCTTTTAGTC CCGGGCGGGC TGCCGGCCAT GAAGCTGCCC
AAATACCTCA CCGAAAGCCG CTGGACACGG TGTGCCGCCA TCCGGCCGGG TGCCCATGTT
GATGTGCCGA CTGCCGGATG GTACTGGAAT GCATACCAGC ACATCGCTTT TGCTCCGGGG
GTATATTGTT TCATAAACGG CAACGAATAT GCCCATGGCG GCGTCAGCCT TCAGGAATGC
CTGCTTCCTG ACTTGACTTT TAATTCCAGC GGGCTAACCC CAGTTACGGT TAGCATAAGG
GAAATTCAAT GGTATGGGAT GCGCTGTCGC GTTGCAGTTG ATACGAGCAG CAGCGAAGTC
ATGGCCGACC TGCGGACCAA ACCCAATGAT CCCCATTCCA GTATTACCAC GCCCAAACCG
ATTGATTCCG GTGGACGTGT TGGCCTTCTT GTCGCGGACG ATGCCCTTGA AGGCACTACG
GTCAGCCTCG TCCTGCTCGA CCCGTCGGGA CGGGTACTGG CAAAGCAGGC GACTACCGTT
GGAGGTGATG AGTAG
 
Protein sequence
MSSRTSEMLA NTLLEAVRDS LLRAGRYDSA TIVPPAAILW TDADGQWQPL VSQLRPLMPE 
LLTLGDYNPE EKTGPAIWLR CVIERMLPDV ELPDKAIPII YLPNVSRQVL RAGEECPESL
KPLVELQYRG TVWTQRNGKD WTVEAFLVSE DGLGLDVAKD KQTRQAMLRA LPQLATAPVA
RLRGKRLEAE DFDRLMVADT QRDLLSWLSD PARTREKWGE EKWMAFCSRC KVEYEIDPDR
DGEIAVAEKM GLQDNEAWQG LWRRYAEAPA LYPGIPAILR RAKPSRLFVN REPWPDENEA
EEEALRQSLL ELEKLSSPDA RQKIEELEKE HGERREWIWA QLGQSPLAGA LKHLVTLARK
TARGLGGDMP QAMAELYIEG GYLADDAVLQ ATGSVKSLED AQAIQAAVRS IYLPWLEDVA
RHFQDLIKTF PLPNADDKDR TLIAANPGQC LLFIDGLRFD IARRLVAMAE ARQLRVNINW
RWAGLPTVTA TAKPAASPIA GKLSGHLPGE YFIPEIAGAN LPLTPDRFRK LLAEAGYQVF
NSPETGHPGE PGARGWTEFG EFDRLGHTLQ SRLAARIDEQ LELVLDRIQG LLEAGWQQVR
VVTDHGWLLV PGGLPAMKLP KYLTESRWTR CAAIRPGAHV DVPTAGWYWN AYQHIAFAPG
VYCFINGNEY AHGGVSLQEC LLPDLTFNSS GLTPVTVSIR EIQWYGMRCR VAVDTSSSEV
MADLRTKPND PHSSITTPKP IDSGGRVGLL VADDALEGTT VSLVLLDPSG RVLAKQATTV
GGDE