Gene Moth_2145 details


Gene Information

Locus tag: Moth_2145
Symbol:
ID: 3833145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2245416
End bp: 2247305
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 51%
IMG OID: 637830068
Product: hypothetical protein
Protein accession: YP_430978
Protein GI: 83590969
COG category:
COG ID:
TIGRFAM ID:
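
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both are assumptions about how this record is annotated, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2245416, 2247305)
print(length)                     # 1890
print(protein_length_aa(length))  # 629
```

Both values match the Gene Length and Protein Length fields listed in the record.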


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0965405
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTGGG TTTGCAGATA CAGGTTATTT ATAGTGCCTT TATTGGTTTT CTTCCTTGTT 
TCGGGTTTTG GCCTTGCGTG GGGCGCGGAT GAAACTAGTA CAATGCAGAG TCCGGAAGTT
GAATGGGAAA AGACCCTCGG AAAAGGGATA GGTTATTCCG TCCAGCAGAC ATCGGATGGT
GGCTACATTA TTGTAGGTTC CACACAATTT CGCGGCGCTG GCGATGTTTA TCTAATCAAG
ACTGATGCCA ATGGCAATAA GCTTTGGGAA AAGACTTTTG GTGGAAGCGG TTCGGATGAA
GGTTATTCTG TCCAGCAGAC GACCGATGGC GGCTACATTA TTGCAGGTTC CACGCATTCT
TACGGTGGCG GTGACGATGA CGTATACTTG ATCAAGACTG ATGCCAACGG TAACAAGCTC
TGGGAAAAGG TTTTTAAGGG AGAAGAGCTA ATTGAGGTCA AAGGCGGAAT AGCCCGGATA
AAGACCTTAA AGGGAGAGCT GATTAAAGAG ATCGATATCA CCAAGGATTG GGAAAAGTAT
TGGCCAGAGA CGTTAGGGGC AGAGCTGATT AACGAGGATA CTACCAACAA TTACTGGCTA
TGGAAAAAGA CTTTAGGAGG AAAAGGGCGT TCCGTCCAGC AGACGGCCGA TGGGGGTTAC
ATTATTGCGG GTTACACAAA CACTTACAAC GTTTATCTGA TCAAGACTGA TACTAATGGC
GACACGCTTT GGGAAAGGAT CTTTGGGAGT AATTATACTG AAGTCTATTC CGTCCAGCAG
ACGACCGACG GTGGCTACAT TATTGCAGGT TACATAGACC CTGGTAGTGT CGGGAAGGGT
AACGTTTACC TGATCAAGAC CGACGCTAAA GGCAACATGG TCTGGGAGAA GACTTTCGGG
GGAAGTAATT GGGATAAAGG CTATTCCGTC CGGCAGACGA CCGACGGTGG CTATATTATT
GCAGGTTTCA CGCGCTCTTA CGGTGTCGGT AACGATGACG TATACTTGAT CAAGACTGAT
GCCAACGGTA ACAAGCTCTG GGAAAAGAAC CTTGGGGGAA ATTATTGGGA GGGAGGCTAT
TCCGTCCAGC AGACGACCGA CGGCGGCTAC ATTGTTGCAG GTGTAGGCGA TTATTCTCAG
ATCAAGACCG ATGGCGACGG TAACTTGCTC TGGAAAAAGA CCTTAAGAGG GGAAGGACGT
TCCGTCCAGC AGACGACCGA CGGTGGTTAC ATTATTGCGG GTTACACATT CTCTCGCAGT
ACCGATAGTG ATGTTTACTT GATCAAACTT AAACCCGAAA CTCCCCCCGC AAACCAGCCT
CCAGTGGTAA GTTTAAAGGA TATGCAGGGC CACTGGGCGG CCGACGCGGT GGACAGGCTG
GTTGAGACGG GGGTTGTCTC CGGTTACCCA GACGGGACTT TCAGGCCCGA CCTGGAAGTG
ACCCGTGCCG AAATTGCGGC TATCTTGGTG CGCGCCCTAA AGCTCACACC AACCAACAAT
CAGGAGCTAA AGTTCAAGGA TGATGCAACC ATCCCGACCT GGGCCAAGGA CGCGGTAAGT
ATAGCGGTTA AGGAAGGCCT GGTTAAGGGC TACCTTCAGC CGGATGGGAC AATGACCTTC
GAAGCCGACC GCCCCGTCAC ACGAGCAGAA ATGGCTGTAT TAGTGGCGCG CGTCCTCCGG
AAAAAACTCG GGGAGGTCAC CCCGATGGAG CTTAAATTCA CCGACGCTGT CATGATCCCG
GCTTGGGCCA AATCGGACGT CGGCGTTGCT GTGGCGGAAG GCATCGTTGT CGGGTATCCC
GACAATACCT TCCGTGCAGA GAACCATGTC ACCCGTGCGG AGGCTGCGGT AATGATCCTG
CGGCTCCTAA GGGTGCTTGG CAGAATATAA
 
Protein sequence
MFWVCRYRLF IVPLLVFFLV SGFGLAWGAD ETSTMQSPEV EWEKTLGKGI GYSVQQTSDG 
GYIIVGSTQF RGAGDVYLIK TDANGNKLWE KTFGGSGSDE GYSVQQTTDG GYIIAGSTHS
YGGGDDDVYL IKTDANGNKL WEKVFKGEEL IEVKGGIARI KTLKGELIKE IDITKDWEKY
WPETLGAELI NEDTTNNYWL WKKTLGGKGR SVQQTADGGY IIAGYTNTYN VYLIKTDTNG
DTLWERIFGS NYTEVYSVQQ TTDGGYIIAG YIDPGSVGKG NVYLIKTDAK GNMVWEKTFG
GSNWDKGYSV RQTTDGGYII AGFTRSYGVG NDDVYLIKTD ANGNKLWEKN LGGNYWEGGY
SVQQTTDGGY IVAGVGDYSQ IKTDGDGNLL WKKTLRGEGR SVQQTTDGGY IIAGYTFSRS
TDSDVYLIKL KPETPPANQP PVVSLKDMQG HWAADAVDRL VETGVVSGYP DGTFRPDLEV
TRAEIAAILV RALKLTPTNN QELKFKDDAT IPTWAKDAVS IAVKEGLVKG YLQPDGTMTF
EADRPVTRAE MAVLVARVLR KKLGEVTPME LKFTDAVMIP AWAKSDVGVA VAEGIVVGYP
DNTFRAENHV TRAEAAVMIL RLLRVLGRI
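
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before use in most tools. The following is a minimal sketch of that cleanup plus a GC-content calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is only the final line of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Last displayed line of the gene sequence (three blocks of 10 bases).
tail = clean_sequence("CGGCTCCTAA GGGTGCTTGG CAGAATATAA")
print(len(tail))        # 30
print(tail[-3:])        # TAA, the stop codon
print(gc_percent(tail))
```

Applying `clean_sequence` to all lines of the gene sequence and passing the result to `gc_percent` would give a value to compare against the GC content field in the Gene Information section.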