Gene Moth_2153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2153 
Symbol 
ID3833002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2255942 
End bp2256940 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content53% 
IMG OID637830075 
Producthypothetical protein 
Protein accessionYP_430985 
Protein GI83590976 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.101261 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGGAAA ACGCAGGGCA GCTCCGGCCT CCCCACCACC CCCACGACAA GGGTTACAGG 
CAGCTTCTTT CCGACAAGAG AGTTTTCCTG GAATTGCTGA AAACCTTCGT CCGGGAAACC
TGGGTAGAGG CTATCGACGA AAAAGATCTC ATCCTGGTGA ACAAATCCTA CGTCCTCCAG
GATTTCAGCG AGAAAGAAGC CGATATCGTT TACCGGCTTA AGACGAAAGA TAGAAACGTC
ATCTTTTACG TCCTGCTGGA GCTGCAGTCA ACGGTAGACT ACCTGATACC CTTCCGGCTG
CTGCTCTATA TGGTCGAGAT CTGGCGGGAA ATCTACACCA ACACCCCGCA GCACGAGCGG
GAGAGCAAGC ATTTCCGCCT GCCGCCCATC ATCCCGGCGG TGCTCTACAA CGGGGCCGGA
TCCTGGACGG CGGCGCTCTC CTTCAAAGAA ATGCTGGACA GTTACCAGGA TTTCAGCGGG
CATCTCCTGG ACTTTCATTA CCTGCTGTTT GATGTCAACC GTTACAGCGA AGAAGAGCTG
ATCAGAGCGG CAAACCTGAT CGCCGGTGTC TTTCTCCTGG ACCAAAAGAT GCGGCCGGAA
GAGCTGGTTG GACGGCTGCA GAAACTGGCG GGGGTCTTAA GGCGGCTCAC GCCTGACGAG
TTCCGCCATA TCACCAGCTG GCTGAAGAAC GTCGTCAAGC CCAGAATGCC TGAAGGTTTT
AAGGAAAAGG TTGACCGCAT CCTGGACGCA AGCAATCCCT GGGAGGTGGA ACGGATGATT
TACAATCTGG AATTAACCCT GGAAGAGATG CAACAGCAGG CTTTGTTAAA AGGCTTAAAA
GAAGGCGAAC AGAAGGGGAA ATTGGAAGGG AAATTGGAGG GCAAACGAGA AGTAGCCCGG
AACCTGCTAC TGCTCAACGT CGATATAGAG ACCATTGTTA AAGCTACGGG GCTTACTCCG
GACGAGATCG CCGTGTTAAA GAAACAGCTG GAACAGTGA
 
Protein sequence
MPENAGQLRP PHHPHDKGYR QLLSDKRVFL ELLKTFVRET WVEAIDEKDL ILVNKSYVLQ 
DFSEKEADIV YRLKTKDRNV IFYVLLELQS TVDYLIPFRL LLYMVEIWRE IYTNTPQHER
ESKHFRLPPI IPAVLYNGAG SWTAALSFKE MLDSYQDFSG HLLDFHYLLF DVNRYSEEEL
IRAANLIAGV FLLDQKMRPE ELVGRLQKLA GVLRRLTPDE FRHITSWLKN VVKPRMPEGF
KEKVDRILDA SNPWEVERMI YNLELTLEEM QQQALLKGLK EGEQKGKLEG KLEGKREVAR
NLLLLNVDIE TIVKATGLTP DEIAVLKKQL EQ