Gene Moth_0409 details


Gene Information

Locus tag: Moth_0409
Symbol:
ID: 3832091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 413769
End bp: 414779
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 53%
IMG OID: 637828346
Product: hypothetical protein
Protein accession: YP_429286
Protein GI: 83589277
COG category: [S] Function unknown
COG ID: [COG5464] Uncharacterized conserved protein
TIGRFAM ID: [TIGR01784] conserved hypothetical protein (putative transposase or invertase)
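
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 413769
END_BP = 414779
REPORTED_GENE_LENGTH = 1011    # bp
REPORTED_PROTEIN_LENGTH = 336  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    # Both comparisons should hold if the record is internally consistent.
    print(length == REPORTED_GENE_LENGTH)
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, 414779 - 413769 + 1 gives 1011 bp, and 1011 / 3 - 1 gives 336 residues, matching the reported values.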


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.650641
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00708611
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCCGGAAA ACGCAGGGCA GCTCCGGCCT CCCTACCACC CCCACGACAA GGGTTACAGG 
CAGCTTCTTT CCGACAAGAG AGTTTTCCTG GAATTGCTGA AAACCTTCGT CCGGGAAACC
TGGGTAGAGG CTATCGACGA AAAAGATCTC ATCCTGGTGA ACAAATCCTA CGTCCTCCAG
GATTTCAGCG AGAAAGAAGC CGATATCGTT TACCGGCTTA AGACGAAAGA TAGAAACGTC
ATCTTTTACG TCCTGCTGGA ACTGCAGTCA ACGGTAGACT ACCTGATACC CTTCCGGCTG
CTGCTCTATA TGGTCGAGAT CTGGCGGGAA ATCTACACCA ACACCCCGCA GCACGAGCGG
GAGAGCAAGC ATTTCCGCCT GCCGCCCATC ATCCCGGCGG TGCTCTACAA CGGGGCCGGA
TCCTGGACGG CGGCGCTCTC CTTCAAAGAA ATGCTGGACA GTTACCAGGA TTTCAGCGGG
CATCTCCTGG ACTTTCATTA CCTGCTGTTT GATGTCAACC GTTACAGCGA AGAAGAGCTG
ATCAGAGCGG CAAACCTGAT CGCCGGTGTC TTTCTCCTGG ACCAAAAGAT GCGGCCGGAA
GAGCTGGTTG GACGGCTGCA GAAACTGGCG GGGGTCTTAA GGCGGCTCAC GCCTGACGAG
TTCCGCCATA TCACCAGCTG GCTGAAGAAC GTCGTCAAGC CCAGAATGCC CGAAGATTTT
AGGAAAAAGG TTGACCGCAT CCTGGACGCA AGCAATCCGT GGGAGGTGGA ACGGATGATT
TATAACCTGG AATTAACCCT GGAAGAGATG CAACAGCAGG CTTTGTTAAA AGGCTTAAAA
GAAGGCGAAC AGAAGGGGAA ATTGGAAGGG AAATTGGAAG GAAAATTGGA GGGCAAACGA
GAAGTAGCTC GAAACCTGCT ACTGCTCAAC GTCGATATAG AGACCATTGT TAAAGCTACG
GGGCTTACTC CGGACGAGAT CGCCGTGTTG AAGAAACAGC TGGAACAGTG A
 
Protein sequence
MPENAGQLRP PYHPHDKGYR QLLSDKRVFL ELLKTFVRET WVEAIDEKDL ILVNKSYVLQ 
DFSEKEADIV YRLKTKDRNV IFYVLLELQS TVDYLIPFRL LLYMVEIWRE IYTNTPQHER
ESKHFRLPPI IPAVLYNGAG SWTAALSFKE MLDSYQDFSG HLLDFHYLLF DVNRYSEEEL
IRAANLIAGV FLLDQKMRPE ELVGRLQKLA GVLRRLTPDE FRHITSWLKN VVKPRMPEDF
RKKVDRILDA SNPWEVERMI YNLELTLEEM QQQALLKGLK EGEQKGKLEG KLEGKLEGKR
EVARNLLLLN VDIETIVKAT GLTPDEIAVL KKQLEQ
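
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes NCBI translation table 11 (the table listed for this gene), in which a TTG codon in the initiator position is read as methionine; the names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical helpers written for this example, not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 uses the same
# elongation assignments as the standard code; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiator codons recognised by table 11; all are translated as Met
# when they occupy the first position of the CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nucleotides of the gene sequence in this record.
    print(translate("TTGCCGGAAA ACGCAGGGCA GCTCCGGCCT"))  # MPENAGQLRP
```

The output for the first ten codons matches the first ten residues of the protein sequence listed in this record; passing the full gene sequence to `translate` extends the same comparison to the whole protein.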