Gene Moth_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0104 
Symbol 
ID3831994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp101911 
End bp103278 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content62% 
IMG OID637828038 
ProducttRNA(Ile)-lysidine synthetase-like 
Protein accessionYP_428986 
Protein GI83588977 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG0037] Predicted ATPase of the PP-loop superfamily implicated in cell cycle control 
TIGRFAM ID[TIGR02432] tRNA(Ile)-lysidine synthetase, N-terminal domain
[TIGR02433] tRNA(Ile)-lysidine synthetase, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.192858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGACA GGGTCCGGCA GACAATTCAG GACTACCATC TGCTAGTACC GGGGGACAAA 
GTAGTTGTCG GGGTGTCCGG GGGGCCGGAT TCCCTGGCTT TATTGCACAG CCTCATGACC
CTGCAGGAAG AGTATGGCTA TACCCTCCAG GTAGCCCATC TAAACCACGG GCTGCGGCCG
GAAGCAGCGG CTGATGCTGA GTATGTTCGC GATTTGGCCA CAGGCTGGGG CCTGCCGGTT
ACCGTCGCCC AACGCGACGT CTTAGCCTAC CGGCAGGAAC ACCACCTATC CATCGAGGCC
GCCGCCAGGG AGGTACGCTA TAATTTCTTC CAGGAAGTGG CGGCAGCAGT CGGGGCGACC
AGGATAGCCG TCGGCCACCA GGCGGAAGAC CAGGCGGAAA CCGTCCTCCT GAACCTCCTG
CGCGGCAGCG GCTTAACCGG GCTAAAAGCC ATGTTGCCCC GACGAGGGCG GCTTATCAGG
CCTTTGCTTT TTGTTACCCG GGCCGAGATT GAGGCCTACT GCCGGGATAA CGGCCTCCAT
CCGCGCCGGG ATTTTACCAA CGAGGACCCG GCCTACCGGC GCAATAAAAT CCGGCACCAG
CTACTCCCCC TCCTGGCCCG GGAGTATAAC CCGGCCATAG TTGCCACCCT GGGGCGGACG
GCCCTGATCC TCCAGGAAGA CGAAGCCCTC CTGGCGGATC TGGCCCACAG AGCCCTGGAA
GGAATTATTA AAAGGCGAGA AGGGGAAACC CTGGTCCTGG ACCGCCAGGG GTGGCAGGAC
CTGGCCCCGG CCCTCCAGCG CCGGGTTCTG AGGCTGGCGG CAGCTACCCT GGGCCGGCGC
GTAAGTTTTA ACCAGGTGGA AAAGGCCCGG GCAGTGGCCC GGGAAGGGGG CACCCTGACC
TGGCCGGGCC GGTTAAGTAT CCGGGCCCGG GGTGCAGAGC TGCACCTCCA GTTACCCGGC
AAGTCAGCTG GTAAAGTCTC CTTCTCTTAT CAACTGCAAG TTCCCGGCCT GACCCCCCTG
CCGGAGGTAG GCAAGGCCAT CAGGGCGGAG ATCGCTCCAC CACCTCGGGC CTTTAAACCC
GGGAGGGAGG ATGAGGCCTG GCTGGACCGC GCCAAATTAA AACAACCCCT CCTGGTACGT
AACTGGCTGC CAGGGGATTG TTTCCGGCCC CTGGGAATGA AGGGGACGAA AAAGCTACAG
GACTATTTTA TCGACAGGCA CCTCCCGGCT GCCCGACGCC CCCTGATACC CCTGGTTATC
AGTGAAGGCC GCATTGCCTG GGTTGCAGGA CTTGGCCTGG CCGAAGACTT CAAAGTCACA
CCCGCAACCC GGGAGACCCT GCATCTGAAA TTGGAGCCAT GGCCGTAA
 
Protein sequence
MLDRVRQTIQ DYHLLVPGDK VVVGVSGGPD SLALLHSLMT LQEEYGYTLQ VAHLNHGLRP 
EAAADAEYVR DLATGWGLPV TVAQRDVLAY RQEHHLSIEA AAREVRYNFF QEVAAAVGAT
RIAVGHQAED QAETVLLNLL RGSGLTGLKA MLPRRGRLIR PLLFVTRAEI EAYCRDNGLH
PRRDFTNEDP AYRRNKIRHQ LLPLLAREYN PAIVATLGRT ALILQEDEAL LADLAHRALE
GIIKRREGET LVLDRQGWQD LAPALQRRVL RLAAATLGRR VSFNQVEKAR AVAREGGTLT
WPGRLSIRAR GAELHLQLPG KSAGKVSFSY QLQVPGLTPL PEVGKAIRAE IAPPPRAFKP
GREDEAWLDR AKLKQPLLVR NWLPGDCFRP LGMKGTKKLQ DYFIDRHLPA ARRPLIPLVI
SEGRIAWVAG LGLAEDFKVT PATRETLHLK LEPWP