Gene Moth_0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0421 
Symbol 
ID3832104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp423987 
End bp425219 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content43% 
IMG OID637828356 
ProductROK 
Protein accessionYP_429295 
Protein GI83589286 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0799462 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATG AGGTAGGGAA CCTACAGCTA GTAAAAAAGC TAAACAGGAT TAGCATCCTC 
AATATTATCA GAAAGCATAA TACCATTTCT CGACAACAAT TAGCCAAGTT AACTGGCCTT
ACGCCGGCGG CGATAACAGG GATTGTACGC GACCTAGTAG CATCAGGTTA CGTTATTGAA
AAAGGCCTGG GAAAATCTAA CGGTGGAAGG CGCCCTGTTA AGTTGCAATT TAACCCGGAT
TCCGGATATG TACTTGGGGC AGAAATAACA AGAAACAGTA CCACACTGGG CATGGTTAAC
CTCGATGCAA AACCATTAAT ACTTAAACAA TACAATATTG ACATGACCGA TCCTCAACAA
GGTCTAAGTC GACTAGCCGA CGAGGTTACA AAAATGATAA TTGAAAGTGG CATTGAAAGC
AAAAATATCT TGGGCATGGG AGTAGCCTAT CCGGGATTGG TTGACATTAG TACAAGGGTA
GTGAAACGTT CGCCCAATTT GGGTAAAAAA TGGCGGGATA TCCCGATTGA AAATTGGTTA
CAGGAAATGA CTGGAATTAA GATTTTTGTC GAAAATAACT CTAATGCAGC GGCGATGGCT
GAATATTGTT TTGGCCGGGG TAAGGAAACT AAAAATATGG CTTATATTAA CTTGGGTGAA
GGCATTAGTG CGGGTATTAT TCTGAATGGT ATGTTGGCAT ATGGATTCCG GGGCTATACT
GGTGAAATCG GCCATCTCGT GATCGACGAA GACGGACCTC TTTGTAATTG TGGTAATAAT
GGTTGCCTTG AGAGTTTGTG TGCAGTCCCG GCATTGGTTC GCAAGGCCAA TAATGAACTC
TCGTTATATA ACCAAAAAGA TCCCTTGAAA GCAATTTGGC TAGAGAAAGG CGAAGTAAAA
ATAGAAGATA TTATGGCAAA TGCTAATAAT GTAGGATCAT ATGCTCAAAA GTTGATTAGG
CAGGCTGGCT GGTATATTGG TAAAGCTATT GCCGCTATTA TCAATGTCTT TAATCCGGAA
GCCATCTTTA TCGGGGGCAT TTTGGCCGAA GCAGGTAATA GTTTATTAGA CCCCCTGATT
GAAAGCGTGC AAAAACATGC TTTCCCTGAG CTGGTGCGGG AGGTGAGAAT TGAACTTTCG
TCCATGAGGA AGGATACTGG TTTTTATGGT GCATGCGCTA TAGCGATTCG AGCACTGTTT
GAGGGAGGGG TCGATGCACT GCTGGGTGTT TAA
 
Protein sequence
MSNEVGNLQL VKKLNRISIL NIIRKHNTIS RQQLAKLTGL TPAAITGIVR DLVASGYVIE 
KGLGKSNGGR RPVKLQFNPD SGYVLGAEIT RNSTTLGMVN LDAKPLILKQ YNIDMTDPQQ
GLSRLADEVT KMIIESGIES KNILGMGVAY PGLVDISTRV VKRSPNLGKK WRDIPIENWL
QEMTGIKIFV ENNSNAAAMA EYCFGRGKET KNMAYINLGE GISAGIILNG MLAYGFRGYT
GEIGHLVIDE DGPLCNCGNN GCLESLCAVP ALVRKANNEL SLYNQKDPLK AIWLEKGEVK
IEDIMANANN VGSYAQKLIR QAGWYIGKAI AAIINVFNPE AIFIGGILAE AGNSLLDPLI
ESVQKHAFPE LVREVRIELS SMRKDTGFYG ACAIAIRALF EGGVDALLGV