Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0421 |
Symbol | |
ID | 3832104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 423987 |
End bp | 425219 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637828356 |
Product | ROK |
Protein accession | YP_429295 |
Protein GI | 83589286 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0799462 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATG AGGTAGGGAA CCTACAGCTA GTAAAAAAGC TAAACAGGAT TAGCATCCTC AATATTATCA GAAAGCATAA TACCATTTCT CGACAACAAT TAGCCAAGTT AACTGGCCTT ACGCCGGCGG CGATAACAGG GATTGTACGC GACCTAGTAG CATCAGGTTA CGTTATTGAA AAAGGCCTGG GAAAATCTAA CGGTGGAAGG CGCCCTGTTA AGTTGCAATT TAACCCGGAT TCCGGATATG TACTTGGGGC AGAAATAACA AGAAACAGTA CCACACTGGG CATGGTTAAC CTCGATGCAA AACCATTAAT ACTTAAACAA TACAATATTG ACATGACCGA TCCTCAACAA GGTCTAAGTC GACTAGCCGA CGAGGTTACA AAAATGATAA TTGAAAGTGG CATTGAAAGC AAAAATATCT TGGGCATGGG AGTAGCCTAT CCGGGATTGG TTGACATTAG TACAAGGGTA GTGAAACGTT CGCCCAATTT GGGTAAAAAA TGGCGGGATA TCCCGATTGA AAATTGGTTA CAGGAAATGA CTGGAATTAA GATTTTTGTC GAAAATAACT CTAATGCAGC GGCGATGGCT GAATATTGTT TTGGCCGGGG TAAGGAAACT AAAAATATGG CTTATATTAA CTTGGGTGAA GGCATTAGTG CGGGTATTAT TCTGAATGGT ATGTTGGCAT ATGGATTCCG GGGCTATACT GGTGAAATCG GCCATCTCGT GATCGACGAA GACGGACCTC TTTGTAATTG TGGTAATAAT GGTTGCCTTG AGAGTTTGTG TGCAGTCCCG GCATTGGTTC GCAAGGCCAA TAATGAACTC TCGTTATATA ACCAAAAAGA TCCCTTGAAA GCAATTTGGC TAGAGAAAGG CGAAGTAAAA ATAGAAGATA TTATGGCAAA TGCTAATAAT GTAGGATCAT ATGCTCAAAA GTTGATTAGG CAGGCTGGCT GGTATATTGG TAAAGCTATT GCCGCTATTA TCAATGTCTT TAATCCGGAA GCCATCTTTA TCGGGGGCAT TTTGGCCGAA GCAGGTAATA GTTTATTAGA CCCCCTGATT GAAAGCGTGC AAAAACATGC TTTCCCTGAG CTGGTGCGGG AGGTGAGAAT TGAACTTTCG TCCATGAGGA AGGATACTGG TTTTTATGGT GCATGCGCTA TAGCGATTCG AGCACTGTTT GAGGGAGGGG TCGATGCACT GCTGGGTGTT TAA
|
Protein sequence | MSNEVGNLQL VKKLNRISIL NIIRKHNTIS RQQLAKLTGL TPAAITGIVR DLVASGYVIE KGLGKSNGGR RPVKLQFNPD SGYVLGAEIT RNSTTLGMVN LDAKPLILKQ YNIDMTDPQQ GLSRLADEVT KMIIESGIES KNILGMGVAY PGLVDISTRV VKRSPNLGKK WRDIPIENWL QEMTGIKIFV ENNSNAAAMA EYCFGRGKET KNMAYINLGE GISAGIILNG MLAYGFRGYT GEIGHLVIDE DGPLCNCGNN GCLESLCAVP ALVRKANNEL SLYNQKDPLK AIWLEKGEVK IEDIMANANN VGSYAQKLIR QAGWYIGKAI AAIINVFNPE AIFIGGILAE AGNSLLDPLI ESVQKHAFPE LVREVRIELS SMRKDTGFYG ACAIAIRALF EGGVDALLGV
|
| |