Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1139 |
Symbol | |
ID | 3833237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1167806 |
End bp | 1170535 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637829069 |
Product | hypothetical protein |
Protein accession | YP_429996 |
Protein GI | 83589987 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.827602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAA ATCGTTTATG GTTCTGTCTT TTAATTATAA TCCCGGGCTT CCTGGTCGCA GCCTACCTGG GCTCCCACTT CCTGACCGAC TGGTACTGGT TTGCCGAAGT GGGCTACCGC CAGGTCTTCC TTACCCGGTT GCTATCGGAA GTGGGAATAC GCCTGGGGAC TATAGCCTTT TTCTTCCTTT TCTTTTATCT AAACCTCCTT TTTACCCGTA AAAGCCTCCA CCTATCGCCT CCTGAAGGGC GGGAAAATTG GACTTTAAAG GAATACCTCA TCGACCGTTT TATTACCAGC AGGCGTTTGG GTATCCTGTA TCTTTTACTA AGCCTGGCGG GGGCCCTCAT CTTCAGCCCC CTGGCCGCCG GTAAGTGGCT GGTGGTCCAG GAGTACCTCA GGGCCACCCC CTTCGGCCTT GCCGATCCCC TTTTTGGCAG GGATGTTAGT TTCTATATCT TTAAACTGCC CCTTTACCAT TTTCTCTATA AATTGCTGAT AACGGCTGTG GTTGGGGCCG TCCTGGTCAC CGGCTTCTTC TACTTCATCT TTAACCCGCG GGAGCTCCTG GGCCTGCGGC GGGGCCATTT TTCCCGTCCT CTGGTCCATT TTTCCACCCT GGTAGCCCTA CTCTTTCTAA TTCAAGCCTG GGGGTTCCGC TTGCAGGCCC TTGATCTGGT TCGATCATCC CGGGGTGTGG CCTTCGGCGC CAGCTATACC GATATCCACG CCCTCCTTCC TGGCTATAAC ATCCTGGGAT GGGTAGCCGT AGCCTGCGGC CTGATTATTG TCCTTAACGC CTTTCGCCGT AACCTAAAGC TAGTCAGTGC CGGTATTCTA TCCTTTATGG CCGCCTATTT CCTGCTGGTG ATAGCTGTAC CCCTGGCAGT GCAAAAATTC CAGGTCGAGC CCAACGAGTT CGCCCGGGAG GAGCCCTACC TGCGCTATAA TATTAACTTT ACCCGCCGGG CCTATGGACT GGACAGGATC ACCATCCAGG AATTCCCGGC TCTGGACAAC TTGACCCCCG CCAGTCTGAG GGAAGAAGGG GCCACCCTGG ACAACATCCG CCTGTGGGAT TACCGACCCC TGGAGCAAAC CTACAGCCAG CTCCAGGAGA TCCGTTCTTA TTATAGTTTT AAGGATATCG ATGTCGACCG CTACACCCTG GATGGCAGGG AGCGGCAGGT CATGCTGGCG GCCCGGGAAC TGGACCAGAA CAAATTGCCT GACCGGGCCC GGACGTGGAT CAACGAAAAA ATGCGCTATA CCCACGGCTA CGGTCTGGCC ATGAACCCGG CCAACACCGT TACTGCCGGC GGCCAGCCGG AGTTTATCGC CGGCGACCTG CCCTTTCACA GCAGTGCCGG CCTCCAGGTT AATGAGCCCC GGATCTATTA CGGTGAACTG ACCGGTGATT ATGTCATTAC CGGCGGTACG GCAGCTGAAT TTGATTACCC TGTCACGGGA GAAGATAATT TCGTTGAAAC CCGATATCAG GGAAGAGGCG GGGTTCCCAT TAATACTCCC TGGCGGCGGC TGGTCTTCGC TTTTCGCTTT CACGATTACC GGCTGCTGAT GAGCAACGGG CTAACTCCCC AGAGTAAGAT ACTCTATTAC CGCAATATCC AGGAACGGGT GCGGAAGATC ATGCCCTATT TGCGCTATGA CGCTGATCCC TACCTGGTTG TTGCCGGGGG CCGCCTTTAC TGGTTCCTAG ACGCCTATAC CATTACTAAT ATGTATCCCT ATTCCGAACC AAACAGCGGC GGCTTCAATT ATATTCGCAA CTCTGTCAAG GTGGTTATCG ATGCTTATAA CGGTAGTGTT GACTACTATC TTGTTGACCC GGGAGACCCC CTGGCCCAAA CCCTGGCCAG AATTTTCCCC GGGCTATTCA AGCCCCGGGA AGACATGCCG GCCGGGCTGC AGCAGCACCT GCGTTATCCC CCCGACCTTT TAAGCATCCA GGCCCAGATG TTGACTAACT ACCATATGGA AAACACCATG CTTTTTTATA ACAAAGAGGA TGCCTGGAGT ATAGCTGAGG AAATGGTCGG CGATAAACGC CAGGCCATGG ATCCCTATTA CACCCTGATG CGTTTACCCG GGGAAACACA GGCGGAGTAT ATCTTAATGC TTCCCTTTAC ACCAGCCCGT AAGGTCAATA TGATAGCCTG GCTAGCAGCC CGCAATGATG GTCCCCATTA CGGCCAGCTT TTGTTATACC AGTTCCCTAA AAACCGCTCT ATTTATGGCC CTATGCAGGT CGAAGCCCGT ATCGACCAGG AACCGCGTAT CTCCCAGCAG CTAACCCTCT GGGACCAGCA TGGCTCCCAG GTCATCCGGG GGAATCTTCT GGTAATCCCC ATTAAAGGCT CCCTGCTTTA TGTAGAGCCA ATCTTCCTCC AGGCCCAGGA AAGTAAATTA CCTGAACTGC GCCAGGTGGT AGTTGCTTAC GAAGAAAAAA TAGCCATGGC CGACACCCTG GCCGGGGCCC TGCAGGTCAT CTTCGGTACC CAGACACCTG CACCCGCCGC CAGTCCTCAA CCGCCATCCC AGGCTGCAAC AGGCAGCCCA GGTAACCTGT CGGAACTCAT CAAAGAAGCC AACCGCCTCT ATAGCGAAGC CCAGGACAGG CTAAAACAGG GTGATTGGGC CGGTTACGGG GAAAACCTGA AAAAGCTGGA GCAGGTCCTC CAGGAGATGG GACAAAAAGT AGCTGAATGA
|
Protein sequence | MKLNRLWFCL LIIIPGFLVA AYLGSHFLTD WYWFAEVGYR QVFLTRLLSE VGIRLGTIAF FFLFFYLNLL FTRKSLHLSP PEGRENWTLK EYLIDRFITS RRLGILYLLL SLAGALIFSP LAAGKWLVVQ EYLRATPFGL ADPLFGRDVS FYIFKLPLYH FLYKLLITAV VGAVLVTGFF YFIFNPRELL GLRRGHFSRP LVHFSTLVAL LFLIQAWGFR LQALDLVRSS RGVAFGASYT DIHALLPGYN ILGWVAVACG LIIVLNAFRR NLKLVSAGIL SFMAAYFLLV IAVPLAVQKF QVEPNEFARE EPYLRYNINF TRRAYGLDRI TIQEFPALDN LTPASLREEG ATLDNIRLWD YRPLEQTYSQ LQEIRSYYSF KDIDVDRYTL DGRERQVMLA ARELDQNKLP DRARTWINEK MRYTHGYGLA MNPANTVTAG GQPEFIAGDL PFHSSAGLQV NEPRIYYGEL TGDYVITGGT AAEFDYPVTG EDNFVETRYQ GRGGVPINTP WRRLVFAFRF HDYRLLMSNG LTPQSKILYY RNIQERVRKI MPYLRYDADP YLVVAGGRLY WFLDAYTITN MYPYSEPNSG GFNYIRNSVK VVIDAYNGSV DYYLVDPGDP LAQTLARIFP GLFKPREDMP AGLQQHLRYP PDLLSIQAQM LTNYHMENTM LFYNKEDAWS IAEEMVGDKR QAMDPYYTLM RLPGETQAEY ILMLPFTPAR KVNMIAWLAA RNDGPHYGQL LLYQFPKNRS IYGPMQVEAR IDQEPRISQQ LTLWDQHGSQ VIRGNLLVIP IKGSLLYVEP IFLQAQESKL PELRQVVVAY EEKIAMADTL AGALQVIFGT QTPAPAASPQ PPSQAATGSP GNLSELIKEA NRLYSEAQDR LKQGDWAGYG ENLKKLEQVL QEMGQKVAE
|
| |