Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0016 |
Symbol | |
ID | 3831888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 15302 |
End bp | 17011 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637827943 |
Product | phosphoenolpyruvate--protein phosphotransferase |
Protein accession | YP_428899 |
Protein GI | 83588890 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000201046 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTTCTTA AGGGAATAGC TATTTCCCCG GGTAAGGTGA TAGGACGGGC ACACCGGTTA ATAGAGCATG CCAACAATTA CTCCAGGAGC TTTTTAACCA GTGAAGTAGA GAAGGAGAAA GAGCTTGCCA GGTTAGCACA GGCTTTCAAC CAGGCTAAGG AACAACTGGC AAGCCTCATT GATAAAACTA AACGCGAGAT TGGCCTCCAG GAAGCTAAAA TTTTTGAAGC CCATCTTTTA CTTCTTTCCG ATCCGCTATT GGGGGAAAAA ATAAGAACAA AAATCCAGGT GGAAGGGAAA GATGCTGTCT GGGCGGTTAG TGAAGCTACC GAAGAAATAG CTCAGGAGTT TGCCTCCCTG GAGGATGAAT ACTTCCGCGA AAGGGCTGTG GACATTCGCG ATATAGGCCG GCGTCTAATT GCCTGTCTGG GAAATGGCTT AGAGGAAGTA AGCGACGAGG CAACCAACAC CATAATTGTA GCTGAAGAAT TAACGCCTTC CCAAACAGCC AGTTTTTCTC GCCGCACAGT AGCCGGGATT ATTACCGAAA AGGGAGGACC TACCAGTCAT ACAGCAGTGG TAGCCAGGTC ATTAGGTATA CCTGCAGTCT CCGGAGTAAC CAACTTATTA GACCTGGTTA AAGATGGTGA TTTCGTTTTC ATAGACGGAA ATAATGGGGT AATACATTTA AACCCCGATG TAGAAGAGAT CAAGAACCTG CCGACCAACG GGCAAAACAA TTGGAGAGAA AAAAGGTGTA TAAGGAATGC TTTAGCTAAA GCCATAACCC GTGATGGACA TGAAATAGAG GTAGCAACAA ATCTCCGCAA CTTTGAAGAG GCGGAGTTAG CGCTGGCATA TGGCGCCGAA GGCGTAGGGC TCTTCCGAAC AGAATTTCTT TATATGAATC GCCAGGAGCC TCCGGGAGAA GAAGAACAAT TTTTGATTTA CCGGGACGTG TTACTTGCCT TAAAAGATAA ACCCGTGATT GTGCGCACCC TGGACATTGG GGGCGATAAA CATCTGCCTT ACCTTGAGCA AGATAAAGAA GATAACCCAT TTTTAGGCTT GAGGGGAATA AGGCTGTGTT TAAAACACAA AGACCTTTTT AAGACACAAC TGAAAGCCTT GCTACGTGCT TCTTCTTACG GTAATCTCAA GCTCATGTTT CCCATGGTAA CGACCTTGGA GGAAATCCGC CAGGCTAAAA CTCTCCTGGA AGAAGCACGA GAAGAGTTGC AGGTTGCTGG TATAGAAACA AGTATAAATA TTCCCATCGG TATTATGATC GAAGTGCCAG CGGCGGCTCT CATGGCCGAT ATTTTAGCCA GCGAAGTGGA CTTTTTCAGC ATAGGCACCA ATGATTTAAC CCAATACACA TTGGCCGTAG ATCGGGACAA TGAAAAAGTA GCAGTTCTTT ACGATGCCTA CCATCCGGCA GTTTTAAAAT TTATTTACCA GGTGGTGGAT GCTGCTCACC GAAAGGGCAA ATGGGTAGGT CTTTGTGGTG AGCTGGGCGG AGATAGCCTG GCTGCACCTC TCCTGGTAGG TCTGGGGCTG GACGAGATAA GTATGAGTCC TGTATTTATT CCCGCAATGA AGGAAAGAAT ACGAGCACTT TCTTATGAGG AGGGACGGGA ATTGGTGCGG CGACTGCTGG AACTGCCCGG TGCTAAAGAA GTAAGAGAGG TGTTAGTAAA AACATCATAG
|
Protein sequence | MLLKGIAISP GKVIGRAHRL IEHANNYSRS FLTSEVEKEK ELARLAQAFN QAKEQLASLI DKTKREIGLQ EAKIFEAHLL LLSDPLLGEK IRTKIQVEGK DAVWAVSEAT EEIAQEFASL EDEYFRERAV DIRDIGRRLI ACLGNGLEEV SDEATNTIIV AEELTPSQTA SFSRRTVAGI ITEKGGPTSH TAVVARSLGI PAVSGVTNLL DLVKDGDFVF IDGNNGVIHL NPDVEEIKNL PTNGQNNWRE KRCIRNALAK AITRDGHEIE VATNLRNFEE AELALAYGAE GVGLFRTEFL YMNRQEPPGE EEQFLIYRDV LLALKDKPVI VRTLDIGGDK HLPYLEQDKE DNPFLGLRGI RLCLKHKDLF KTQLKALLRA SSYGNLKLMF PMVTTLEEIR QAKTLLEEAR EELQVAGIET SINIPIGIMI EVPAAALMAD ILASEVDFFS IGTNDLTQYT LAVDRDNEKV AVLYDAYHPA VLKFIYQVVD AAHRKGKWVG LCGELGGDSL AAPLLVGLGL DEISMSPVFI PAMKERIRAL SYEEGRELVR RLLELPGAKE VREVLVKTS
|
| |