Gene Moth_0016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0016 
Symbol 
ID3831888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp15302 
End bp17011 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content46% 
IMG OID637827943 
Productphosphoenolpyruvate--protein phosphotransferase 
Protein accessionYP_428899 
Protein GI83588890 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000201046 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTTCTTA AGGGAATAGC TATTTCCCCG GGTAAGGTGA TAGGACGGGC ACACCGGTTA 
ATAGAGCATG CCAACAATTA CTCCAGGAGC TTTTTAACCA GTGAAGTAGA GAAGGAGAAA
GAGCTTGCCA GGTTAGCACA GGCTTTCAAC CAGGCTAAGG AACAACTGGC AAGCCTCATT
GATAAAACTA AACGCGAGAT TGGCCTCCAG GAAGCTAAAA TTTTTGAAGC CCATCTTTTA
CTTCTTTCCG ATCCGCTATT GGGGGAAAAA ATAAGAACAA AAATCCAGGT GGAAGGGAAA
GATGCTGTCT GGGCGGTTAG TGAAGCTACC GAAGAAATAG CTCAGGAGTT TGCCTCCCTG
GAGGATGAAT ACTTCCGCGA AAGGGCTGTG GACATTCGCG ATATAGGCCG GCGTCTAATT
GCCTGTCTGG GAAATGGCTT AGAGGAAGTA AGCGACGAGG CAACCAACAC CATAATTGTA
GCTGAAGAAT TAACGCCTTC CCAAACAGCC AGTTTTTCTC GCCGCACAGT AGCCGGGATT
ATTACCGAAA AGGGAGGACC TACCAGTCAT ACAGCAGTGG TAGCCAGGTC ATTAGGTATA
CCTGCAGTCT CCGGAGTAAC CAACTTATTA GACCTGGTTA AAGATGGTGA TTTCGTTTTC
ATAGACGGAA ATAATGGGGT AATACATTTA AACCCCGATG TAGAAGAGAT CAAGAACCTG
CCGACCAACG GGCAAAACAA TTGGAGAGAA AAAAGGTGTA TAAGGAATGC TTTAGCTAAA
GCCATAACCC GTGATGGACA TGAAATAGAG GTAGCAACAA ATCTCCGCAA CTTTGAAGAG
GCGGAGTTAG CGCTGGCATA TGGCGCCGAA GGCGTAGGGC TCTTCCGAAC AGAATTTCTT
TATATGAATC GCCAGGAGCC TCCGGGAGAA GAAGAACAAT TTTTGATTTA CCGGGACGTG
TTACTTGCCT TAAAAGATAA ACCCGTGATT GTGCGCACCC TGGACATTGG GGGCGATAAA
CATCTGCCTT ACCTTGAGCA AGATAAAGAA GATAACCCAT TTTTAGGCTT GAGGGGAATA
AGGCTGTGTT TAAAACACAA AGACCTTTTT AAGACACAAC TGAAAGCCTT GCTACGTGCT
TCTTCTTACG GTAATCTCAA GCTCATGTTT CCCATGGTAA CGACCTTGGA GGAAATCCGC
CAGGCTAAAA CTCTCCTGGA AGAAGCACGA GAAGAGTTGC AGGTTGCTGG TATAGAAACA
AGTATAAATA TTCCCATCGG TATTATGATC GAAGTGCCAG CGGCGGCTCT CATGGCCGAT
ATTTTAGCCA GCGAAGTGGA CTTTTTCAGC ATAGGCACCA ATGATTTAAC CCAATACACA
TTGGCCGTAG ATCGGGACAA TGAAAAAGTA GCAGTTCTTT ACGATGCCTA CCATCCGGCA
GTTTTAAAAT TTATTTACCA GGTGGTGGAT GCTGCTCACC GAAAGGGCAA ATGGGTAGGT
CTTTGTGGTG AGCTGGGCGG AGATAGCCTG GCTGCACCTC TCCTGGTAGG TCTGGGGCTG
GACGAGATAA GTATGAGTCC TGTATTTATT CCCGCAATGA AGGAAAGAAT ACGAGCACTT
TCTTATGAGG AGGGACGGGA ATTGGTGCGG CGACTGCTGG AACTGCCCGG TGCTAAAGAA
GTAAGAGAGG TGTTAGTAAA AACATCATAG
 
Protein sequence
MLLKGIAISP GKVIGRAHRL IEHANNYSRS FLTSEVEKEK ELARLAQAFN QAKEQLASLI 
DKTKREIGLQ EAKIFEAHLL LLSDPLLGEK IRTKIQVEGK DAVWAVSEAT EEIAQEFASL
EDEYFRERAV DIRDIGRRLI ACLGNGLEEV SDEATNTIIV AEELTPSQTA SFSRRTVAGI
ITEKGGPTSH TAVVARSLGI PAVSGVTNLL DLVKDGDFVF IDGNNGVIHL NPDVEEIKNL
PTNGQNNWRE KRCIRNALAK AITRDGHEIE VATNLRNFEE AELALAYGAE GVGLFRTEFL
YMNRQEPPGE EEQFLIYRDV LLALKDKPVI VRTLDIGGDK HLPYLEQDKE DNPFLGLRGI
RLCLKHKDLF KTQLKALLRA SSYGNLKLMF PMVTTLEEIR QAKTLLEEAR EELQVAGIET
SINIPIGIMI EVPAAALMAD ILASEVDFFS IGTNDLTQYT LAVDRDNEKV AVLYDAYHPA
VLKFIYQVVD AAHRKGKWVG LCGELGGDSL AAPLLVGLGL DEISMSPVFI PAMKERIRAL
SYEEGRELVR RLLELPGAKE VREVLVKTS