Gene Moth_1867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1867 
Symbol 
ID3831498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1929845 
End bp1931593 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content62% 
IMG OID637829799 
Productpyruvate kinase 
Protein accessionYP_430710 
Protein GI83590701 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0469] Pyruvate kinase 
TIGRFAM ID[TIGR01064] pyruvate kinase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCACA CGAAGATTGT CTGTACCATG GGCCCGGCCA GCGAGCGGGT CGAGGTAATC 
AAGGCTATGA TCCGGGCGGG GATGAACGTA GCCCGGTTCA ATTTTTCCCA TGGCAGCCAC
GCCGAGCACG GGGCGCGGAT GGCTGCCGTG CGCCAGGCGG CAGCTGAACT GGGCGCCAGG
GTAGCGTTAA TGCTGGATAA TAAGGGGCCC GAAATTCGCC TGGGAGAGAT CCAGGGCGAG
GTCACCCTGA AGGACGGCGA CCAGGTGACC CTGACCACAG AACCTATTAT TGGTGACGCC
AGGCGTTTGC CGGTGAGCTT TGCCGGTCTG CCGGGGGACG TCCGGCCGGG CCAGATCATT
CTCCTGGACG ACGGCCTGGT GGAGCTGGAG GTCCTGGCGA CCACCGCCAC CGAGATTCAC
TGCCGCGTCC GTCACGGCGA TGTTATTTCC AGCCATAAGG GCGTCAACGT CCCCGGGGCC
GAGATCAGCC TGCCTCCTTT TACCGAGCAG GATATTAAAG ACCTTGAGTT CGGCCTCCAG
CAGGGGATAG ATTTTATCGC CCTCTCCTTT GTCCGGACGG CCGGGGATGT CCTGGCAGTA
CGCCGGGAGC TTGAGAAGCG CAACGCCAGG GTAGCCATTA TCGCCAAGAT AGAAAACCAT
GCCGGGGTCA ATAACATCCA CGAGATCCTT GAGGTGGCCG ACGGGGTCAT GGTGGCCCGG
GGTGACCTGG GGGTAGAGAT CCCCGTGGAA GAGGTCCCCC TGGTGCAGAA AAAGATTATC
GAGGCGTGTA ACCTGGCCGG CAAGCCGGTT ATCACGGCCA CCCAGATGCT GGAGTCTATG
ATTCATAACC CGCGGCCGAC CCGGGCCGAA GCCAGCGATG TGGCCAATGC CATCTTTGAC
GGAACGGATG CCATTATGCT CTCCGGGGAA ACGGCTACGG GCCGTTATCC GGTAGAGGCT
GTGGCGACCA TGGCCCGCAT CGCCCGCCGG GCCGAGAGGG GTTTGCCCTA TGGTGACCTG
TTGACGAAAA AGGGTCTGGC TGCCGAGCGG ACGGCCACCG ATGCCATCAG CCACGCGAGC
TGCACCATTG CCTATGAACT CGACGCCGCC GCCATTATCA CCCCCACGGC TTCCGGTTCC
ACCGCCCGCC GGGTGGCCAA ATACCGTCCC CGGGCGCCTA TCCTGGCCAC CAGCCCCAAC
GAGAAGGTTT TGAACCAGCT CTGCCTGGTC TGGGGGGTTG AACCCCTCCT GGTGGAGCCG
ACAAGCGGCA CCGACGAAAT GGTTAATGCC GCGGTGGCGG CGGCCATACT CTCCGGCCGG
GTGAAACAGG GCGACCTGGT GGTTATTACT GCCGGCGTGC CTGCCGGTGT TCCGGGTACC
ACCAACCTCC TCAAGGTCCA CATCGTCGGC GAGGTCCTGG TGCGGGGACG GGGGATCGGC
AAAGAAGTAA CCAGTGGCCC GGTCCGGCTG GTAAAGACTG CTGCTGACGC CGTGGCAAGG
GTCAAAAAAG GCGACATTCT GGTGACCACT GAGACCGGCC CTGAATTCCT GCCAGCTATG
GAAAGGGCGG CAGCAGTAAT TACGGAAACC GGGGGGCTGA GTTCCCATGC CGCGGTAACC
GGCCTGAGCC TGGGTATACC GGTAGTCGTC GGGGCAAAGG GGGCCACTGA AAAGCTAACC
GATGATCTGG TCGTAACCAT AGACGTTGTC CGCGGCCTGG TCTACCGCGG TCAGACGCGG
GTGTTGTGA
 
Protein sequence
MRHTKIVCTM GPASERVEVI KAMIRAGMNV ARFNFSHGSH AEHGARMAAV RQAAAELGAR 
VALMLDNKGP EIRLGEIQGE VTLKDGDQVT LTTEPIIGDA RRLPVSFAGL PGDVRPGQII
LLDDGLVELE VLATTATEIH CRVRHGDVIS SHKGVNVPGA EISLPPFTEQ DIKDLEFGLQ
QGIDFIALSF VRTAGDVLAV RRELEKRNAR VAIIAKIENH AGVNNIHEIL EVADGVMVAR
GDLGVEIPVE EVPLVQKKII EACNLAGKPV ITATQMLESM IHNPRPTRAE ASDVANAIFD
GTDAIMLSGE TATGRYPVEA VATMARIARR AERGLPYGDL LTKKGLAAER TATDAISHAS
CTIAYELDAA AIITPTASGS TARRVAKYRP RAPILATSPN EKVLNQLCLV WGVEPLLVEP
TSGTDEMVNA AVAAAILSGR VKQGDLVVIT AGVPAGVPGT TNLLKVHIVG EVLVRGRGIG
KEVTSGPVRL VKTAADAVAR VKKGDILVTT ETGPEFLPAM ERAAAVITET GGLSSHAAVT
GLSLGIPVVV GAKGATEKLT DDLVVTIDVV RGLVYRGQTR VL