Gene Mvan_0097 details

Gene Information

Locus tag: Mvan_0097
Symbol:
ID: 4644980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 110199
End bp: 111989
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 71%
IMG OID: 639803608
Product: phosphoenolpyruvate-protein phosphotransferase
Protein accession: YP_950954
Protein GI: 120401125
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
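
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 110199
end_bp = 111989

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; assume the last codon is the stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1791
print(protein_length_aa)  # 596
```

Under these assumptions the computed values agree with the Gene Length (1791 bp) and Protein Length (596 aa) fields.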


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.168524
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTGGGTTT TGCGCCAGAT CAATGTTGAT ATTGTTGGTT TTACTCCTGA ATGTGTTGAC 
TTGTCAACGG TTTCTTGTAA CCTGGTTCAC ATGACCGCGT CGATTCCGCT CACCTCACAC
CCGACCGCCG GCACCGTGCT GCACGGTGTG CCCGTGGTGG GGGGAGTCCA GTACGCGCCG
GTGATCCGGC CCGGCAAGCT GCCGTCGACA GACGTCGGCG ACACGATGCC CGACCTCGCC
GAAGACCGGC GGGCCGCCGA AAACGAACGC TTCGCCGCCG CGGCCGCCAC CGTCGCCGGC
CGGTTGCGGG AGCGGGCCGC ACACGCAACC GGTGCCGCAT CCGAGGTGCT CGCCGCGACG
GCGACGCTGG CTCAGGACCG GGGCTGGCTC GGCGCCGCCG AGAAACGCAT CAAAGAGGGC
ACCCCCGCGG TGCGGGCGGT CAACGGCGCG ATCGACCAGT TCATCGAGAT GTTCACCAAG
CTCGGTGGTC TGATGGCCGA GCGCGTCACC GACCTGCGTG ACATCCGGGA CCGCGTGGTC
GCCGAGCTCA ACGGGTTGCC CGAACCGGGT GTTCCGTTGC CCGACGAGCC GTCGATCCTG
TGTGCCGAGG ATCTCGCGCC CGCCGACACC GCCGGTCTGG ACCCCGCCCT CGTCGTCGCG
CTGGCCACCA CGCTCGGTGG CCCGACCAGC CACACCGCGA TCATCGCGCG CCAGCTCGGC
ATCCCGTGCG TCGTCGCCGT CGCCGGGCTC GACGACGTTC CCGCCGGCGC CATGGTGCTC
GTCGACGGAA CCCGGGGCAC GGTCACCGTC GGCCCGGACG AGGTGTCCGC GCGTGAGGCG
GTGGCCGAGG CCAAACGGGC CGCCGAAACC GCGGCGCAGT GGTCGGGTCC CGGCGTCACC
GCAGACGGTC ACGCCGTTGC CGTGCTCGCC AACGTCCAGG ACGGGGCGGC CGCCCGTGCG
GCGCGGCAGA CCCCGGCCGA GGGGGTGGGC CTGTTCCGCA CCGAACTGTG CTTCCTCAAC
AGCGACACCG AGCCCGCCGT CGACGAGCAG GCGACCATCT ACTCCGAGGT GCTGGAGGCG
TTCGCCGGCA ACAAGGTGGT GATCCGCACC CTCGACGCCG GATCCGACAA GCCGCTCAAG
TTTGCCGGCC ATCCCGACGA GGCCAACCCG GCCCTGGGAG TGCGCGGGGT CCGCATCGCG
GCCAACAACC CCGGTCTGCT GGACCGTCAG CTCGAGGCGA TCGCCGCGGC GGGGCAACGC
ACCGGCAACC CGCCCTGGGT GATGGCGCCG ATGATCGCCA CCGCCGAGGA GGCGAAGAGC
TTCGCGGACA GGGCGCGGAC ACACGGGCTG ACGCCGGGCG TGATGATCGA GGTGCCGGCG
GCGGCCTTGC TGGCCGACCG GATCCTCGAG CATGTCGACT TCCTGTCGAT CGGCACCAAC
GACCTGGCGC AGTACACGAT GGCCGCCGAC CGGATGTCGG CCGAGCTGGC CACCCTCACC
GATCCGTGGC AGCCGGCCGT TCTCGCGTTG GTCGCGATGA CGGTGAACGC GGGTGCGGCC
GCGGGGAAGC CGGTCGGCGT CTGCGGCGAG GCCGCCGCCG ATCCGCTGCT GGCATGCGTG
CTCACCGGAT TCGGTGTGAC GTCCCTGTCG GCGGCCGCAG CAGCCGTCCA GGCCGTGGGC
GCCAAGCTCG CCCAGGTCAC GCTCCAGCAG TGTCGCGATG CCGCCGAGGC CGTGCTCCGC
ACCGCCAGCG CCGCGGACGC CCGCGCCGTC GCAATGTCGG TGCTGGGTTA G
 
Protein sequence
MWVLRQINVD IVGFTPECVD LSTVSCNLVH MTASIPLTSH PTAGTVLHGV PVVGGVQYAP 
VIRPGKLPST DVGDTMPDLA EDRRAAENER FAAAAATVAG RLRERAAHAT GAASEVLAAT
ATLAQDRGWL GAAEKRIKEG TPAVRAVNGA IDQFIEMFTK LGGLMAERVT DLRDIRDRVV
AELNGLPEPG VPLPDEPSIL CAEDLAPADT AGLDPALVVA LATTLGGPTS HTAIIARQLG
IPCVVAVAGL DDVPAGAMVL VDGTRGTVTV GPDEVSAREA VAEAKRAAET AAQWSGPGVT
ADGHAVAVLA NVQDGAAARA ARQTPAEGVG LFRTELCFLN SDTEPAVDEQ ATIYSEVLEA
FAGNKVVIRT LDAGSDKPLK FAGHPDEANP ALGVRGVRIA ANNPGLLDRQ LEAIAAAGQR
TGNPPWVMAP MIATAEEAKS FADRARTHGL TPGVMIEVPA AALLADRILE HVDFLSIGTN
DLAQYTMAAD RMSAELATLT DPWQPAVLAL VAMTVNAGAA AGKPVGVCGE AAADPLLACV
LTGFGVTSLS AAAAAVQAVG AKLAQVTLQQ CRDAAEAVLR TASAADARAV AMSVLG
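
The gene and protein sequences are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, translated, and checked for GC content. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which is not modeled here). The helper names `clean_sequence`, `translate`, and `gc_fraction` are hypothetical and not part of any tool referenced by this record.

```python
# Codon table built in the conventional TCAG order.
# Assumption: translation table 11 uses the same codon-to-amino-acid
# assignments as the standard code; alternative start codons are ignored.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    a trailing partial codon is ignored.
    """
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if empty)."""
    dna = clean_sequence(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# The first 18 bases of the gene sequence, in the block format used above.
print(translate("ATGTGGGTTT TGCGCCAG"))  # MWVLRQ
print(gc_fraction("ATGC"))               # 0.5
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above if the assumptions hold, and `gc_fraction` can be compared with the GC content field of the record.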