Gene Information |
Locus tag | Mvan_2707 |
Symbol | |
ID | 4645462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 2866579 |
End bp | 2869389 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639806188 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_953520 |
Protein GI | 120403691 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
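The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the terminal stop codon (so the protein has one residue fewer than the number of codons). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information table for Mvan_2707.
start_bp, end_bp = 2866579, 2869389

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp)  # 2811, matching "Gene Length"
print(length_aa)  # 936, matching "Protein Length"
```

Under those assumptions the computed values agree with the reported gene length of 2811 bp and protein length of 936 aa.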
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.136783 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTCAAG CTCCGGAGAT GTCCGAGACC GCGCTCGAAC CGATCGGAGC CGTCCACCGC ACCCAGGTGG GCCGGGACGC CACCGAGCCG ATGCGCGAGG ACATCCGGCT GCTCGGGGCG ATCCTCGGTG ACACCGTGCG CGAGCAGAAC GGCGACGAGG TGTTCGACCT CATCGAGCGG GCCCGTGTCG AGGCGTTCCG GGTGCGCCGC TCCGAGATCG ACCGTGCCGA CATCGCCAAG TTGTTCGACG GGGTCGACGT CCACCTCGCC CTCCCGGTGA TCCGCGCCTT CACGCACTTC GCCCTGCTGG CCAACGTGGC CGAGGACATC CACCGGGAAC GCCGCAGGGC AGTCCACGAA GCGGCCGGGG AACCGCCGCA GGACAGCAGC CTTGCCGCCA CCTACCTCAA ACTCGACAGC GCCGAACTCG ATGCGGTCGC CGTTGCCGAC GCTCTCGCCG GCGCGCTGGT CTCCCCGGTC ATCACCGCCC ATCCCACCGA GACGCGCCGC CGCACCGTCT TCGACACCCA GCACCGCATC ACCGAGCTCA TGCGGTTGCG ACTGCACGGG CACACCACCG CGGACGGCCG GGACATCGAC CGCGAACTGC GCCGCCACAT CCTCACGTTG TGGCAGACCG CCTTGATCCG GTTGTCCCGC TTGAAGATCC AGGATGAGAT CGAGACGGGG TTGCGGTACT ACGCGGCGGC GTTCTTCGAG GTGATCCCAC AGGTGAACGC CGAGGTGCGC ACGGCGTTGC AGGCCCGCTG GCCGGAGGGC GGGCTGCTCG AAGAGCCGAT CGTGCGGCCC GGGTCGTGGA TCGGTGGCGA CCGCGACGGC AACCCGAACG TGACCGCCGA TGTGGTCCGG CTCGCCACCG GAAGCGCCGC CCACACCGCG TTCGCGCACT ACTTCGCCGA GCTCACCGCC CTCGAGCAGG AGCTGTCGAT GTCGGCGCGG CTCGTGCACA TCAGCGATCA GCTGGGAGCC CTTGCCGACG CCTGCCACGA GCCGGCGCGG GCCGACGAAC CGTACCGTCG GGCCCTTCGT GTGGTGCACG CTCGGCTGAC CGCGACCGCC CGGCAGATCC TCGACCGTCA GCCCGAGCAC GAGCTCGACC TCGGAATGGA TCCCTACTCC GCACCCGGCG AGCTCCTGGA CGACCTCGAC GTCATCGACG CGTCGCTGCG CGCCAACGGC AGCGCGGTCC TGGCCGACGA CAGGCTGGCT CGTCTGCGGG AAGCGGTGCG GGTGTTCGGG TTCCATCTGT CAGGGCTCGA CATGCGGCAG AACTCCGACG TGCACGAAGA GGTCGTCGCC GAGCTGTTGG CGTGGGCCGG TGTGCACCCC GACTACGCGA GTCTGGGCGA GTCCGAACGC GTCGAGATCC TGGCCGCCGA GTTGGGTACC CGCCGCCCAC TGATCGGCCA AGACGCCGAG CTGTCCGAAC TGGCGCGCAA GGAACTCGAC ATCGTGGCGG CCGCGGCCAG AGCCGTGCAC GTGTTCGGGC CGGAGGCCGT GCCGAACTAT GTCATCTCGA TGTGCCAGTC CGTCTCGGAC ATGCTCGAGG CCGCGATCCT GCTGAAAGAG GCGGGACTGC TCGATGCGAC CTCGGCGCAC CCGTACGCAC CCGTCGGGAT CGTGCCGCTG TTCGAAACCA TCGACGACCT CCAGCATGGC GCGTCCATTC TCGAAGCGGC GCTTGACCTT CCGCTGTACC GGGCAGTCGT CACGGCCCGC GGCGGCAGCC AGGAGGTCAT GCTCGGCTAC TCGGACTCCA ACAAGGACGG CGGCTATCTG 
GCCGCGAACT GGGCGCTCTA CCGCGCGGAG CTGGACCTGG TGGAGTCGGC GCGCAGGACC GGCATCCGCC TGCGCCTGTT CCACGGCCGC GGCGGCACCG TAGGCCGCGG CGGTGGACCC AGTTACGACG CGATCCTGGC GCAGCCGCCC GGCGCAGTGC AGGGTTCGCT GCGCCTCACC GAGCAGGGCG AGGTCATCGC CGCGAAGTAC GCCGAGCCCA GGATCGCGCG CCGCAACCTG GAGACGCTGC TCGCCGCCAC GCTGGAGGCC ACCCTGCTCG ACGTCGAGGG GCTCGGAGAC GCCGCGAACC CGGCCTACGA GGTGCTCGAC GATCTCGCGG CGCGCGCGCA GCGGGCCTAC GCCGAACTTG TGCACGAGAC ACCGGGTTTC GTCGAGTACT TCAAGGCATC GACACCCGTC AGCGAGATCG GTGCGCTCAA CATCGGCAGC CGTCCCACCT CGCGCAAGCC GACGACGTCC ATCTCGGACC TGCGCGCCAT CCCGTGGGTG CTGGCGTGGA GCCAGTCCCG GGTCATGCTG CCCGGCTGGT ACGGCACCGG AACCGCGTTC GAGGAGTATG TGGGCGAAGG CCCCGGCAGT GCGGAGCGCC TCGCGGTGCT GCAGGACCTG TACCGACGGT GGCCGTTCTT CGCGACGGTG CTGTCGAACA TGGCTCAGGT GCTCGCGAAA TCCGATCTGG GGCTGGCGTA CCGGTACGCG GAGCTTGTCG AGGACGAGGC GCTGCGCCGA CGGGTGTTCG ACAAGATCGC CGACGAACAC CAACGCACGA TCCGCATGCA CGAGCTGATC ACCGGGCACG ACGATCTGCT GGCCGACAAC CCGGCGCTGG CCCGGTCGGT GTTCAACCGT TTCCCGTACC TGGAGCCGCT CAATCATCTT CAGGTGGAGC TGCTGCGCCG CTACCGTTCC GGCGATCAGG ACGAACTGGT GCAGCGCGGC ATCCTGCTGA CGATGAGCGG ACTGGCCACC GCGCTCCGCA ACAGCGGCTG A
|
Protein sequence | MGQAPEMSET ALEPIGAVHR TQVGRDATEP MREDIRLLGA ILGDTVREQN GDEVFDLIER ARVEAFRVRR SEIDRADIAK LFDGVDVHLA LPVIRAFTHF ALLANVAEDI HRERRRAVHE AAGEPPQDSS LAATYLKLDS AELDAVAVAD ALAGALVSPV ITAHPTETRR RTVFDTQHRI TELMRLRLHG HTTADGRDID RELRRHILTL WQTALIRLSR LKIQDEIETG LRYYAAAFFE VIPQVNAEVR TALQARWPEG GLLEEPIVRP GSWIGGDRDG NPNVTADVVR LATGSAAHTA FAHYFAELTA LEQELSMSAR LVHISDQLGA LADACHEPAR ADEPYRRALR VVHARLTATA RQILDRQPEH ELDLGMDPYS APGELLDDLD VIDASLRANG SAVLADDRLA RLREAVRVFG FHLSGLDMRQ NSDVHEEVVA ELLAWAGVHP DYASLGESER VEILAAELGT RRPLIGQDAE LSELARKELD IVAAAARAVH VFGPEAVPNY VISMCQSVSD MLEAAILLKE AGLLDATSAH PYAPVGIVPL FETIDDLQHG ASILEAALDL PLYRAVVTAR GGSQEVMLGY SDSNKDGGYL AANWALYRAE LDLVESARRT GIRLRLFHGR GGTVGRGGGP SYDAILAQPP GAVQGSLRLT EQGEVIAAKY AEPRIARRNL ETLLAATLEA TLLDVEGLGD AANPAYEVLD DLAARAQRAY AELVHETPGF VEYFKASTPV SEIGALNIGS RPTSRKPTTS ISDLRAIPWV LAWSQSRVML PGWYGTGTAF EEYVGEGPGS AERLAVLQDL YRRWPFFATV LSNMAQVLAK SDLGLAYRYA ELVEDEALRR RVFDKIADEH QRTIRMHELI TGHDDLLADN PALARSVFNR FPYLEPLNHL QVELLRRYRS GDQDELVQRG ILLTMSGLAT ALRNSG
|
| |