Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_3361 |
Symbol | |
ID | 4647644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 3578063 |
End bp | 3580792 |
Gene Length | 2730 bp |
Protein Length | 909 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639806839 |
Product | DNA polymerase I |
Protein accession | YP_954164 |
Protein GI | 120404335 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00788758 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.41459 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCCTG CCAAGACTGC GTCGGAGACG AAAGCCGGTA AACCGGACAC CAGCGAGAAG CCGACGCTGA TGCTTCTGGA CGGCAACTCA CTGGCCTTTC GCGCGTTCTA CGCACTGCCG GCAGAGAACT TCAAAACCAA GAGCGGGCTG ACCACCAACG CGGTGTACGG CTTCACCGCG ATGCTGATCA ACCTGCTGCG CGACGAGCAG CCCACCCACA TCGCGGCCGC GTTCGACGTG TCCCGCCAGA CGTTCCGCAA GGACAAATAC CCGGAGTACA AGGAGGGCCG TTCGGCGACG CCCGACGAAT TCCGCGGCCA GATCGACATC ACCAAGGAGG TGCTCGGCGC GCTGGGGATC ACGGTGCTGG CCGAGCCCGG TTTCGAGGCC GACGACATCA TCGCGACACT CGCCACGCAG GCCGAGAACG AAGGACACCG CGTGCTGGTG GTGACCGGCG ACCGCGATTC GCTGCAGCTG GTCAGCGACG ACGTCACCGT CCTCTATCCC CGCAAGGGGG TCAGCGAGTT GACCCGGTTC ACCCCGGAGG CGGTGCAGGA GAAGTACGGG CTCACACCGG CGCAGTACCC CGACTTCGCG GCGCTGCGCG GCGACCCCAG CGACAACCTG CCCGGCATCC CGGGCGTGGG GGAGAAGACC GCGACCAAGT GGATCGCCGA GTACGGCTCA CTGCAGTCGC TGGTGGACAA CGTCGACAAG GTCAAGGGCA AGGTCGGCGA CGCGCTGCGT GCCCACCTGT CCAGCGTCGT GCTCAACCGT GAGCTCACCG ACCTGGTCAA AGACGTGCCG TTGGCGCACA CCCCGGACAC GCTGCGGATG CAGCCGTGGG ACCGCGACCA GATCCACCGG CTCTTCGACG ACCTCGAGTT CCGGGTACTG CGCGACCGGC TGTTCGACAC GCTGGCCTCC GCCGACCCCG AGGTGGAAGA GGGTTTCGAA GTGCGCGGCG AGGCGCTGGA GGCGGGCACA CTGGCCGCCT GGCTGGCCGA GCACAGCAAC GGTCAGCGGT TCGGGGTCGC CGTCGTCGGC AACCACCTGG CGTTCGACAG CGACGCCACC GCGGTCGCGC TCGTCGCCTC CGACGGCGAC GGACGCTACA TCGACACCAC CCGGCTGGAC CCCGAGGACG AGAAGGCGCT GGCATCCTGG CTTGCCGATC CCGATGCGCC GAAGGCGCTG CACGAGGCCA AGCTCGCGAT GCACGACCTG CAGGGCCGGG GCTGGACGCT GGCCGGGGTC ACCTCCGACA CCGCGCTGGC CGCCTACCTG GTGCGGCCCG GTCAACGCAG CTTCGCGCTC GATGACCTGT CGCTGCGCTA CCTGAAACGC GAACTACGCG CCGATAACCC CGAACAGCAA CAGCTTTCGT TGCTCGATGA CAGCGACGCC GTCGACGACC AGGCGGTGCA GACCTTGCTG TTGCGGGCCG GCGCTGTCGT GGACCTCGCC GACGCCCTCG ACGAGGAACT GGCCCGCATC GACTCCTCGG CGCTGCTGGG CAGCATGGAA CTGCCGGTGC AGCGGGTGCT GGCCCAGCTG GAAACCGTGG GCATCGCGGT CGACCTCGCG ATGCTCTCGG AGCTGCAGAG CGAGTTTGCC GGCCAGATCC GCGACGCAGC CGAGGCGGCC TACGCGGTGA TCGGCAAACA GATCAACCTC GGTTCACCGA AGCAGCTGCA GGTCGTGTTG TTCGACGAGC TGGAGATGCC CAAGACCAAG CGCACCAAGA CCGGCTACAC CACCGACGCC GACGCACTGC AATCGCTGTT CGACAAGACC GGGCACCCGT TCCTGCAGCA TCTGCTCGCC CATCGGGACG CCACCCGGCT GAAGGTCACC GTCGACGGGC TGCTGAATTC GGTTGCCGCA GATGGGCGAA TTCATACCAC GTTCAACCAG ACGATCGCGG CGACCGGCCG GCTGTCCTCG ACCGAGCCGA ACCTGCAGAA CATCCCGATC CGCACCGAGG CCGGGCGGCG CATCCGCGAC GGATTTGTGG TGGGCGAGGG CTACGCCGAT TTGATGACCG CCGACTACAG CCAGATCGAG ATGCGGATCA TGGCGCATCT CTCGAAGGAC GCAGGCCTGA TCGAAGCGTT CAACACGGGG GAGGACCTGC ACTCGTTCGT CGCATCCCGG GCGTTCTCGG TGCCGCTCGA CGAGGTCACC CCCGAGCTGC GGCGTCGGGT GAAAGCCATG TCGTACGGAC TGGCGTACGG GTTGAGCGCC TACGGGCTCT CCCAGCAGCT CAAGATCTCC ACCGAGGAGG CCAAGGAGCA GATGGAGCAG TACTTCGCCC GGTTCGGCGG GGTGCGCGAC TATCTGCGCG ACGTGGTCGA CCAGGCCCGC AAGGACGGAT ACACGTCGAC GGTGTTCGGT CGCCGGCGTT ACCTGCCCGA GCTCGACAGC AGCAACCGGC AGGTGCGGGA GGCTGCCGAA CGTGCCGCGC TCAACGCGCC GATCCAGGGC AGTGCCGCCG ACATCATCAA GGTCGCGATG ATCAACGTCG ACCAGGCCAT CAAGGAGGCG GGGCTGTCGT CTCGCATGCT GCTCCAGGTG CACGACGAGC TGCTCTTCGA GGTTGCCGAC GGCGAACGCG ACGCGCTCGA TGCCCTGGTC CGCGAGCACA TGGGCAGCGC CTATCGGCTG GATGTGCCGC TCGAGGTCTC GGTCGGATAC GGCCGCAGCT GGGACAGTGC CGCTCATTAG
|
Protein sequence | MSPAKTASET KAGKPDTSEK PTLMLLDGNS LAFRAFYALP AENFKTKSGL TTNAVYGFTA MLINLLRDEQ PTHIAAAFDV SRQTFRKDKY PEYKEGRSAT PDEFRGQIDI TKEVLGALGI TVLAEPGFEA DDIIATLATQ AENEGHRVLV VTGDRDSLQL VSDDVTVLYP RKGVSELTRF TPEAVQEKYG LTPAQYPDFA ALRGDPSDNL PGIPGVGEKT ATKWIAEYGS LQSLVDNVDK VKGKVGDALR AHLSSVVLNR ELTDLVKDVP LAHTPDTLRM QPWDRDQIHR LFDDLEFRVL RDRLFDTLAS ADPEVEEGFE VRGEALEAGT LAAWLAEHSN GQRFGVAVVG NHLAFDSDAT AVALVASDGD GRYIDTTRLD PEDEKALASW LADPDAPKAL HEAKLAMHDL QGRGWTLAGV TSDTALAAYL VRPGQRSFAL DDLSLRYLKR ELRADNPEQQ QLSLLDDSDA VDDQAVQTLL LRAGAVVDLA DALDEELARI DSSALLGSME LPVQRVLAQL ETVGIAVDLA MLSELQSEFA GQIRDAAEAA YAVIGKQINL GSPKQLQVVL FDELEMPKTK RTKTGYTTDA DALQSLFDKT GHPFLQHLLA HRDATRLKVT VDGLLNSVAA DGRIHTTFNQ TIAATGRLSS TEPNLQNIPI RTEAGRRIRD GFVVGEGYAD LMTADYSQIE MRIMAHLSKD AGLIEAFNTG EDLHSFVASR AFSVPLDEVT PELRRRVKAM SYGLAYGLSA YGLSQQLKIS TEEAKEQMEQ YFARFGGVRD YLRDVVDQAR KDGYTSTVFG RRRYLPELDS SNRQVREAAE RAALNAPIQG SAADIIKVAM INVDQAIKEA GLSSRMLLQV HDELLFEVAD GERDALDALV REHMGSAYRL DVPLEVSVGY GRSWDSAAH
|
| |