Gene Mvan_3361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3361 
Symbol 
ID4647644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3578063 
End bp3580792 
Gene Length2730 bp 
Protein Length909 aa 
Translation table11 
GC content67% 
IMG OID639806839 
ProductDNA polymerase I 
Protein accessionYP_954164 
Protein GI120404335 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00788758 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.41459 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCCTG CCAAGACTGC GTCGGAGACG AAAGCCGGTA AACCGGACAC CAGCGAGAAG 
CCGACGCTGA TGCTTCTGGA CGGCAACTCA CTGGCCTTTC GCGCGTTCTA CGCACTGCCG
GCAGAGAACT TCAAAACCAA GAGCGGGCTG ACCACCAACG CGGTGTACGG CTTCACCGCG
ATGCTGATCA ACCTGCTGCG CGACGAGCAG CCCACCCACA TCGCGGCCGC GTTCGACGTG
TCCCGCCAGA CGTTCCGCAA GGACAAATAC CCGGAGTACA AGGAGGGCCG TTCGGCGACG
CCCGACGAAT TCCGCGGCCA GATCGACATC ACCAAGGAGG TGCTCGGCGC GCTGGGGATC
ACGGTGCTGG CCGAGCCCGG TTTCGAGGCC GACGACATCA TCGCGACACT CGCCACGCAG
GCCGAGAACG AAGGACACCG CGTGCTGGTG GTGACCGGCG ACCGCGATTC GCTGCAGCTG
GTCAGCGACG ACGTCACCGT CCTCTATCCC CGCAAGGGGG TCAGCGAGTT GACCCGGTTC
ACCCCGGAGG CGGTGCAGGA GAAGTACGGG CTCACACCGG CGCAGTACCC CGACTTCGCG
GCGCTGCGCG GCGACCCCAG CGACAACCTG CCCGGCATCC CGGGCGTGGG GGAGAAGACC
GCGACCAAGT GGATCGCCGA GTACGGCTCA CTGCAGTCGC TGGTGGACAA CGTCGACAAG
GTCAAGGGCA AGGTCGGCGA CGCGCTGCGT GCCCACCTGT CCAGCGTCGT GCTCAACCGT
GAGCTCACCG ACCTGGTCAA AGACGTGCCG TTGGCGCACA CCCCGGACAC GCTGCGGATG
CAGCCGTGGG ACCGCGACCA GATCCACCGG CTCTTCGACG ACCTCGAGTT CCGGGTACTG
CGCGACCGGC TGTTCGACAC GCTGGCCTCC GCCGACCCCG AGGTGGAAGA GGGTTTCGAA
GTGCGCGGCG AGGCGCTGGA GGCGGGCACA CTGGCCGCCT GGCTGGCCGA GCACAGCAAC
GGTCAGCGGT TCGGGGTCGC CGTCGTCGGC AACCACCTGG CGTTCGACAG CGACGCCACC
GCGGTCGCGC TCGTCGCCTC CGACGGCGAC GGACGCTACA TCGACACCAC CCGGCTGGAC
CCCGAGGACG AGAAGGCGCT GGCATCCTGG CTTGCCGATC CCGATGCGCC GAAGGCGCTG
CACGAGGCCA AGCTCGCGAT GCACGACCTG CAGGGCCGGG GCTGGACGCT GGCCGGGGTC
ACCTCCGACA CCGCGCTGGC CGCCTACCTG GTGCGGCCCG GTCAACGCAG CTTCGCGCTC
GATGACCTGT CGCTGCGCTA CCTGAAACGC GAACTACGCG CCGATAACCC CGAACAGCAA
CAGCTTTCGT TGCTCGATGA CAGCGACGCC GTCGACGACC AGGCGGTGCA GACCTTGCTG
TTGCGGGCCG GCGCTGTCGT GGACCTCGCC GACGCCCTCG ACGAGGAACT GGCCCGCATC
GACTCCTCGG CGCTGCTGGG CAGCATGGAA CTGCCGGTGC AGCGGGTGCT GGCCCAGCTG
GAAACCGTGG GCATCGCGGT CGACCTCGCG ATGCTCTCGG AGCTGCAGAG CGAGTTTGCC
GGCCAGATCC GCGACGCAGC CGAGGCGGCC TACGCGGTGA TCGGCAAACA GATCAACCTC
GGTTCACCGA AGCAGCTGCA GGTCGTGTTG TTCGACGAGC TGGAGATGCC CAAGACCAAG
CGCACCAAGA CCGGCTACAC CACCGACGCC GACGCACTGC AATCGCTGTT CGACAAGACC
GGGCACCCGT TCCTGCAGCA TCTGCTCGCC CATCGGGACG CCACCCGGCT GAAGGTCACC
GTCGACGGGC TGCTGAATTC GGTTGCCGCA GATGGGCGAA TTCATACCAC GTTCAACCAG
ACGATCGCGG CGACCGGCCG GCTGTCCTCG ACCGAGCCGA ACCTGCAGAA CATCCCGATC
CGCACCGAGG CCGGGCGGCG CATCCGCGAC GGATTTGTGG TGGGCGAGGG CTACGCCGAT
TTGATGACCG CCGACTACAG CCAGATCGAG ATGCGGATCA TGGCGCATCT CTCGAAGGAC
GCAGGCCTGA TCGAAGCGTT CAACACGGGG GAGGACCTGC ACTCGTTCGT CGCATCCCGG
GCGTTCTCGG TGCCGCTCGA CGAGGTCACC CCCGAGCTGC GGCGTCGGGT GAAAGCCATG
TCGTACGGAC TGGCGTACGG GTTGAGCGCC TACGGGCTCT CCCAGCAGCT CAAGATCTCC
ACCGAGGAGG CCAAGGAGCA GATGGAGCAG TACTTCGCCC GGTTCGGCGG GGTGCGCGAC
TATCTGCGCG ACGTGGTCGA CCAGGCCCGC AAGGACGGAT ACACGTCGAC GGTGTTCGGT
CGCCGGCGTT ACCTGCCCGA GCTCGACAGC AGCAACCGGC AGGTGCGGGA GGCTGCCGAA
CGTGCCGCGC TCAACGCGCC GATCCAGGGC AGTGCCGCCG ACATCATCAA GGTCGCGATG
ATCAACGTCG ACCAGGCCAT CAAGGAGGCG GGGCTGTCGT CTCGCATGCT GCTCCAGGTG
CACGACGAGC TGCTCTTCGA GGTTGCCGAC GGCGAACGCG ACGCGCTCGA TGCCCTGGTC
CGCGAGCACA TGGGCAGCGC CTATCGGCTG GATGTGCCGC TCGAGGTCTC GGTCGGATAC
GGCCGCAGCT GGGACAGTGC CGCTCATTAG
 
Protein sequence
MSPAKTASET KAGKPDTSEK PTLMLLDGNS LAFRAFYALP AENFKTKSGL TTNAVYGFTA 
MLINLLRDEQ PTHIAAAFDV SRQTFRKDKY PEYKEGRSAT PDEFRGQIDI TKEVLGALGI
TVLAEPGFEA DDIIATLATQ AENEGHRVLV VTGDRDSLQL VSDDVTVLYP RKGVSELTRF
TPEAVQEKYG LTPAQYPDFA ALRGDPSDNL PGIPGVGEKT ATKWIAEYGS LQSLVDNVDK
VKGKVGDALR AHLSSVVLNR ELTDLVKDVP LAHTPDTLRM QPWDRDQIHR LFDDLEFRVL
RDRLFDTLAS ADPEVEEGFE VRGEALEAGT LAAWLAEHSN GQRFGVAVVG NHLAFDSDAT
AVALVASDGD GRYIDTTRLD PEDEKALASW LADPDAPKAL HEAKLAMHDL QGRGWTLAGV
TSDTALAAYL VRPGQRSFAL DDLSLRYLKR ELRADNPEQQ QLSLLDDSDA VDDQAVQTLL
LRAGAVVDLA DALDEELARI DSSALLGSME LPVQRVLAQL ETVGIAVDLA MLSELQSEFA
GQIRDAAEAA YAVIGKQINL GSPKQLQVVL FDELEMPKTK RTKTGYTTDA DALQSLFDKT
GHPFLQHLLA HRDATRLKVT VDGLLNSVAA DGRIHTTFNQ TIAATGRLSS TEPNLQNIPI
RTEAGRRIRD GFVVGEGYAD LMTADYSQIE MRIMAHLSKD AGLIEAFNTG EDLHSFVASR
AFSVPLDEVT PELRRRVKAM SYGLAYGLSA YGLSQQLKIS TEEAKEQMEQ YFARFGGVRD
YLRDVVDQAR KDGYTSTVFG RRRYLPELDS SNRQVREAAE RAALNAPIQG SAADIIKVAM
INVDQAIKEA GLSSRMLLQV HDELLFEVAD GERDALDALV REHMGSAYRL DVPLEVSVGY
GRSWDSAAH