Gene Mvan_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1457 
Symbol 
ID4643314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1545021 
End bp1549250 
Gene Length4230 bp 
Protein Length1409 aa 
Translation table11 
GC content63% 
IMG OID639804956 
ProductTP901 family phage tail tape measure protein 
Protein accessionYP_952296 
Protein GI120402467 
COG category 
COG ID 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.143459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.730205 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGATCA CGCTGAAGGT TGAGGCGCAG GCCGACAACC GTTCTTTCAA GCAGGCCGCT 
GATCAGGCGG AGCGGGTTTT CGCCGATGCC GGTAAGGCGG CGTCGGGGTC TTTCGCGAAG
GCTTTCGGGG AGGGTTCGAA AGAGGTCAAG CAGGCGACTT CGCAGGCGGT CAAGGCTTAT
GACGCGGTGG CTGATGCGGT GGGTAAGGCC ACTGTGGCGG AGAAGCAGCG GCAGCAGGCC
GTGGCGAAGT CTGAGGATTT AGCGAAGAAG GCCGCCGCTG CGGAGAAGAA GCTGAACTCC
GCCCGCGATG CCGGGGATAC GAAAGCTGTT GCGTCCGCGG AGAAGGAGTT GGAGCGGGTT
CGGGATCAGC AGGCCCGCAC CACCATGCAG GTGGTCCGGT CAGCAGATGC GGCTTCTAGG
GCTCGTCGGC AGGAGCAGCG GGAAACCCGC GAGGCTGTCC AGGCATACCG GGAGTTGCAG
AACGCTCAGG TGAGGGCTTC CCAGAGTGGC GGTTCCACCC GTATGGCGGG CGGGCTTCTC
TCGGGCATCA CGAGCCAATC ATCCGGTGTG GTAGGGCAGT TCACCAGTTT GGGCGGTTCG
GCAGGTAAGG CGTTCATCGG CGGAGCGGTC GCCGCGATTG TGGCGGGCGG CCTGGTCTCC
GCGGGGGCTA AGGCCGCGGG GATGGTGTTG GACGGGTTCA AGTCCGTCTT GGACACCGGG
ATAGATTTCT CGAAGACAGT CAACAGCTTT CAGGGCGTCA CCGAGTCTTC CCCGGCGCAG
ACGGAGCGGA TGGCCGCCGC GGCTAGGGCT TTGGGTGCGG ACACGACGAT GGCTGGGGTG
TCGGCTTCTG ACGCTGCTCG GGCGATGACG GAGTTGGCTA AGGCTGGGTT CTCTGTCGAT
GAGGCGATCT CTTCGGCGCG GGGGACGATG CAGCTTGCCA CTGCAGCCGA GATCGACGCC
GCCGAGGCCG CCGAAATCCA AGCGAATGCA ATCAACGCGT TCGGGTTATC CGCCGATGAC
GCCGCTCATG TGGCGGATGT TCTGGCGAAC GCCGCCGTCG GGTCTGCTGC GGACATCCCC
GATTTGGCGT TGGCGCTGCA GCAGGTAGGT GGAATCGCAC AAGGCTTTGG TGAAGACATC
GAAGGAACCG TCGCCGCGAT TGCGATGCTC GCCGATGTCG GCATCAAAGG CTCCGACGCC
GGCACGCTAC TGAAGACCAC ATTGCAGTCC ATCACGAAGC AAGGCGACCC AGCTCGGGAT
GCCATGGAGG CGTTGGGGCT GTCGCTGTAC AACCTTGACA CCGGGCAGTT CGTCGGGTTC
CGGGAAATGT TCCGGCAGTT GGACGAAGCC CGCGCCCGCC TGCGTCCGCA AGATTTCCAG
GCCCAGACGA ACATCCTGTT CGGATCTGAT GCGATGCGCT CCGCCATGTT GGGGACGGTC
GCGGACTTCG ACACGATGGA AGCGACGATC AACCGTGTCG GCACGGCCGG AGACATGGCC
AGAGCCAAGA TGCAGGGCTG GCCCGGCATC ATGGAGGGGA TCAACAACAC CATCGGCGAA
CTGAAGCTCT CACTCTTCGA CGACATTTTC AACACCCCCG CCGGGCAGGA GTTCGGCAAC
AAGATCGTGG AATCCCTGGA CGGTCTGGTG GAGTGGGTGA ACACCCACAA GCCGGAGATC
ATCGGCTTCG TCGCGGCGAT CGGGTCGGCG GGCGCTTCCA TCGCTGACAC GTTCCTGATG
TTCGGTGCCC GCATCATGGA CACGGGCGCC ACCATGATCG ATTTCGTGAA CATGGTGTTC
ACGTCGATGA TCGAGGGTGG ATCGAAGACC GCGCAGTTGT TCGGCGGGAT CATCAAGCAC
ATCCCCGGCT TCCAAAGTGT CGGCGAGGGC ATCGAGGACA TGGGGGCGAA GTTCGACGGC
TGGGCCGACA AACTGCAGGC CCTTCCGGGT CAGATGCGGA CCGCCGCGAA CGGTCTCGAC
TCGTTCCGCG ACGGTATTCG GGGGATGCGC GACGATTTCG TCGGCTCCAT GGGTGAGATG
GCGTTGGCGG AGCAGAAGAA CCGTTTCTAC GCGCAGTCGT TCAAGCAGAT TCAGTCTGCG
GTGGAGTTGA TCCCCGAAAC GAAGCAGATT GTGGTTGCGG ACAACTCGCC TGAGGTGAAG
CAGAAGCTCA TCCAGTTGGG TTTCGCTGTG CAGACGTTGC CGAACGGGAA ACTGGTCATC
AACGTTGAGT ACCGCGATCC CAGTGGGAAG CTGGTCGACC CGTCCCAGTT GGGGGTTTCT
CAAAGGCAGT TGGATGACCG GGATTCCCGC CAGCACGACT GGGGTATCGA CCCGCCCGCC
GGGCCGGCAC CCTTGGGAAC GCAATCCATC CCCGCAGGTG GGGGTTCCTC ATCTTTGCCT
GACGCCCCGG TGTTGCCGAT CAACTACACC AACACTGCGG GGATGACCGC CGAGTTGGCG
TCGGCGCAGA GTCGGGTGGA TGAAACCCGA CACACGTTGG CGGAGAAGCA AGCGAGGCTG
AATCAGCTTC TCGAGTCCGG TGTCGCGGAT GAGGCTGAGA TTCAGAAGGC CCGCAACGAT
GTCGCGAAGG CCGGGCAGGA TGCCAACGAG GCGCAGATGC GGTTCGTGGA TGCGCAGAGG
AAGGTCAGCG AGAAGCAGTC TAACCAACTC AAGGGTGCCA CCACGGATTT GAATGAGTTC
GGCGCCCAAC TGGATTCCGA CTTCGGGATT TCGAAGGGTT TGGCTGGCAT CGCGGACAAC
CTGGTGAGGT TCCTCGGCGC GCTGGCTCTG GCGGGTCCGG TGGCGAAGCT GCAGCAGATC
TCCGACGCTG CCGGGGATGA GGGCTCCGGT TTGATGGGGA TTCTCGCCTC CAACGGGGCG
TTCGGGCCGC AGTTTATGCC GGGGGCGGGA CGGGGTTCGA GTTACGTCGG CACGCCGTAT
GGGTCGGCGG GAATCCCAAG GGGAGGCGCC CCGACTGAGG ATCAGGTTAA GCAGATCGCC
GCCGCGTTCG GGCTGCAGGT TACTTCCGAG GATCGCCCCG GCGACCCGGG ATATCACGGC
CAGGGGATGG CTCTCGACGT CTCGAACGGT TCCGGGAACA CACCGCAGAT GCGGGAGTTC
GCCGAATACA TGAGCACGAA TTTCGGTTCC TCGCTGAAAG AGTTGATCTA CAGTGACGGG
TCGTTCTCCG GACTGATTGG TGACGGCAAG AACGTCACCG GCACTGGCTA CTACGATTCA
GGGACCCTCG CAGAGCACCA AAATCACGTT CACGTCGCCG CAGATTGGGG CGGAAGTGCC
TTGCAATCCA GCGGTGGCCC GGTCCCGGTG AACGTCGTCA ACGGCGGCAG CATCGCAGCG
CCACTCACGT CGGCAATAGG GCAGTGGTCG GCCGACTGGA ACGCCATTGC ACAGGCCGAG
TCGGGCGGCA ACTGGTCCAT CAACACCGGA AACGGCTATT CGGGCGGTTT GCAGTTCTCC
CCGTCGAGTT GGGCTGCTGC GGGCGGTACT CAGTACGCCC CGTCTGCCTA TCAAGCCTCC
CCCTACCAGC AGGCACTCAC CGCAGAGCGC CTACTTGCGA TGCAAGGGCC TGGCGCGTGG
CCGAACACAT TCGTGCCGGG CAGCACGGGG CCAAGTCCCG ATGCGGTCGG ACCGGCCGGG
CCTCTTCCGA GTTTCATTCC CGGCGGCGCC GCTGGTGGGC CGTTGGGTAC TGGTTTCCCG
CAAGGGCTCC CAGGTTTGGG CGGTCAGGCG TACCCCGCCC AAGGTGGCGA AGGCGGCGTC
GGGATGGGCG GTATGGCGAT GGACGCCGCG ATGCTCGGAA CCAGCGCCCT GGACATGATG
GCCCCCGGCG CTGGTGCTGC CGCGAAGGTC GGCATCCAGT TGGCGAACCG GACCATCAAA
TACGCCGGGC AGGTCGCCGG CATCGGCGTT TCCGGTTTGT TGGAGACGTT CTCCCCCGCA
GGGGACAACC CGAAGGCTTC TATCGGAAAT TCGTGGCTGG GGAAGATCAT GGGCGGCCTT
GCTGGGGCGG CTCCTTCGTT GCCGAACATG GCGGGCGGGA AGAAGCCCGA CGCCATCAAC
GGTGGGGACG CGCAAGCCGG AGGTAAGGCG GGCGGCAACA CGGTCAGCAT CACCAACAAT
CTGACGAACA ATCACGCCAC TGAGGACATG GTCGGCAACC AGCTCGTTCG TGAACAGGCC
GCAATGTACA CACAGTCAGG TGTCCAGTGA
 
Protein sequence
MPITLKVEAQ ADNRSFKQAA DQAERVFADA GKAASGSFAK AFGEGSKEVK QATSQAVKAY 
DAVADAVGKA TVAEKQRQQA VAKSEDLAKK AAAAEKKLNS ARDAGDTKAV ASAEKELERV
RDQQARTTMQ VVRSADAASR ARRQEQRETR EAVQAYRELQ NAQVRASQSG GSTRMAGGLL
SGITSQSSGV VGQFTSLGGS AGKAFIGGAV AAIVAGGLVS AGAKAAGMVL DGFKSVLDTG
IDFSKTVNSF QGVTESSPAQ TERMAAAARA LGADTTMAGV SASDAARAMT ELAKAGFSVD
EAISSARGTM QLATAAEIDA AEAAEIQANA INAFGLSADD AAHVADVLAN AAVGSAADIP
DLALALQQVG GIAQGFGEDI EGTVAAIAML ADVGIKGSDA GTLLKTTLQS ITKQGDPARD
AMEALGLSLY NLDTGQFVGF REMFRQLDEA RARLRPQDFQ AQTNILFGSD AMRSAMLGTV
ADFDTMEATI NRVGTAGDMA RAKMQGWPGI MEGINNTIGE LKLSLFDDIF NTPAGQEFGN
KIVESLDGLV EWVNTHKPEI IGFVAAIGSA GASIADTFLM FGARIMDTGA TMIDFVNMVF
TSMIEGGSKT AQLFGGIIKH IPGFQSVGEG IEDMGAKFDG WADKLQALPG QMRTAANGLD
SFRDGIRGMR DDFVGSMGEM ALAEQKNRFY AQSFKQIQSA VELIPETKQI VVADNSPEVK
QKLIQLGFAV QTLPNGKLVI NVEYRDPSGK LVDPSQLGVS QRQLDDRDSR QHDWGIDPPA
GPAPLGTQSI PAGGGSSSLP DAPVLPINYT NTAGMTAELA SAQSRVDETR HTLAEKQARL
NQLLESGVAD EAEIQKARND VAKAGQDANE AQMRFVDAQR KVSEKQSNQL KGATTDLNEF
GAQLDSDFGI SKGLAGIADN LVRFLGALAL AGPVAKLQQI SDAAGDEGSG LMGILASNGA
FGPQFMPGAG RGSSYVGTPY GSAGIPRGGA PTEDQVKQIA AAFGLQVTSE DRPGDPGYHG
QGMALDVSNG SGNTPQMREF AEYMSTNFGS SLKELIYSDG SFSGLIGDGK NVTGTGYYDS
GTLAEHQNHV HVAADWGGSA LQSSGGPVPV NVVNGGSIAA PLTSAIGQWS ADWNAIAQAE
SGGNWSINTG NGYSGGLQFS PSSWAAAGGT QYAPSAYQAS PYQQALTAER LLAMQGPGAW
PNTFVPGSTG PSPDAVGPAG PLPSFIPGGA AGGPLGTGFP QGLPGLGGQA YPAQGGEGGV
GMGGMAMDAA MLGTSALDMM APGAGAAAKV GIQLANRTIK YAGQVAGIGV SGLLETFSPA
GDNPKASIGN SWLGKIMGGL AGAAPSLPNM AGGKKPDAIN GGDAQAGGKA GGNTVSITNN
LTNNHATEDM VGNQLVREQA AMYTQSGVQ