Gene Mvan_5042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5042 
Symbol 
ID4644779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5395300 
End bp5398020 
Gene Length2721 bp 
Protein Length906 aa 
Translation table11 
GC content67% 
IMG OID639808513 
Producthypothetical protein 
Protein accessionYP_955820 
Protein GI120405991 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.745587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCTGA CGCGTGGTGG TCGTCCGGCG GGAAGCAGTC GACCCTTTCG GCCGACGCTG 
CGCTGGGCAC AGGTCGGGGT GGCCGGCACC GGGATGGCGT TTGCGCTGCT GGCCGCACCG
GCTGTGGCCA CCGCCGACAC GGGCGCCGAC GAGGGAGGCG CCGACTCGTC GGCCACCACG
GATGCGGCGT CGACGCCCTC ACCGCGTGCC GATTCACTTT CACGGCACAC CGCTGTCGGG
GACAGCACCG GTGAATCCGA CGCTCGCTCC GCCGAAGACG AGGCAGAGCG GGTGCTCGGC
ACCCTTCACC CGGAGACCTC GGAACGTGCC GTGTCCCCAC CAGGCCCGAG GACCGAGCAC
GAGGCGCTGT CAGACGATCC GGAGCCCGCG GTAGAACCGG ACCCACCGGA GGCGGGCCCA
GCGCTGGACA CCCCATCGCG GGACACAGCC CCCACCCCGG GGAAACCGGA ATCCACCCAC
GATCCCCAGG ACGGTGACGC CGAGACCGCG GCTGACGGGG CCCGGGCAAC TAGCGCAACG
TCGACACCGT CGGCCCGGAT GACCCTCCGT GGCCCGACCG CGCCCGGTTC CTCCACGGCC
CCGCTGTCGG ACACCGCACA GATGGCGAAC GAAGCTGGAA GCCCGATGCT CTCACCGGAC
CCGCAGGACG AATCCGTGAC TGAACCAGGG GGCGTGGAGG CGCTGCCGCT CGGCGCTCTG
CACGCCGCTG CTTCTTCGAC CAGCTACCCG GCCTATCCGG CGCCGGTGGA CGCACCGGTG
ACCTGGCGCT CGATCGTCTC CGACGCCCTG TCCTGGATCG GGCTGGGAAT GGCGACCGAC
CAGCACATCC CAGATGGCCC GATCAATGAT CTGCTGGCCG GGCTGTGGGT TGGCATGCGC
CGACTGCACT ACACGTTCTT CAACTCGTCA CCCGAGCTGG ACCACGGCGC CGCCACCGAA
GATCCCGACA CCGGGATCAT CACCGGCGAC CTCGAAGCCC ATGACGCCGA CGGTGATGTC
ATCACCTTCG TGCTGACCGG CGCACCCACT CACGGCACAG TGAGCTTCGC CGAAGACGGC
CGGTACACCT ACATTCCCGA CACCGCCTTC GCCGCCACCG GCGGCACCGA CACCTTCACC
GTCACCGCCA CGGACACCGG CGGCGCCAAC CCGTGGCACA CCAACCTTCC CCGCCTGCTC
TGGTCGGCCC TGAGACCGTT GCTGTCCGCA CTCGGATTCA CCGCGCCTGT TGATTCCTCC
AGCACGACCA CCGTCACCGT CACCGTCACC CCGCAGGCCT GCAGGACCGA CGGCGCCGGA
GCTGAGTGCG CGGCGGCCAG GGCGCCAAAG ATCACCTTGC ACAACAACTC TGAGCACACG
ATCTGGGTGT ACAACCTGCC GAGCTCCGGC GACTACAGCA TCGGCGCGGA CTTCACCCCG
GTGTCCATCG CGAAGGGCGC CAGCGCACCG GTGACCCTCG CCGTCGGCAC CGGGTCACCC
GGTTCACCCC AGAACCGGAT CTACATCGTC GAGGGCGAGA CCGGTTTCAC GCTGCCGGTC
AGCTCGTCAT CCGGGGTGGA CGCATTCAAC CCGACCGCAC CGTCGGAAGG AAACTCCTTC
CTGAACTACA ACTTCGTGGA GTACTACCTC TATCCCGATG GCGGCGGCGG ATACCAGTAC
ACCATCGACA CCTCCTACAT CGACGAGTGG TCGCTGCCGA TCCAGTACAA ATTCACGCTC
AACGGCGCCC GGTGGTCGGG GGCCGTTGAC GGACATACCT ACGGCTTCGA CGACTACGAC
ACGGTGGTCA ATCAACTCAA TGCCGCCGGG GGGCCCTACA AGCATCTGGT TTGGGGCGGC
GGCACACCGT GGGCCCCCCA GCCGCCGTCC ACGGTGCATC GAATCATCGG GCCCGACAAG
GTCTGGACCG CACAGGCGAG TCAGCCGGCA AGCAATGTCA ACATGAACCA CGTCGGCTGG
GTGCCCACCT CGTATCAAGA CTTCGTCCAA TACGACTCCC ACACGGAACC GGACGGACAC
GTCGTCTACC CCTACGCGCA GAACGGCACG AAGTACTCTC GCGACGGCAA TTTCAGCTTC
TGGAAGAACG AAGTGGATGC CCCGGCGTCC ACGCCCTATC CGATTGCCTT GCGCACGGCC
GCCGTCCTCG ACGGCTTTCC CGCCAAGAAC GGTGTGTACG GATTCTTCAC CTATCCCAAT
GACGAGACGG CTGGCCAGTT CACCAACATC CCCACGTCGG TGTCCCTCGA CATCTACGTT
CACGGCTCCT CAGACGGGGT CAGCGACAGT GTGATCGAAG GGGGCAGCTG GTTCTACACC
AGCACGACTT CACCGTCCGG GCGGGGGTTG GCGAATCGCC GGCACGTGGT CACCGGGTCG
AGCGCCACCG ACACCTTCAT CCTGGATTCG GTGTTCACCC GCAGCCGAAC CGCACCGGTC
GTCGTCGCCG AGGCCGTCCA GGGCGACATC GTGGTGATCG ACCGGACAGC TTTGGGGGCA
ACCAGCTACG AAGTGGACGT CGTTGACCGC GCGTGGTTCC TCGGGGGCGG GCTCGCCAAG
TACGACAGCC AGTTCGTCTA CGACCGCTCG ACCGGAATCC TGTACTACGA CCAAGATCCC
GACCGGTTCG GCTACACCGG CGTCCTGGCC AACCTGTCGT GCAGCTCTGC CGACGCGGCC
AGCGTGGTGT TCGTGCTCTG A
 
Protein sequence
MSLTRGGRPA GSSRPFRPTL RWAQVGVAGT GMAFALLAAP AVATADTGAD EGGADSSATT 
DAASTPSPRA DSLSRHTAVG DSTGESDARS AEDEAERVLG TLHPETSERA VSPPGPRTEH
EALSDDPEPA VEPDPPEAGP ALDTPSRDTA PTPGKPESTH DPQDGDAETA ADGARATSAT
STPSARMTLR GPTAPGSSTA PLSDTAQMAN EAGSPMLSPD PQDESVTEPG GVEALPLGAL
HAAASSTSYP AYPAPVDAPV TWRSIVSDAL SWIGLGMATD QHIPDGPIND LLAGLWVGMR
RLHYTFFNSS PELDHGAATE DPDTGIITGD LEAHDADGDV ITFVLTGAPT HGTVSFAEDG
RYTYIPDTAF AATGGTDTFT VTATDTGGAN PWHTNLPRLL WSALRPLLSA LGFTAPVDSS
STTTVTVTVT PQACRTDGAG AECAAARAPK ITLHNNSEHT IWVYNLPSSG DYSIGADFTP
VSIAKGASAP VTLAVGTGSP GSPQNRIYIV EGETGFTLPV SSSSGVDAFN PTAPSEGNSF
LNYNFVEYYL YPDGGGGYQY TIDTSYIDEW SLPIQYKFTL NGARWSGAVD GHTYGFDDYD
TVVNQLNAAG GPYKHLVWGG GTPWAPQPPS TVHRIIGPDK VWTAQASQPA SNVNMNHVGW
VPTSYQDFVQ YDSHTEPDGH VVYPYAQNGT KYSRDGNFSF WKNEVDAPAS TPYPIALRTA
AVLDGFPAKN GVYGFFTYPN DETAGQFTNI PTSVSLDIYV HGSSDGVSDS VIEGGSWFYT
STTSPSGRGL ANRRHVVTGS SATDTFILDS VFTRSRTAPV VVAEAVQGDI VVIDRTALGA
TSYEVDVVDR AWFLGGGLAK YDSQFVYDRS TGILYYDQDP DRFGYTGVLA NLSCSSADAA
SVVFVL