Gene Mvan_1720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1720 
Symbol 
ID4648104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1824570 
End bp1830419 
Gene Length5850 bp 
Protein Length1949 aa 
Translation table11 
GC content63% 
IMG OID639805209 
ProductYVTN beta-propeller repeat-containing protein 
Protein accessionYP_952549 
Protein GI120402720 
COG category[S] Function unknown 
COG ID[COG3391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01965] VCBS repeat
[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGATCAC GTTCGGCTGC GCATGGCGGC AGAAGTTGGC GCAGCGGACG CCATCGCAAG 
CAGCGCAGAA TCGAACCGTA TGCCTGGCTT GGCGCCGGGG CGGTCACCCT CGGCATCGGC
GCCGCAGCAC TGAGCGGCGC AGGCATCGCG GCGGCCGACG ACCCTAGCAC AGACTCGACA
TCAACTTCAT CAAGCACCGA GGACAACACC TCGTCGGACC CACGAACGCC GTCTTCAACC
CGTAGCACCG ACGCGGAAGA GCCCTCCAAC GACAAGAACA TCGAGCCGCC CGGCAACGAC
AGCGACGATG AGGCCGACCA CGAGGTCGAC ACCACCACCG AGACCGACGC TGCTGCTCGG
GACGGTGCTG ATGGCGTCGA TGAGGAGCCG CGCAGGAATA AAGGACGCGA GGACGGGATC
GAGACATACG ATCCATCGCC TTCGGTCGAC AAACAAGACA ACGACACGGA AGCGAACGAG
CTCACAGCCG TCGTCGACGA GGATGACGCA CCACGCCCGC GCGGCGCCGA AAGCCAACAG
CGACCCGCCG TTGGCGCGGT CTCACCAGCA CTCACGAATA CTCAGTCAGC TCCGGTTCCC
CTTGCGCCCC ACGAAACGAC GACAACACAG CTGTCACCGG AGACACCAGA GAATGCCACA
ACCACTCGGT TACTGGACGC CCCCACAGAG CCGTTCGACT CCCCCCTGGC CTGGCTCTTA
CTAGCCTCCT CCCGCCGACA AATCGGCCGA GTCGCAGACG AGGATGCGCA CTCACCGGCC
GCCGATGCCC TGGCGGTTGA CGCAGAAAAC ACCGCGCCGA CAGCGGCGCT CAGAGGGCAG
AGTTCACCAG GCTGGTTCAC CGGGCGGGTC ACGGGGCGAG TCGCTGCCAG CGATGTTGAC
GGCGATCGAC TGTCCTTCAC CGGCATGACG ACCGCCAAGG GCACCGTGAC CGTGACGCCC
TGGGGCACGT TCACCTATCG CCCAAGCAAG GCCGCCCGCC ATGCAGCCGC GGCTACCACG
GCTTCGGACG CAGAGAAATT CGATACGTTC GCCATCACCG TCAGTGACGG CAACGGAGGA
TGGACCGAGG TACCCGTCAC GGTCGCGGTC CGGCCGGTGA ACAGCAGTCC GTCGTGGTTG
AGGTCGACGA AGACCAAACC TAATCCCGTC AGCGGGGAGG TCAACGGCCG TATCATCGCC
ATCGACCGCG ACGGCGACGC TTTCACCTAT ACCGCATCCG CACCCGGCAA GGGCGCGGTC
GTCGTCAACC TCGATGGCAC GTTCAGCTAC ACCGCTTCGG ATTCGGCACG GGCCGCCGCT
CGCAACACCT GGTACACCGA CACCGACCGC TTCATCGTCG TCGTCAATGA CGGTCACGGC
GGCACGAGGT CGGTCAGCGT GCGTGTTGAA GTCGCACCGA GCAACAACGC CCCCACAACG
GGGGCTCCGA ACTTGGACGC ACCCGATCCT GGTACCGGCG CCGTGCGGGG TACCGTCAAT
GCCGTTGATC CCGACGGCGA CCGGATCACC TACCGTCGCG AAACCATCAT GACTGCCAAG
GGTGTGCTCA CGATCGGGAG CACCGGCGCC TTCAACTACA CGCCCACCAC CGCCGCGCGG
CATGCCGCCG CGACCTCCGA CCCGAACTTG GCTGCTGACA GCGTGACGAT CACCGCCCGG
GATCGCTTCG GCGGGGTGGG GGCCATCGTC TTGACCATCC CGATCGCGCC GACCAACACA
TCACCTGTCG GGAGCGCTGC GGGGGCTGGT GTTACCGATC CCGTCACCGG AGTGGTCAGA
GGCACTGTGA CCGCCACGGA CTCCGACGGA GATGACCTGC TCTTCAGCGG AACCACCACG
ACGGAAAAGG GCAGTGTCGT CGTGGATTCC ATCGGCACCT ATACCTATAC ACCGACTACT
GCCGCCCGCC ACCACGTCAA CGCCGAGGAT GCGACCGACG CCGACAGGCG CGACAGCTTC
GTTGTCACCG TCACCGACGG GCACGGCGGG ACCGCGCAAG TCCTGGTTTC TGTCGCGATA
GCCCCGTCGT CCAATCAGGC GCCAGGCGGT GTTTCATACA GCGCAAACCC GAACACGGAC
ACGGGCGTCG TCGAGGGTCG TGTCACTGCG ACCGACCCCG AGGGTGACAG CCTGACGTTC
TCCGGGTCCG CAGAAACCGG CAAGGGCACC GTAGCTGTCG CGCCCGACGG GTCCTTCGTC
TACACGCCCA CCGATGATGC GCGTTTGAGC GCCGGCGCGC CGGGTGCGCC TGTGGCCTCG
AAGGAAGACT CCTTCGTCGT GGACGTGAGC GACGGTCACG GCGGAACCAC GAGCATCTCG
GTGTCTGTCG AGATCGTTCC CCTCATCGAC AACGAATCTC CAGTCGCCGG TACACCGATC
GTGGGCGATC CCACCCCCGG AACGGGAGTC GTCAGCGGGA CACTGGGTTT CACTGATCCA
GAGGGATCGT CACTCGCCTA TACGGTGACC GGACCACCCG CCAAGGGCTT GGTCTCGATC
GACACGACGG GCGGATTCAC CTACACGCCG AACCCGGAGG ATCGGCCGGA GGCCGGGGAG
GCACCAGGGT ATGACGCCTT CACCGTGGTG GCCACCGATC CGCAGGGATT GGCAGCCCAG
GTCACGGTGG ATGTTGTCGT GGCACCGCTG CTCCCGCCAA GTGACGGACC CGTCGTGGGA
ACTCCCCCCT ATGACATCGA CTCGGTCGAC GAGGTCACAG GTGTCATCAG CGGACATGTG
ACTGCGTATA CATCAAACGG CACCGCCTTG ACCTTCGCGG TAGCCGAATC GCCGGACGTC
GCGTCTGGGC GTGTGGCGCT CGATTCGGCA ACAGGCCGTT GGACATTCAC GCCGACCGCA
TCGACTTTGG TCAGCGCATG GTCCTCAGAC ACTCCCACGC CGGTCACGTT CACCATCCGA
GTAAGCGACG GCGAGAAGAG CACCGACATC ACCGTCTCCG CCACCGTTTC TCCGTCGGAG
CAGGCGGTCA TCAACAGCGT CGAGTACCTC GGTAGCGAGC CGTCCGGCGT CACCGTCGGC
CCAGATGGCC GGATGTATGT CATCGACTCC GGCGCAAACA CGCTGTCCAT CATCAACCCC
GCCGACGCGT CCATGATCAC GGTCACGGTC GGGAAGAACC CCACCTCCGT CGCGTCCGAC
GATCTCGGAC GACTCTGGGT GACGAACTCG GGTGACCACA CGGTGACCGT CCTGAACGCC
GAAGCCGAGA TTCTTCGTAC CGTTCAAGTG GGGCTGGTAC CGGCCTCCGT CGTCATCAGG
GGCGACCTCG CCTATATCGC CAACTTCGGC GGCAACAGCC TGTCGGTCAT CGATGCCGCC
GATGACTACA GCGTCCGTTC CGTCGATGTC GGCACCAATC CGATCGACAT CGCGATCGGG
TCTGACGGGC GGATATACGT CGCCAACTTC GGTAACGGCA CAATCTCTGT GCTGAGGCCT
GACGATATGG ACGATGTTCT ACTCGTCGAC AGCGGCGGTG AACATCCGCA CGGAATCCTC
GTCGATGACG ACGGCACCGT TTACGTCACC CATCCGCTCG ACGACACGGT GACCGTGTTG
AATCCTGCTC CAGTGAACAG GTTCTCGCTA CGCTCGCTGT TCTCGGCCGA CGCAACCTCG
GGCCAGTACA CCTATCGCAG CGTGACGGTG ATCGGCGCAC CGACCAAGAT CACCAAGGAC
ACTGCCGGCC GGATCTATGT AACCAACAGT TCCGGCGCAA CCATCACTGT GCTGGATCCA
TTGACGTTGG CAGCCAACGA AATCCACACC GGCGCGAACC CCAGCAGTGT GTATGTCGAC
CGCTTCGGCA ACCTCTATGT CACCAATGCC GGCCAGAAGA CGGTCGCCGT CATTCACGCC
CAGACCCGCA ACATCACGAC CTACCGGGCG GACGTCAAGA CATCCCAGGT GACCACCGAT
GACGACGGAA ACCTAGTCAT GGTCAGCACT TACGACGGAC ACCGGTCCGT ACTGTCAACC
GACACCGTCA GCACCGGTGC CGACGCATTG CAGATCGCGA ACGTGTGGGG TTCCTACGGC
AGCCTGGTCG CGAGCCCGGA TGGCAAATGG CTCTACGCCG TTCGCAATAG GCGTGTCGAC
ACGACTGGCG GTTTGTATCC GAACCGGTAC GACATCGTGG CGATCGACAC CTCCAACAAC
TCGGTCAGGA CGTTCGCGTG GGATGCGCAC ATCAACGGTT CGATGGTCTT GACCATGAAC
CGCGACGGCG ATCGTTTGTA CGCTGCGTAT TACACGCTGG ACCCAGAGGT TTTTGACGGT
GTCGTTGCGA AGGTCGCGAT CATCAACGTC GATGGCTCGA CATTGCAGTT GGACGGCGCC
CCTGTGGAAT TGAACCTGGG TAGCACTACT GCGTTTGTCT CCGACATCGT CGTGAACCCG
GCAGGCACCA AGGTCTTCGT GATCGGGGAC CGGCTGGCGG TGCCCGGTCA GTGGGATGCA
CCGGAAGGCC AGTGGAACAT GTACGGCAAC CTGTGGATGG TCGATCTCGA GAACGATCGC
GCCGTCTCGA GGCCCCCAAT CCCTAGGCCT CTGCAGGCGG AGGGCGGGGT GCCGGACCTA
GCCATATCGC CCGATGGCAG GTACGCCTAC CTAGTGATGA CGCCCCTGCC TGGCTGGGAG
GGCAACCATT ATCTCGGCAT CTTCGACACT CAAAGCTTTG CACTGACCAC CGTTCTCCTC
GGTCACTTCG AAGCCCCGAA CGGCGTTCGA CGTCTGGTTG TCAGTCCGGA CGGCACGCGC
ATCTACCTCT CCGACGGTCA GATCGTCGGA GTGACCGGCA CTCAGCATGA ACTTCTCGAG
GACCGGATCC AGGGCATCGA CCAGGAACGC TTCGTCGACA TCGCGATCAA TCCCGAGGGA
ACTCGCCTGT ACACCACCAC TTGGGACGGC GTGCTGACTA CCATCGACAC GTCGACAGGT
CTGCCGGTCG GTAACCCGAC CGATCTGGGT AACCGCGCCT GGGGCATGAC CTTTGGGCCG
AAGGGCGACA CGCTGTATGT CCGCCGCGGA GTCGACTACG TCGCCTCCCA TCGGGCCCCG
CAATCGATCG CAGACCTCTG GGAAAACGTC AGGGACCTGC CGAACGGCGA CAACGAGGGC
ATCTTCACCC AGGTCGTGCG CCATGCGGAC GGCACGAACC GGATGGTCGT CTACCTCGGC
GGTACCAACC CACGCAACTG GCTTGTCGGC GAGCAGGCTA TCGGCGAGAA CGTGGTGAGC
GAACTCGGCA TCCTGAAGGA TGAGCACCTC GGTGCGATCA CCCGCGCCCT GGCGCATTGC
CGCAATGATG CGACATGCGG CGACATCGCC GATGTCATGC TCGTGGGCTT CAGCCAAGGC
GGAATCGACG GGCAGAACTT CGCATCACAA TGGGACCGGT TGGGATTCGG CGTTCAGCTG
TCGGCACTCG TTACCTTCGG AAGCCCGATC ACGAAGAACC CGAACGTGCC CACGCTGCAC
ATCCAAGATA TCGATGATGA GGTGGTCAAC ACTGAGTTGT TGGCCCGTCT CGCGGCTACA
GTGCAGTACG TACCGATGCC CAGTTGGCTG CGCATCAATT TCGCTGCAGC AACCGTCACC
GCGATGGCTC GGGGTCAGCT CTACCAAGGC GCCGCCGACA CCGACATCAG TGTGCTCGAC
ATCTTCGCCG TGCACGGAAA CCGGGCGACT TACCAGCAGC TTGCATCCGA AATGGACGCC
CGCACCGGGA CCGACTTCGC GGCCCTGGGA CCGGTAAGGA CGTTCTTCGG CGGCACGGCC
CTGATCCCCG GCGATCCCAC GCCCTTGTGA
 
Protein sequence
MGSRSAAHGG RSWRSGRHRK QRRIEPYAWL GAGAVTLGIG AAALSGAGIA AADDPSTDST 
STSSSTEDNT SSDPRTPSST RSTDAEEPSN DKNIEPPGND SDDEADHEVD TTTETDAAAR
DGADGVDEEP RRNKGREDGI ETYDPSPSVD KQDNDTEANE LTAVVDEDDA PRPRGAESQQ
RPAVGAVSPA LTNTQSAPVP LAPHETTTTQ LSPETPENAT TTRLLDAPTE PFDSPLAWLL
LASSRRQIGR VADEDAHSPA ADALAVDAEN TAPTAALRGQ SSPGWFTGRV TGRVAASDVD
GDRLSFTGMT TAKGTVTVTP WGTFTYRPSK AARHAAAATT ASDAEKFDTF AITVSDGNGG
WTEVPVTVAV RPVNSSPSWL RSTKTKPNPV SGEVNGRIIA IDRDGDAFTY TASAPGKGAV
VVNLDGTFSY TASDSARAAA RNTWYTDTDR FIVVVNDGHG GTRSVSVRVE VAPSNNAPTT
GAPNLDAPDP GTGAVRGTVN AVDPDGDRIT YRRETIMTAK GVLTIGSTGA FNYTPTTAAR
HAAATSDPNL AADSVTITAR DRFGGVGAIV LTIPIAPTNT SPVGSAAGAG VTDPVTGVVR
GTVTATDSDG DDLLFSGTTT TEKGSVVVDS IGTYTYTPTT AARHHVNAED ATDADRRDSF
VVTVTDGHGG TAQVLVSVAI APSSNQAPGG VSYSANPNTD TGVVEGRVTA TDPEGDSLTF
SGSAETGKGT VAVAPDGSFV YTPTDDARLS AGAPGAPVAS KEDSFVVDVS DGHGGTTSIS
VSVEIVPLID NESPVAGTPI VGDPTPGTGV VSGTLGFTDP EGSSLAYTVT GPPAKGLVSI
DTTGGFTYTP NPEDRPEAGE APGYDAFTVV ATDPQGLAAQ VTVDVVVAPL LPPSDGPVVG
TPPYDIDSVD EVTGVISGHV TAYTSNGTAL TFAVAESPDV ASGRVALDSA TGRWTFTPTA
STLVSAWSSD TPTPVTFTIR VSDGEKSTDI TVSATVSPSE QAVINSVEYL GSEPSGVTVG
PDGRMYVIDS GANTLSIINP ADASMITVTV GKNPTSVASD DLGRLWVTNS GDHTVTVLNA
EAEILRTVQV GLVPASVVIR GDLAYIANFG GNSLSVIDAA DDYSVRSVDV GTNPIDIAIG
SDGRIYVANF GNGTISVLRP DDMDDVLLVD SGGEHPHGIL VDDDGTVYVT HPLDDTVTVL
NPAPVNRFSL RSLFSADATS GQYTYRSVTV IGAPTKITKD TAGRIYVTNS SGATITVLDP
LTLAANEIHT GANPSSVYVD RFGNLYVTNA GQKTVAVIHA QTRNITTYRA DVKTSQVTTD
DDGNLVMVST YDGHRSVLST DTVSTGADAL QIANVWGSYG SLVASPDGKW LYAVRNRRVD
TTGGLYPNRY DIVAIDTSNN SVRTFAWDAH INGSMVLTMN RDGDRLYAAY YTLDPEVFDG
VVAKVAIINV DGSTLQLDGA PVELNLGSTT AFVSDIVVNP AGTKVFVIGD RLAVPGQWDA
PEGQWNMYGN LWMVDLENDR AVSRPPIPRP LQAEGGVPDL AISPDGRYAY LVMTPLPGWE
GNHYLGIFDT QSFALTTVLL GHFEAPNGVR RLVVSPDGTR IYLSDGQIVG VTGTQHELLE
DRIQGIDQER FVDIAINPEG TRLYTTTWDG VLTTIDTSTG LPVGNPTDLG NRAWGMTFGP
KGDTLYVRRG VDYVASHRAP QSIADLWENV RDLPNGDNEG IFTQVVRHAD GTNRMVVYLG
GTNPRNWLVG EQAIGENVVS ELGILKDEHL GAITRALAHC RNDATCGDIA DVMLVGFSQG
GIDGQNFASQ WDRLGFGVQL SALVTFGSPI TKNPNVPTLH IQDIDDEVVN TELLARLAAT
VQYVPMPSWL RINFAAATVT AMARGQLYQG AADTDISVLD IFAVHGNRAT YQQLASEMDA
RTGTDFAALG PVRTFFGGTA LIPGDPTPL