Gene Information |
Locus tag | Mvan_1720 |
Symbol | |
ID | 4648104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 1824570 |
End bp | 1830419 |
Gene Length | 5850 bp |
Protein Length | 1949 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639805209 |
Product | YVTN beta-propeller repeat-containing protein |
Protein accession | YP_952549 |
Protein GI | 120402720 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01965] VCBS repeat; [TIGR02276] 40-residue YVTN family beta-propeller repeat |
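The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The record does not state either convention, but both agree with its numbers. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 1824570
end_bp = 1830419
gene_length_bp = 5850
protein_length_aa = 1949

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# An unspliced CDS has one codon per residue plus one stop codon
# (assumption: the stop codon is not counted in the protein length).
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)
```

With these values the span is 5850 bp, it divides evenly into 1950 codons, and dropping the stop codon gives 1949 residues, matching the Gene Length and Protein Length fields.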
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGATCAC GTTCGGCTGC GCATGGCGGC AGAAGTTGGC GCAGCGGACG CCATCGCAAG CAGCGCAGAA TCGAACCGTA TGCCTGGCTT GGCGCCGGGG CGGTCACCCT CGGCATCGGC GCCGCAGCAC TGAGCGGCGC AGGCATCGCG GCGGCCGACG ACCCTAGCAC AGACTCGACA TCAACTTCAT CAAGCACCGA GGACAACACC TCGTCGGACC CACGAACGCC GTCTTCAACC CGTAGCACCG ACGCGGAAGA GCCCTCCAAC GACAAGAACA TCGAGCCGCC CGGCAACGAC AGCGACGATG AGGCCGACCA CGAGGTCGAC ACCACCACCG AGACCGACGC TGCTGCTCGG GACGGTGCTG ATGGCGTCGA TGAGGAGCCG CGCAGGAATA AAGGACGCGA GGACGGGATC GAGACATACG ATCCATCGCC TTCGGTCGAC AAACAAGACA ACGACACGGA AGCGAACGAG CTCACAGCCG TCGTCGACGA GGATGACGCA CCACGCCCGC GCGGCGCCGA AAGCCAACAG CGACCCGCCG TTGGCGCGGT CTCACCAGCA CTCACGAATA CTCAGTCAGC TCCGGTTCCC CTTGCGCCCC ACGAAACGAC GACAACACAG CTGTCACCGG AGACACCAGA GAATGCCACA ACCACTCGGT TACTGGACGC CCCCACAGAG CCGTTCGACT CCCCCCTGGC CTGGCTCTTA CTAGCCTCCT CCCGCCGACA AATCGGCCGA GTCGCAGACG AGGATGCGCA CTCACCGGCC GCCGATGCCC TGGCGGTTGA CGCAGAAAAC ACCGCGCCGA CAGCGGCGCT CAGAGGGCAG AGTTCACCAG GCTGGTTCAC CGGGCGGGTC ACGGGGCGAG TCGCTGCCAG CGATGTTGAC GGCGATCGAC TGTCCTTCAC CGGCATGACG ACCGCCAAGG GCACCGTGAC CGTGACGCCC TGGGGCACGT TCACCTATCG CCCAAGCAAG GCCGCCCGCC ATGCAGCCGC GGCTACCACG GCTTCGGACG CAGAGAAATT CGATACGTTC GCCATCACCG TCAGTGACGG CAACGGAGGA TGGACCGAGG TACCCGTCAC GGTCGCGGTC CGGCCGGTGA ACAGCAGTCC GTCGTGGTTG AGGTCGACGA AGACCAAACC TAATCCCGTC AGCGGGGAGG TCAACGGCCG TATCATCGCC ATCGACCGCG ACGGCGACGC TTTCACCTAT ACCGCATCCG CACCCGGCAA GGGCGCGGTC GTCGTCAACC TCGATGGCAC GTTCAGCTAC ACCGCTTCGG ATTCGGCACG GGCCGCCGCT CGCAACACCT GGTACACCGA CACCGACCGC TTCATCGTCG TCGTCAATGA CGGTCACGGC GGCACGAGGT CGGTCAGCGT GCGTGTTGAA GTCGCACCGA GCAACAACGC CCCCACAACG GGGGCTCCGA ACTTGGACGC ACCCGATCCT GGTACCGGCG CCGTGCGGGG TACCGTCAAT GCCGTTGATC CCGACGGCGA CCGGATCACC TACCGTCGCG AAACCATCAT GACTGCCAAG GGTGTGCTCA CGATCGGGAG CACCGGCGCC TTCAACTACA CGCCCACCAC CGCCGCGCGG CATGCCGCCG CGACCTCCGA CCCGAACTTG GCTGCTGACA GCGTGACGAT CACCGCCCGG GATCGCTTCG GCGGGGTGGG GGCCATCGTC TTGACCATCC CGATCGCGCC GACCAACACA TCACCTGTCG GGAGCGCTGC GGGGGCTGGT GTTACCGATC CCGTCACCGG AGTGGTCAGA 
GGCACTGTGA CCGCCACGGA CTCCGACGGA GATGACCTGC TCTTCAGCGG AACCACCACG ACGGAAAAGG GCAGTGTCGT CGTGGATTCC ATCGGCACCT ATACCTATAC ACCGACTACT GCCGCCCGCC ACCACGTCAA CGCCGAGGAT GCGACCGACG CCGACAGGCG CGACAGCTTC GTTGTCACCG TCACCGACGG GCACGGCGGG ACCGCGCAAG TCCTGGTTTC TGTCGCGATA GCCCCGTCGT CCAATCAGGC GCCAGGCGGT GTTTCATACA GCGCAAACCC GAACACGGAC ACGGGCGTCG TCGAGGGTCG TGTCACTGCG ACCGACCCCG AGGGTGACAG CCTGACGTTC TCCGGGTCCG CAGAAACCGG CAAGGGCACC GTAGCTGTCG CGCCCGACGG GTCCTTCGTC TACACGCCCA CCGATGATGC GCGTTTGAGC GCCGGCGCGC CGGGTGCGCC TGTGGCCTCG AAGGAAGACT CCTTCGTCGT GGACGTGAGC GACGGTCACG GCGGAACCAC GAGCATCTCG GTGTCTGTCG AGATCGTTCC CCTCATCGAC AACGAATCTC CAGTCGCCGG TACACCGATC GTGGGCGATC CCACCCCCGG AACGGGAGTC GTCAGCGGGA CACTGGGTTT CACTGATCCA GAGGGATCGT CACTCGCCTA TACGGTGACC GGACCACCCG CCAAGGGCTT GGTCTCGATC GACACGACGG GCGGATTCAC CTACACGCCG AACCCGGAGG ATCGGCCGGA GGCCGGGGAG GCACCAGGGT ATGACGCCTT CACCGTGGTG GCCACCGATC CGCAGGGATT GGCAGCCCAG GTCACGGTGG ATGTTGTCGT GGCACCGCTG CTCCCGCCAA GTGACGGACC CGTCGTGGGA ACTCCCCCCT ATGACATCGA CTCGGTCGAC GAGGTCACAG GTGTCATCAG CGGACATGTG ACTGCGTATA CATCAAACGG CACCGCCTTG ACCTTCGCGG TAGCCGAATC GCCGGACGTC GCGTCTGGGC GTGTGGCGCT CGATTCGGCA ACAGGCCGTT GGACATTCAC GCCGACCGCA TCGACTTTGG TCAGCGCATG GTCCTCAGAC ACTCCCACGC CGGTCACGTT CACCATCCGA GTAAGCGACG GCGAGAAGAG CACCGACATC ACCGTCTCCG CCACCGTTTC TCCGTCGGAG CAGGCGGTCA TCAACAGCGT CGAGTACCTC GGTAGCGAGC CGTCCGGCGT CACCGTCGGC CCAGATGGCC GGATGTATGT CATCGACTCC GGCGCAAACA CGCTGTCCAT CATCAACCCC GCCGACGCGT CCATGATCAC GGTCACGGTC GGGAAGAACC CCACCTCCGT CGCGTCCGAC GATCTCGGAC GACTCTGGGT GACGAACTCG GGTGACCACA CGGTGACCGT CCTGAACGCC GAAGCCGAGA TTCTTCGTAC CGTTCAAGTG GGGCTGGTAC CGGCCTCCGT CGTCATCAGG GGCGACCTCG CCTATATCGC CAACTTCGGC GGCAACAGCC TGTCGGTCAT CGATGCCGCC GATGACTACA GCGTCCGTTC CGTCGATGTC GGCACCAATC CGATCGACAT CGCGATCGGG TCTGACGGGC GGATATACGT CGCCAACTTC GGTAACGGCA CAATCTCTGT GCTGAGGCCT GACGATATGG ACGATGTTCT ACTCGTCGAC AGCGGCGGTG AACATCCGCA CGGAATCCTC GTCGATGACG ACGGCACCGT TTACGTCACC CATCCGCTCG ACGACACGGT GACCGTGTTG AATCCTGCTC 
CAGTGAACAG GTTCTCGCTA CGCTCGCTGT TCTCGGCCGA CGCAACCTCG GGCCAGTACA CCTATCGCAG CGTGACGGTG ATCGGCGCAC CGACCAAGAT CACCAAGGAC ACTGCCGGCC GGATCTATGT AACCAACAGT TCCGGCGCAA CCATCACTGT GCTGGATCCA TTGACGTTGG CAGCCAACGA AATCCACACC GGCGCGAACC CCAGCAGTGT GTATGTCGAC CGCTTCGGCA ACCTCTATGT CACCAATGCC GGCCAGAAGA CGGTCGCCGT CATTCACGCC CAGACCCGCA ACATCACGAC CTACCGGGCG GACGTCAAGA CATCCCAGGT GACCACCGAT GACGACGGAA ACCTAGTCAT GGTCAGCACT TACGACGGAC ACCGGTCCGT ACTGTCAACC GACACCGTCA GCACCGGTGC CGACGCATTG CAGATCGCGA ACGTGTGGGG TTCCTACGGC AGCCTGGTCG CGAGCCCGGA TGGCAAATGG CTCTACGCCG TTCGCAATAG GCGTGTCGAC ACGACTGGCG GTTTGTATCC GAACCGGTAC GACATCGTGG CGATCGACAC CTCCAACAAC TCGGTCAGGA CGTTCGCGTG GGATGCGCAC ATCAACGGTT CGATGGTCTT GACCATGAAC CGCGACGGCG ATCGTTTGTA CGCTGCGTAT TACACGCTGG ACCCAGAGGT TTTTGACGGT GTCGTTGCGA AGGTCGCGAT CATCAACGTC GATGGCTCGA CATTGCAGTT GGACGGCGCC CCTGTGGAAT TGAACCTGGG TAGCACTACT GCGTTTGTCT CCGACATCGT CGTGAACCCG GCAGGCACCA AGGTCTTCGT GATCGGGGAC CGGCTGGCGG TGCCCGGTCA GTGGGATGCA CCGGAAGGCC AGTGGAACAT GTACGGCAAC CTGTGGATGG TCGATCTCGA GAACGATCGC GCCGTCTCGA GGCCCCCAAT CCCTAGGCCT CTGCAGGCGG AGGGCGGGGT GCCGGACCTA GCCATATCGC CCGATGGCAG GTACGCCTAC CTAGTGATGA CGCCCCTGCC TGGCTGGGAG GGCAACCATT ATCTCGGCAT CTTCGACACT CAAAGCTTTG CACTGACCAC CGTTCTCCTC GGTCACTTCG AAGCCCCGAA CGGCGTTCGA CGTCTGGTTG TCAGTCCGGA CGGCACGCGC ATCTACCTCT CCGACGGTCA GATCGTCGGA GTGACCGGCA CTCAGCATGA ACTTCTCGAG GACCGGATCC AGGGCATCGA CCAGGAACGC TTCGTCGACA TCGCGATCAA TCCCGAGGGA ACTCGCCTGT ACACCACCAC TTGGGACGGC GTGCTGACTA CCATCGACAC GTCGACAGGT CTGCCGGTCG GTAACCCGAC CGATCTGGGT AACCGCGCCT GGGGCATGAC CTTTGGGCCG AAGGGCGACA CGCTGTATGT CCGCCGCGGA GTCGACTACG TCGCCTCCCA TCGGGCCCCG CAATCGATCG CAGACCTCTG GGAAAACGTC AGGGACCTGC CGAACGGCGA CAACGAGGGC ATCTTCACCC AGGTCGTGCG CCATGCGGAC GGCACGAACC GGATGGTCGT CTACCTCGGC GGTACCAACC CACGCAACTG GCTTGTCGGC GAGCAGGCTA TCGGCGAGAA CGTGGTGAGC GAACTCGGCA TCCTGAAGGA TGAGCACCTC GGTGCGATCA CCCGCGCCCT GGCGCATTGC CGCAATGATG CGACATGCGG CGACATCGCC GATGTCATGC TCGTGGGCTT CAGCCAAGGC GGAATCGACG GGCAGAACTT 
CGCATCACAA TGGGACCGGT TGGGATTCGG CGTTCAGCTG TCGGCACTCG TTACCTTCGG AAGCCCGATC ACGAAGAACC CGAACGTGCC CACGCTGCAC ATCCAAGATA TCGATGATGA GGTGGTCAAC ACTGAGTTGT TGGCCCGTCT CGCGGCTACA GTGCAGTACG TACCGATGCC CAGTTGGCTG CGCATCAATT TCGCTGCAGC AACCGTCACC GCGATGGCTC GGGGTCAGCT CTACCAAGGC GCCGCCGACA CCGACATCAG TGTGCTCGAC ATCTTCGCCG TGCACGGAAA CCGGGCGACT TACCAGCAGC TTGCATCCGA AATGGACGCC CGCACCGGGA CCGACTTCGC GGCCCTGGGA CCGGTAAGGA CGTTCTTCGG CGGCACGGCC CTGATCCCCG GCGATCCCAC GCCCTTGTGA
|
Protein sequence | MGSRSAAHGG RSWRSGRHRK QRRIEPYAWL GAGAVTLGIG AAALSGAGIA AADDPSTDST STSSSTEDNT SSDPRTPSST RSTDAEEPSN DKNIEPPGND SDDEADHEVD TTTETDAAAR DGADGVDEEP RRNKGREDGI ETYDPSPSVD KQDNDTEANE LTAVVDEDDA PRPRGAESQQ RPAVGAVSPA LTNTQSAPVP LAPHETTTTQ LSPETPENAT TTRLLDAPTE PFDSPLAWLL LASSRRQIGR VADEDAHSPA ADALAVDAEN TAPTAALRGQ SSPGWFTGRV TGRVAASDVD GDRLSFTGMT TAKGTVTVTP WGTFTYRPSK AARHAAAATT ASDAEKFDTF AITVSDGNGG WTEVPVTVAV RPVNSSPSWL RSTKTKPNPV SGEVNGRIIA IDRDGDAFTY TASAPGKGAV VVNLDGTFSY TASDSARAAA RNTWYTDTDR FIVVVNDGHG GTRSVSVRVE VAPSNNAPTT GAPNLDAPDP GTGAVRGTVN AVDPDGDRIT YRRETIMTAK GVLTIGSTGA FNYTPTTAAR HAAATSDPNL AADSVTITAR DRFGGVGAIV LTIPIAPTNT SPVGSAAGAG VTDPVTGVVR GTVTATDSDG DDLLFSGTTT TEKGSVVVDS IGTYTYTPTT AARHHVNAED ATDADRRDSF VVTVTDGHGG TAQVLVSVAI APSSNQAPGG VSYSANPNTD TGVVEGRVTA TDPEGDSLTF SGSAETGKGT VAVAPDGSFV YTPTDDARLS AGAPGAPVAS KEDSFVVDVS DGHGGTTSIS VSVEIVPLID NESPVAGTPI VGDPTPGTGV VSGTLGFTDP EGSSLAYTVT GPPAKGLVSI DTTGGFTYTP NPEDRPEAGE APGYDAFTVV ATDPQGLAAQ VTVDVVVAPL LPPSDGPVVG TPPYDIDSVD EVTGVISGHV TAYTSNGTAL TFAVAESPDV ASGRVALDSA TGRWTFTPTA STLVSAWSSD TPTPVTFTIR VSDGEKSTDI TVSATVSPSE QAVINSVEYL GSEPSGVTVG PDGRMYVIDS GANTLSIINP ADASMITVTV GKNPTSVASD DLGRLWVTNS GDHTVTVLNA EAEILRTVQV GLVPASVVIR GDLAYIANFG GNSLSVIDAA DDYSVRSVDV GTNPIDIAIG SDGRIYVANF GNGTISVLRP DDMDDVLLVD SGGEHPHGIL VDDDGTVYVT HPLDDTVTVL NPAPVNRFSL RSLFSADATS GQYTYRSVTV IGAPTKITKD TAGRIYVTNS SGATITVLDP LTLAANEIHT GANPSSVYVD RFGNLYVTNA GQKTVAVIHA QTRNITTYRA DVKTSQVTTD DDGNLVMVST YDGHRSVLST DTVSTGADAL QIANVWGSYG SLVASPDGKW LYAVRNRRVD TTGGLYPNRY DIVAIDTSNN SVRTFAWDAH INGSMVLTMN RDGDRLYAAY YTLDPEVFDG VVAKVAIINV DGSTLQLDGA PVELNLGSTT AFVSDIVVNP AGTKVFVIGD RLAVPGQWDA PEGQWNMYGN LWMVDLENDR AVSRPPIPRP LQAEGGVPDL AISPDGRYAY LVMTPLPGWE GNHYLGIFDT QSFALTTVLL GHFEAPNGVR RLVVSPDGTR IYLSDGQIVG VTGTQHELLE DRIQGIDQER FVDIAINPEG TRLYTTTWDG VLTTIDTSTG LPVGNPTDLG NRAWGMTFGP KGDTLYVRRG VDYVASHRAP QSIADLWENV RDLPNGDNEG IFTQVVRHAD GTNRMVVYLG GTNPRNWLVG EQAIGENVVS ELGILKDEHL GAITRALAHC RNDATCGDIA DVMLVGFSQG 
GIDGQNFASQ WDRLGFGVQL SALVTFGSPI TKNPNVPTLH IQDIDDEVVN TELLARLAAT VQYVPMPSWL RINFAAATVT AMARGQLYQG AADTDISVLD IFAVHGNRAT YQQLASEMDA RTGTDFAALG PVRTFFGGTA LIPGDPTPL
|
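The sequences above are displayed in space-separated blocks of ten characters that wrap over several lines. Before further processing, the blocks usually need to be joined into one continuous string. The following is a minimal sketch of that step, not the database's own tooling. The helper name `join_blocks` is hypothetical, and only the first three and last two blocks of the gene sequence are used to keep the example short.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(text.split()).upper()


# First three and last two blocks of the gene sequence, copied from the record.
head = join_blocks("GTGGGATCAC GTTCGGCTGC GCATGGCGGC")
tail = join_blocks("GCGATCCCAC GCCCTTGTGA")

# Stop codons in translation table 11 (the table listed in the record).
STOP_CODONS = {"TAA", "TAG", "TGA"}

start_codon = head[:3]
stop_codon = tail[-3:]

print(len(head))                   # three blocks of ten bases
print(start_codon)                 # GTG, an alternative start codon in table 11
print(stop_codon in STOP_CODONS)   # the sequence ends on a stop codon
```

The listed gene sequence begins with GTG and ends with TGA, which suggests it is given in coding orientation even though the Strand field is "-". The GTG start is consistent with the protein sequence beginning with M.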
| |