Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4736 |
Symbol | |
ID | 4647742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 5069368 |
End bp | 5073669 |
Gene Length | 4302 bp |
Protein Length | 1433 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639808205 |
Product | putative outer membrane adhesin like proteiin |
Protein accession | YP_955516 |
Protein GI | 120405687 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01965] VCBS repeat |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.159607 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCCG CTAGCCCACG AAAACCGCGC CGGACTACGG GTCGCCATCG CAAGCCGGTG ACTGCCCGAG CCTCGATGTG TGCACGGTGG TTGCGTGTCG GCGCGGCCGG CGTCGGGATG GCCGCGGCGA TCGCCGGCGG CCAGGGCGTC GCTTCGGCCA CTCCGGATGA CGCGTCGGCC GGCAGCAGTC CCAGCGCCAG TGCCGACGAC AACAGCAGCG GTACGTCGGA CACCAGCGGG GCGGACTCCT CGTCGCAAGC AGGCGCGGGT ACTGACAACG AACCCGACTC CGATGCCGAT GCGGACTCGT CCCCGGTCAC CGACAGCGAA ACCGAATCGC TGACGGAGAC TGTCGACGCG GACGACGGGG TCACCGGCGA AGAGGACGCG GTCACCGGCG AAGAGTCGGA ATCCGAAGAG CCGCAATCCG GTCCGGCTCT TGATGCAGGC GGGGAAGCGG CACCCACCGC GCAGACCGCG TCGCAGAACC AGGATCCCGG CGACAACGCG GAGTCGGCGG CGGTCTCGGA GCAGTCCGTC ACGCCGGATG AGTCCGTCAC GCCTCCGAAC GACGAACTTC CCTCCGCGCA GACTGAAGAC GACGCGGCGC CGTCCGTCAC ACCGACATCG GCGTCGCAGC CAGAAGGGGC CGCCGCACTC ATCGACGACT CCGAGGCGAC TGCCGCCGCG TCCACCACTC AGAACGCCCA AGTCGGGACG ACGTCAGCGG CTGCGACCGG CATCGTCACC GACTTGCTGA ACTGGTTGGG GCTGGGTACG GGTGGAGCCA TCCCATTGCC CGGTGGTGCG GTCGAGTCGC TGTGGCTGGC GATCCGGCAA TCGCGGGGTC CGCAGGCTGA GCAGCCCGTC GTGGTCAGCC CGGCCGCGGC GCTGTGGTCC GGTGACTCCG CGCTCAAAGA ACTGCTCAGG GAGGTGATTC AGGCCGATTT CAGCAGGAAG TACGGCTGGA TCCCGGTCGT GGGCACGGTG GCACATGCCT TCAGCTTGAT GGGTAACGTC GGGGAACTCG GGGCCGCGAT CCTGCGTGGC GACCGCGCCG ACGTCGCCGA CGAACTCCGC GACGTCACCC GCGACGTCAT CGGTACCGTC CCGATCGTCG GAGCACCGAT CGCCGCAGAC CTCTACTTCA AGGAGGCCGG GGCTAGCGTC GAGGACATTT TCTCCGTCAC CGCCCGGTTG GCGACGGCTC AGGCCATGGC ACCTCAGTCC CTGACGCCGG AAGAGGAACA CGCGCTCGAG ACCCTTGCGC AGGTCCTGAA CCAGCTCGCG GGGTGGCCCG AGCCGCCGCA CAACTTCGTC ACGCCGGCCA ACTACACGCT CGACCAAGCC CTCGACGCCC ACGACGACGT GCTGGACCTC CTGTTCGCGA ATCCCACACC GACCACCCAG TGGATTCCCG ACGCGATGGC CTTGATCAAC CTGTTCTTCA AGTCGGCGCT GCCCGGGTAC TCGTTCTCCG ACGGCCTCAA CACATTCGGC GATCTGCTGA ACCGCCTTGT GCCGCCCTAC ACGATCGAAC CGGGCCCCGA CTATGTCATG ACCAAGGCGC AGGTGGCGGC CGCGAGCGTC GGCGCACTCG TCAGCATCCT GAACGAGCTG CTGGGCGGCA ACTTCGATCC GGAGAGTCTG CGTGACGCCG CCATTGCAGG GGGCACGTCG GGGTTCCTCA CCCCGTCGAG CGTGCTGAAC ATGACCTTCA CCACCGGCGC CGAGCCAAAT CCCTATTCGC TCATGGCTTA CATCGCCCTG GTCGGTGTGT ACGAGCGGTT CCAGTGGGTG GCGCTCAACC ACCTCCCGGA GGTGACCGGC CAGACCCAGA ACGGCCAGTT CCTGCTGAGC ATCACCGGCC AGGTGAACGC CACCGATCCC GACGGCGACC CGCTGACCTA CTCGATCAGC CAGCAGCCGG CCAACGGTAT CGTCACTGTC GGGCTCAACG GCAGCTGGCT CTACACGCGG ACGTCGAACT GGACCCACTC CGGACCCGAC ACGTTCACCA TGACGATCGA CGACACTCTC GGCCAGATCG GCGATCTGGG ACTGGATCAT CCGTATTCGC CTCTGGGGCA CAGCATCACC GTCGAGGTCA CGGTCGACTA CACCGGCGTC GCGAACAATC AGCCCACCAT CATCGCGCTG CCCGGGCTGC CGGACTCCAT GGGCATCGTG CGGGGCAGTG CGCCTGGGAA CGACATCGAC GGTGACACGC TGACCTACAG CCTGGTCGAT CCGGGCACCC CGGGTGCGAC GAGCAACTCG ATCTACACCA GCGAGGGCGG CATCGTCGCG CTCGACACCG CGACCGGCGA GTTCGTCTAT ATCCCAAAGG TTTCGACCGA CCTCATTCCG GCATTCGACA CCGACTCATT CCAGGTGCAG GTCTCCGACG GTCGTGGTGG CACCGCCACC ACGACGGTGG TGGTGATCTC GAACCTCAAG CCGGGGACGT CGACCACGGG CACCAGCGCG TACGTCGAGC ACGGCAAGGT CGACATCCCG ACCGCCGATG TCGGCCTGCT GACCTACAGC GTCGGCAGCC AGGGGGCGAA GGGCACGGTG GTCGTCAACG CCGACGGCAC CTACACCTAC ACCCGCAGTC TCACGGCCAC CGGCAGCACG TCGGATACGT TCACGATCAT CGGCACCGAC GCCAACGGCA AGACGGTGAC CTTGCCGGCC GTGTCGGTGG CGCCGCCGCT GATCTCGGTG ACGCCGACGA CCACCGCCAC CGGCGGGACG TTCACCCCGC GCGGCATGAT CAACGGCACC GCCATCACGC CGGCCACCCA GACGACGACC GGAACGATGA GCGGGATCGA CGAGGACGGG AATCCGGTCG ACATATCCGG TGGCATCTAC AGCACGGAGA AGGGCGGCAC CGTCACCATC ACGTCCGGCG GCGGGTTCCT CTACACCAAC ACGACCTACT CCGATCTCTT CCACAAGGCG GCCGCGACCG ACGCGCCCGG GTCCGACAAG GTGGACACCG TGAAAATCAC CGTGACGGAC TCGCTCGGCC GCACCGGCGA GGTCACCTTC TCCATTGTGC TGCGGACGGA GAACTCGAAC CCGTCGTCGA GCTCCTCGGT GGGAAGTGCC GATGCGCTCG GTGTGGTGCG TGGTTCGGTG TCGGGCGACG ACAATGACGG TGACTCGTTC ACGTACTCGT TGGCCGGTGT CGGCAATCCG GCTGGTGCGA CGGCGAATTC GACGTACACC GCCAACGGCG GCATCGTCTC GCTGAATTGG GACGGCACGT TCACCTACAT CCCGAACAAG TCTGCGGCAA CCGCCGACTC GTTCAAGGTG CTGGTTTCCG ACGGGCACGG AGGTGCCACG ACGGAAACGG TGTCCGTGCC GTTGGGCACG CCGTCGCCAC TCGCGAACGT CGTCACCTCG ACGCCGAATG TCGTTACCGG CCAGCTGAAC ATCCCGCCCG CGGACAACGG GCTGATGACC TACAGCGTCG GCATCGGCCC GTCGAAGGGC ACCGTGACGG TCGACCCCGG CGGCACCTTC ACCTACAACC GGACCAGCCC CGGCCACACC ACGACCCCGC CAGATTCGTT CACGATCATC GGCACCGAGG TGGCCACGGG CAAGACAGTG ACGATCGCGA CGGTCAACGT CACTCCGGTG GTGCCCAACG CCGCGCCGGT GGGCGGAGCG GTGACGGTCA CGTCGTCGTC ACTGTCGACC GGTCTGACCC GGAACCAAAC GGCCACAGGC ACCATCGGCG CCACGGACGC CGACGGGGAT CCGCTCACCT TCACGGCGGG AACCATCGAA ACCACCAACG GCGGAGAGAT CGTGCTGGCC GCCGACGGAT CGTTCACCTA CACCATCAGC AAGCTGCTGA CGAGCTCCTA CTACCACGAG GCCGCGAAGA TCGGTGCCAG CGGTTCAGCG GTCAGGGACA CCTTCACGGT AACGGTGAGT GACACCTTCG GTGCCAGCAC CTCTTTCGTG GCCTCGGTGC CCATCTACGC GACCAACACC CCGCCGAACA CCCCCACCTC CGGAGTGTTC TGGGGCCTGG GGGCGAACGA CTGGACTTCG GTCTTCGCGA CCGATCCCAA CGGGGATTCG CTCACCTACA CCATCACGAA GCAGCCCGAG CACGGCTCGG CGTCGTACAG CAGCGGCAGC CAGATACTGA GCACCACCGG CGCGCAATCG GGTGACACCA TCATCCTCAC CGTGACCGAC GGCTACTACG TGGTTGTCGA CGGTGTGGTC ACCGGGACAC CGGCAAGCAG CTCGAGGACG TACACAGTCT GA
|
Protein sequence | MAAASPRKPR RTTGRHRKPV TARASMCARW LRVGAAGVGM AAAIAGGQGV ASATPDDASA GSSPSASADD NSSGTSDTSG ADSSSQAGAG TDNEPDSDAD ADSSPVTDSE TESLTETVDA DDGVTGEEDA VTGEESESEE PQSGPALDAG GEAAPTAQTA SQNQDPGDNA ESAAVSEQSV TPDESVTPPN DELPSAQTED DAAPSVTPTS ASQPEGAAAL IDDSEATAAA STTQNAQVGT TSAAATGIVT DLLNWLGLGT GGAIPLPGGA VESLWLAIRQ SRGPQAEQPV VVSPAAALWS GDSALKELLR EVIQADFSRK YGWIPVVGTV AHAFSLMGNV GELGAAILRG DRADVADELR DVTRDVIGTV PIVGAPIAAD LYFKEAGASV EDIFSVTARL ATAQAMAPQS LTPEEEHALE TLAQVLNQLA GWPEPPHNFV TPANYTLDQA LDAHDDVLDL LFANPTPTTQ WIPDAMALIN LFFKSALPGY SFSDGLNTFG DLLNRLVPPY TIEPGPDYVM TKAQVAAASV GALVSILNEL LGGNFDPESL RDAAIAGGTS GFLTPSSVLN MTFTTGAEPN PYSLMAYIAL VGVYERFQWV ALNHLPEVTG QTQNGQFLLS ITGQVNATDP DGDPLTYSIS QQPANGIVTV GLNGSWLYTR TSNWTHSGPD TFTMTIDDTL GQIGDLGLDH PYSPLGHSIT VEVTVDYTGV ANNQPTIIAL PGLPDSMGIV RGSAPGNDID GDTLTYSLVD PGTPGATSNS IYTSEGGIVA LDTATGEFVY IPKVSTDLIP AFDTDSFQVQ VSDGRGGTAT TTVVVISNLK PGTSTTGTSA YVEHGKVDIP TADVGLLTYS VGSQGAKGTV VVNADGTYTY TRSLTATGST SDTFTIIGTD ANGKTVTLPA VSVAPPLISV TPTTTATGGT FTPRGMINGT AITPATQTTT GTMSGIDEDG NPVDISGGIY STEKGGTVTI TSGGGFLYTN TTYSDLFHKA AATDAPGSDK VDTVKITVTD SLGRTGEVTF SIVLRTENSN PSSSSSVGSA DALGVVRGSV SGDDNDGDSF TYSLAGVGNP AGATANSTYT ANGGIVSLNW DGTFTYIPNK SAATADSFKV LVSDGHGGAT TETVSVPLGT PSPLANVVTS TPNVVTGQLN IPPADNGLMT YSVGIGPSKG TVTVDPGGTF TYNRTSPGHT TTPPDSFTII GTEVATGKTV TIATVNVTPV VPNAAPVGGA VTVTSSSLST GLTRNQTATG TIGATDADGD PLTFTAGTIE TTNGGEIVLA ADGSFTYTIS KLLTSSYYHE AAKIGASGSA VRDTFTVTVS DTFGASTSFV ASVPIYATNT PPNTPTSGVF WGLGANDWTS VFATDPNGDS LTYTITKQPE HGSASYSSGS QILSTTGAQS GDTIILTVTD GYYVVVDGVV TGTPASSSRT YTV
|
| |