Gene Mvan_4736 details


Gene Information

Locus tag: Mvan_4736
Symbol:
ID: 4647742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5069368
End bp: 5073669
Gene Length: 4302 bp
Protein Length: 1433 aa
Translation table: 11
GC content: 67%
IMG OID: 639808205
Product: putative outer membrane adhesin like protein
Protein accession: YP_955516
Protein GI: 120405687
COG category:
COG ID:
TIGRFAM ID: [TIGR01965] VCBS repeat
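
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(5069368, 5073669)
print(length)                     # 4302
print(protein_length_aa(length))  # 1433
```

Both results match the Gene Length and Protein Length fields of the record.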


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.159607
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCCG CTAGCCCACG AAAACCGCGC CGGACTACGG GTCGCCATCG CAAGCCGGTG 
ACTGCCCGAG CCTCGATGTG TGCACGGTGG TTGCGTGTCG GCGCGGCCGG CGTCGGGATG
GCCGCGGCGA TCGCCGGCGG CCAGGGCGTC GCTTCGGCCA CTCCGGATGA CGCGTCGGCC
GGCAGCAGTC CCAGCGCCAG TGCCGACGAC AACAGCAGCG GTACGTCGGA CACCAGCGGG
GCGGACTCCT CGTCGCAAGC AGGCGCGGGT ACTGACAACG AACCCGACTC CGATGCCGAT
GCGGACTCGT CCCCGGTCAC CGACAGCGAA ACCGAATCGC TGACGGAGAC TGTCGACGCG
GACGACGGGG TCACCGGCGA AGAGGACGCG GTCACCGGCG AAGAGTCGGA ATCCGAAGAG
CCGCAATCCG GTCCGGCTCT TGATGCAGGC GGGGAAGCGG CACCCACCGC GCAGACCGCG
TCGCAGAACC AGGATCCCGG CGACAACGCG GAGTCGGCGG CGGTCTCGGA GCAGTCCGTC
ACGCCGGATG AGTCCGTCAC GCCTCCGAAC GACGAACTTC CCTCCGCGCA GACTGAAGAC
GACGCGGCGC CGTCCGTCAC ACCGACATCG GCGTCGCAGC CAGAAGGGGC CGCCGCACTC
ATCGACGACT CCGAGGCGAC TGCCGCCGCG TCCACCACTC AGAACGCCCA AGTCGGGACG
ACGTCAGCGG CTGCGACCGG CATCGTCACC GACTTGCTGA ACTGGTTGGG GCTGGGTACG
GGTGGAGCCA TCCCATTGCC CGGTGGTGCG GTCGAGTCGC TGTGGCTGGC GATCCGGCAA
TCGCGGGGTC CGCAGGCTGA GCAGCCCGTC GTGGTCAGCC CGGCCGCGGC GCTGTGGTCC
GGTGACTCCG CGCTCAAAGA ACTGCTCAGG GAGGTGATTC AGGCCGATTT CAGCAGGAAG
TACGGCTGGA TCCCGGTCGT GGGCACGGTG GCACATGCCT TCAGCTTGAT GGGTAACGTC
GGGGAACTCG GGGCCGCGAT CCTGCGTGGC GACCGCGCCG ACGTCGCCGA CGAACTCCGC
GACGTCACCC GCGACGTCAT CGGTACCGTC CCGATCGTCG GAGCACCGAT CGCCGCAGAC
CTCTACTTCA AGGAGGCCGG GGCTAGCGTC GAGGACATTT TCTCCGTCAC CGCCCGGTTG
GCGACGGCTC AGGCCATGGC ACCTCAGTCC CTGACGCCGG AAGAGGAACA CGCGCTCGAG
ACCCTTGCGC AGGTCCTGAA CCAGCTCGCG GGGTGGCCCG AGCCGCCGCA CAACTTCGTC
ACGCCGGCCA ACTACACGCT CGACCAAGCC CTCGACGCCC ACGACGACGT GCTGGACCTC
CTGTTCGCGA ATCCCACACC GACCACCCAG TGGATTCCCG ACGCGATGGC CTTGATCAAC
CTGTTCTTCA AGTCGGCGCT GCCCGGGTAC TCGTTCTCCG ACGGCCTCAA CACATTCGGC
GATCTGCTGA ACCGCCTTGT GCCGCCCTAC ACGATCGAAC CGGGCCCCGA CTATGTCATG
ACCAAGGCGC AGGTGGCGGC CGCGAGCGTC GGCGCACTCG TCAGCATCCT GAACGAGCTG
CTGGGCGGCA ACTTCGATCC GGAGAGTCTG CGTGACGCCG CCATTGCAGG GGGCACGTCG
GGGTTCCTCA CCCCGTCGAG CGTGCTGAAC ATGACCTTCA CCACCGGCGC CGAGCCAAAT
CCCTATTCGC TCATGGCTTA CATCGCCCTG GTCGGTGTGT ACGAGCGGTT CCAGTGGGTG
GCGCTCAACC ACCTCCCGGA GGTGACCGGC CAGACCCAGA ACGGCCAGTT CCTGCTGAGC
ATCACCGGCC AGGTGAACGC CACCGATCCC GACGGCGACC CGCTGACCTA CTCGATCAGC
CAGCAGCCGG CCAACGGTAT CGTCACTGTC GGGCTCAACG GCAGCTGGCT CTACACGCGG
ACGTCGAACT GGACCCACTC CGGACCCGAC ACGTTCACCA TGACGATCGA CGACACTCTC
GGCCAGATCG GCGATCTGGG ACTGGATCAT CCGTATTCGC CTCTGGGGCA CAGCATCACC
GTCGAGGTCA CGGTCGACTA CACCGGCGTC GCGAACAATC AGCCCACCAT CATCGCGCTG
CCCGGGCTGC CGGACTCCAT GGGCATCGTG CGGGGCAGTG CGCCTGGGAA CGACATCGAC
GGTGACACGC TGACCTACAG CCTGGTCGAT CCGGGCACCC CGGGTGCGAC GAGCAACTCG
ATCTACACCA GCGAGGGCGG CATCGTCGCG CTCGACACCG CGACCGGCGA GTTCGTCTAT
ATCCCAAAGG TTTCGACCGA CCTCATTCCG GCATTCGACA CCGACTCATT CCAGGTGCAG
GTCTCCGACG GTCGTGGTGG CACCGCCACC ACGACGGTGG TGGTGATCTC GAACCTCAAG
CCGGGGACGT CGACCACGGG CACCAGCGCG TACGTCGAGC ACGGCAAGGT CGACATCCCG
ACCGCCGATG TCGGCCTGCT GACCTACAGC GTCGGCAGCC AGGGGGCGAA GGGCACGGTG
GTCGTCAACG CCGACGGCAC CTACACCTAC ACCCGCAGTC TCACGGCCAC CGGCAGCACG
TCGGATACGT TCACGATCAT CGGCACCGAC GCCAACGGCA AGACGGTGAC CTTGCCGGCC
GTGTCGGTGG CGCCGCCGCT GATCTCGGTG ACGCCGACGA CCACCGCCAC CGGCGGGACG
TTCACCCCGC GCGGCATGAT CAACGGCACC GCCATCACGC CGGCCACCCA GACGACGACC
GGAACGATGA GCGGGATCGA CGAGGACGGG AATCCGGTCG ACATATCCGG TGGCATCTAC
AGCACGGAGA AGGGCGGCAC CGTCACCATC ACGTCCGGCG GCGGGTTCCT CTACACCAAC
ACGACCTACT CCGATCTCTT CCACAAGGCG GCCGCGACCG ACGCGCCCGG GTCCGACAAG
GTGGACACCG TGAAAATCAC CGTGACGGAC TCGCTCGGCC GCACCGGCGA GGTCACCTTC
TCCATTGTGC TGCGGACGGA GAACTCGAAC CCGTCGTCGA GCTCCTCGGT GGGAAGTGCC
GATGCGCTCG GTGTGGTGCG TGGTTCGGTG TCGGGCGACG ACAATGACGG TGACTCGTTC
ACGTACTCGT TGGCCGGTGT CGGCAATCCG GCTGGTGCGA CGGCGAATTC GACGTACACC
GCCAACGGCG GCATCGTCTC GCTGAATTGG GACGGCACGT TCACCTACAT CCCGAACAAG
TCTGCGGCAA CCGCCGACTC GTTCAAGGTG CTGGTTTCCG ACGGGCACGG AGGTGCCACG
ACGGAAACGG TGTCCGTGCC GTTGGGCACG CCGTCGCCAC TCGCGAACGT CGTCACCTCG
ACGCCGAATG TCGTTACCGG CCAGCTGAAC ATCCCGCCCG CGGACAACGG GCTGATGACC
TACAGCGTCG GCATCGGCCC GTCGAAGGGC ACCGTGACGG TCGACCCCGG CGGCACCTTC
ACCTACAACC GGACCAGCCC CGGCCACACC ACGACCCCGC CAGATTCGTT CACGATCATC
GGCACCGAGG TGGCCACGGG CAAGACAGTG ACGATCGCGA CGGTCAACGT CACTCCGGTG
GTGCCCAACG CCGCGCCGGT GGGCGGAGCG GTGACGGTCA CGTCGTCGTC ACTGTCGACC
GGTCTGACCC GGAACCAAAC GGCCACAGGC ACCATCGGCG CCACGGACGC CGACGGGGAT
CCGCTCACCT TCACGGCGGG AACCATCGAA ACCACCAACG GCGGAGAGAT CGTGCTGGCC
GCCGACGGAT CGTTCACCTA CACCATCAGC AAGCTGCTGA CGAGCTCCTA CTACCACGAG
GCCGCGAAGA TCGGTGCCAG CGGTTCAGCG GTCAGGGACA CCTTCACGGT AACGGTGAGT
GACACCTTCG GTGCCAGCAC CTCTTTCGTG GCCTCGGTGC CCATCTACGC GACCAACACC
CCGCCGAACA CCCCCACCTC CGGAGTGTTC TGGGGCCTGG GGGCGAACGA CTGGACTTCG
GTCTTCGCGA CCGATCCCAA CGGGGATTCG CTCACCTACA CCATCACGAA GCAGCCCGAG
CACGGCTCGG CGTCGTACAG CAGCGGCAGC CAGATACTGA GCACCACCGG CGCGCAATCG
GGTGACACCA TCATCCTCAC CGTGACCGAC GGCTACTACG TGGTTGTCGA CGGTGTGGTC
ACCGGGACAC CGGCAAGCAG CTCGAGGACG TACACAGTCT GA
 
Protein sequence
MAAASPRKPR RTTGRHRKPV TARASMCARW LRVGAAGVGM AAAIAGGQGV ASATPDDASA 
GSSPSASADD NSSGTSDTSG ADSSSQAGAG TDNEPDSDAD ADSSPVTDSE TESLTETVDA
DDGVTGEEDA VTGEESESEE PQSGPALDAG GEAAPTAQTA SQNQDPGDNA ESAAVSEQSV
TPDESVTPPN DELPSAQTED DAAPSVTPTS ASQPEGAAAL IDDSEATAAA STTQNAQVGT
TSAAATGIVT DLLNWLGLGT GGAIPLPGGA VESLWLAIRQ SRGPQAEQPV VVSPAAALWS
GDSALKELLR EVIQADFSRK YGWIPVVGTV AHAFSLMGNV GELGAAILRG DRADVADELR
DVTRDVIGTV PIVGAPIAAD LYFKEAGASV EDIFSVTARL ATAQAMAPQS LTPEEEHALE
TLAQVLNQLA GWPEPPHNFV TPANYTLDQA LDAHDDVLDL LFANPTPTTQ WIPDAMALIN
LFFKSALPGY SFSDGLNTFG DLLNRLVPPY TIEPGPDYVM TKAQVAAASV GALVSILNEL
LGGNFDPESL RDAAIAGGTS GFLTPSSVLN MTFTTGAEPN PYSLMAYIAL VGVYERFQWV
ALNHLPEVTG QTQNGQFLLS ITGQVNATDP DGDPLTYSIS QQPANGIVTV GLNGSWLYTR
TSNWTHSGPD TFTMTIDDTL GQIGDLGLDH PYSPLGHSIT VEVTVDYTGV ANNQPTIIAL
PGLPDSMGIV RGSAPGNDID GDTLTYSLVD PGTPGATSNS IYTSEGGIVA LDTATGEFVY
IPKVSTDLIP AFDTDSFQVQ VSDGRGGTAT TTVVVISNLK PGTSTTGTSA YVEHGKVDIP
TADVGLLTYS VGSQGAKGTV VVNADGTYTY TRSLTATGST SDTFTIIGTD ANGKTVTLPA
VSVAPPLISV TPTTTATGGT FTPRGMINGT AITPATQTTT GTMSGIDEDG NPVDISGGIY
STEKGGTVTI TSGGGFLYTN TTYSDLFHKA AATDAPGSDK VDTVKITVTD SLGRTGEVTF
SIVLRTENSN PSSSSSVGSA DALGVVRGSV SGDDNDGDSF TYSLAGVGNP AGATANSTYT
ANGGIVSLNW DGTFTYIPNK SAATADSFKV LVSDGHGGAT TETVSVPLGT PSPLANVVTS
TPNVVTGQLN IPPADNGLMT YSVGIGPSKG TVTVDPGGTF TYNRTSPGHT TTPPDSFTII
GTEVATGKTV TIATVNVTPV VPNAAPVGGA VTVTSSSLST GLTRNQTATG TIGATDADGD
PLTFTAGTIE TTNGGEIVLA ADGSFTYTIS KLLTSSYYHE AAKIGASGSA VRDTFTVTVS
DTFGASTSFV ASVPIYATNT PPNTPTSGVF WGLGANDWTS VFATDPNGDS LTYTITKQPE
HGSASYSSGS QILSTTGAQS GDTIILTVTD GYYVVVDGVV TGTPASSSRT YTV
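
The sequences above are printed in space-separated groups of ten, so they need to be ungrouped before any analysis. The following is a minimal sketch of how the gene and protein sequences relate, using only the first and last lines of the gene sequence as sample input. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). The helper names `ungroup`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: amino acid assignments of the standard code, which also
# apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def ungroup(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCTGCCG CTAGCCCACG AAAACCGCGC CGGACTACGG GTCGCCATCG CAAGCCGGTG"
last_line = "ACCGGGACAC CGGCAAGCAG CTCGAGGACG TACACAGTCT GA"

print(translate(ungroup(first_line)))  # MAAASPRKPRRTTGRHRKPV
print(translate(ungroup(last_line)))   # TGTPASSSRTYTV
```

The two outputs match the start and end of the protein sequence listed above. Applying `gc_percent` to the full ungrouped gene sequence, rather than to these fragments, is what would be compared against the GC content field.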