Gene Mvan_5913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5913 
Symbol 
ID4647524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp6299263 
End bp6302454 
Gene Length3192 bp 
Protein Length1063 aa 
Translation table11 
GC content70% 
IMG OID639809389 
Productputative outer membrane adhesin like proteiin 
Protein accessionYP_956683 
Protein GI120406854 
COG category 
COG ID 
TIGRFAM ID[TIGR01965] VCBS repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.278208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACCA ACAGGCGCTC CGGAATGCTG GAAGACTCCA CCGCGGCCGG GCTGCCGCGT 
CGCGGCCGGC ACCGCAGGGG CCGGCCACGC AGGCTGCAAC CCTATGCCTG GCTGGGTGCC
GGAGCGGTCG GGTTCGGTCT TGGCGCCGCC GCATTGACCG GTGCGGGCTC CGCCCACGCC
GACGAGAGCG CCGCAGACCC GTCGTCGTCC TCCGCGTCGT CGGTCTCCTC GAGCGAGGAC
GCGAACGAGT CCAGAGCTGT GTCCTCCGGC GCGGACCCCG ACGACGCGAC CGAGGACCGA
GACCCGCCGG AATCCGAGGC CCCTGAGTCG CAGGCCGACA CGGATGACAG CGCGACAGCG
GAAGCGGCAG TCCCCGAGGA AACAGCAACG GAGACAGACA TTGTCGACGG CGGAGGCCGC
CGCGCGCACC CGGGTTCCGA CGCCGAATCG CGTGCAGCGG CGGTCGAGGA CCTTCAGGAC
GACGACGCCG CCACCGTCGA CCTGCCGACC GACGTCGGCG GATCAGCGGC GGCGAACACG
GAGCCGACGG CGGTACTGAG CTGGCAGAGT TCACCCGGCT GGTTGACGGG CAGGGTCACG
GGTCGAATCC GCGCGGCGGA CCCCGACGGC GACCGTCTCA CCTATGCAGG CACCGCCACC
ACGGGGACGG TGGCCGTCAC GTCGTGGGGA TCGTTCACCT ACACCCCGAG CGCCGCCGCC
CGCCATGCCG CCGCGGCCAC CGACGTCACG CCGGTCAGGG CCACCGACAC CTTCGACATC
ACCGTCAGCG ACGGGTGGGG CGGCATCGTC GCAGTCCCGG TGACCGTGCG GATCCGCCCG
GCCAACAGCG CGCCCGCCTG GCTCAGATCC ACGGTGGCAG CGCCTGATCC GGCCACCGGG
CAGGTGACCG GCCGGGTCAC CGCCACGGAC CGTGACGGGG ACGTCTTCAC CTACACCGCC
GCCGCGCCGT CCGAGGGGGC GGTCACGGTC CATGCCGACG GCTCCTTCAC CTACCACCCG
TCCGACGCGG CGCGCCGGCG GGCCGCGTCG ACCTGGTACA CCGACACCGA CAGTTTCAGA
GTCCTGCTCG ACGACGGTCA CGGCGCAACC CGGGCGATCA CCGTCCGGGT GCGCATCGCG
CCGCACAACT CCGCCCCGGT GTCGGGCACC CCGACGTACG GCCCACCCGA CCCGTCCACC
GGCGCCATCG GCGGTTCGGT CACGGCGACC GATCCCGATG GCGACAGGAT CACCTACCGT
CTCTCCGCCC CGCCGCATAG GGGCGCCCTG GTGGTCACCG GGGACGGACG GTTCACCTAC
ACGCCGACCC CGGTGGCCCG GCACGCCGCC GCGACGGGCC GCGTCGATTC GACGACGGAT
GCTGCCACCG TGGAGGCCGG CGACAGCCTC GGGGCGCTGA CCTCGATTCC GCTGACCTTC
ACGATCCTGC CCGCCAATTC GGTCCCAACC AGCCTGAGAG CCACTGTGGG GCAGCCGGAT
TCGACCACCG GAGTTGTGAC CGGCGCCGTC ACCGCCGACG ACGCCGATGG CGATCCGCTG
ACCTACAGCG GTTCCACGGT GACGGTGAAG GGCGCGGTGA GCGTGGCCGC CGCCGGAACG
TTCGTCTACA CGCCGACCGC CAACGCGCGT CAGAACGCGC AGGCGCCCGA CGCCACGGCG
GCCGACCGCG CCGACTCGTT TGTCGTCACC GTCGACGACG GTCACGGCGG GATGGCCACG
CTGCCGGTCA CCGTGGCCAT CGGGTCCGTG CCGGTCCCGG ATCCGCCCGA CCCCCCGCCA
CCTCCCGGAG CGCTTCCCGC CTTTCCCGGC GCGGAAGGGT TCGGCAGCCT TGCCACCGGT
GGCCGTGGCG GCAGCGTGGT CTACGTGACG AACACCAATG CCGCCGGGCC GGGCTCGCTC
CAGTGGGCGA TCGACCAACC CGGGGCGAAG TACATCCTGT TCAAGGTCAG CGGGGTGATC
GACACCCAGA TCCACCTGAC CAACGGTGAC GTGACCATCG CCGGCCAGAC CTCGCCGGGT
GGCATCACCA TCCGCGGACT GGTCACCGAC GAGAGTCCCT ATCAGGACCA GGCGGTCCGG
GCCCCGGCCG ACTTCGCCGA GAACTGGATT CTCCAGCACA TCCGTATCCG TCCGGGACTG
AACGGACCCA GCGATGACGG GTTGCGCATC CGCTACACCC GCAATGCCGT CCTCGACCAC
GTATCGATCG GCAACGCCAC CGACGAAGCG GTGGAGATCT CCTACTCGAA CAACGTCACG
ATCCAGAACT CGATCATCGC GGAAACGCTT GGCGGCCACT CCTTCTACGG CGGCGTGCTG
ATGAACTACT CCAACCCGGC GCACGGCTTC GGACTGGACA ACATCGCGCT GCACCACAAC
GTCATCAACC GCATCGAGGG CCGCCTGCCC GAAGGCAGCC GGGAGTCGCT TGCCGCGGCG
TACTCCACCA TGAATCTGGA GCTGTCCAAC AATCTCTACT GGGATCCGCG CTTCTTCATC
GCGTTGGGCC CGAACACCAA CATCGTCACC GACAGCAGCG GCAACCCCTA TCCGATCTAC
TGGAATCTCA ACGCCGTCAA CAACTACTTC CGAACCGGAC CACAGTTCCC CTACGGCATG
TTCGACGACC AGATCCTGCG TGTCGTCGGC AACACGCTCT ACGTCAGCGG CAACCGGATG
AGCAGCTATC CGAGCCGTTC GGACTACGAG CTGTTCTACT GCTGCAACGA CTTTGCCTCG
GTCAGCAACC CCGACGACTC CTCGCACCGG GCCCAGAAGC TCAGCGCGCG GCATCCGTTC
CCCGCCATCA CCTACACCCC GACCGAGATG CTGCGGGCCG TCCTGCGGGA CCGCGCCGGC
GCCTGGCCCC GCGACCCCAT GGACATCCGT CTCCTCGAGT CCGTCGCCGG CGACACCATT
TCTCCGGCCG ACCCCGCCAC CAATCCCGCC GGTGACGCCC TCCTGCCGCC GTACACCGGC
GCCGCCCCCG CGGCCCCGCC GGACACCGAC GGCGACGGCA TGCCCGACGC CTGGGAAGTC
GGCAAAGGGT TGAATCCGTT GTCCGCCAAC CACAACGCCA CGACACTCTC GCTGCTGGGC
TACACGGACC TGGAGGTCTA CCTCCACGAG CTGTCCGCGA GTCTCGTAGA CCCTGCCCGC
GCCCTCGGGT GA
 
Protein sequence
MVTNRRSGML EDSTAAGLPR RGRHRRGRPR RLQPYAWLGA GAVGFGLGAA ALTGAGSAHA 
DESAADPSSS SASSVSSSED ANESRAVSSG ADPDDATEDR DPPESEAPES QADTDDSATA
EAAVPEETAT ETDIVDGGGR RAHPGSDAES RAAAVEDLQD DDAATVDLPT DVGGSAAANT
EPTAVLSWQS SPGWLTGRVT GRIRAADPDG DRLTYAGTAT TGTVAVTSWG SFTYTPSAAA
RHAAAATDVT PVRATDTFDI TVSDGWGGIV AVPVTVRIRP ANSAPAWLRS TVAAPDPATG
QVTGRVTATD RDGDVFTYTA AAPSEGAVTV HADGSFTYHP SDAARRRAAS TWYTDTDSFR
VLLDDGHGAT RAITVRVRIA PHNSAPVSGT PTYGPPDPST GAIGGSVTAT DPDGDRITYR
LSAPPHRGAL VVTGDGRFTY TPTPVARHAA ATGRVDSTTD AATVEAGDSL GALTSIPLTF
TILPANSVPT SLRATVGQPD STTGVVTGAV TADDADGDPL TYSGSTVTVK GAVSVAAAGT
FVYTPTANAR QNAQAPDATA ADRADSFVVT VDDGHGGMAT LPVTVAIGSV PVPDPPDPPP
PPGALPAFPG AEGFGSLATG GRGGSVVYVT NTNAAGPGSL QWAIDQPGAK YILFKVSGVI
DTQIHLTNGD VTIAGQTSPG GITIRGLVTD ESPYQDQAVR APADFAENWI LQHIRIRPGL
NGPSDDGLRI RYTRNAVLDH VSIGNATDEA VEISYSNNVT IQNSIIAETL GGHSFYGGVL
MNYSNPAHGF GLDNIALHHN VINRIEGRLP EGSRESLAAA YSTMNLELSN NLYWDPRFFI
ALGPNTNIVT DSSGNPYPIY WNLNAVNNYF RTGPQFPYGM FDDQILRVVG NTLYVSGNRM
SSYPSRSDYE LFYCCNDFAS VSNPDDSSHR AQKLSARHPF PAITYTPTEM LRAVLRDRAG
AWPRDPMDIR LLESVAGDTI SPADPATNPA GDALLPPYTG AAPAAPPDTD GDGMPDAWEV
GKGLNPLSAN HNATTLSLLG YTDLEVYLHE LSASLVDPAR ALG