Gene Information |
Locus tag | Mvan_5913 |
Symbol | |
ID | 4647524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 6299263 |
End bp | 6302454 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639809389 |
Product | putative outer membrane adhesin like protein |
Protein accession | YP_956683 |
Protein GI | 120406854 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01965] VCBS repeat |
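The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but it matches the reported gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical, chosen only for this example.

```python
# Values copied from the Gene Information table.
START_BP = 6299263
END_BP = 6302454


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (3192 bp) and Protein Length (1063 aa) rows.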
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.278208 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACCA ACAGGCGCTC CGGAATGCTG GAAGACTCCA CCGCGGCCGG GCTGCCGCGT CGCGGCCGGC ACCGCAGGGG CCGGCCACGC AGGCTGCAAC CCTATGCCTG GCTGGGTGCC GGAGCGGTCG GGTTCGGTCT TGGCGCCGCC GCATTGACCG GTGCGGGCTC CGCCCACGCC GACGAGAGCG CCGCAGACCC GTCGTCGTCC TCCGCGTCGT CGGTCTCCTC GAGCGAGGAC GCGAACGAGT CCAGAGCTGT GTCCTCCGGC GCGGACCCCG ACGACGCGAC CGAGGACCGA GACCCGCCGG AATCCGAGGC CCCTGAGTCG CAGGCCGACA CGGATGACAG CGCGACAGCG GAAGCGGCAG TCCCCGAGGA AACAGCAACG GAGACAGACA TTGTCGACGG CGGAGGCCGC CGCGCGCACC CGGGTTCCGA CGCCGAATCG CGTGCAGCGG CGGTCGAGGA CCTTCAGGAC GACGACGCCG CCACCGTCGA CCTGCCGACC GACGTCGGCG GATCAGCGGC GGCGAACACG GAGCCGACGG CGGTACTGAG CTGGCAGAGT TCACCCGGCT GGTTGACGGG CAGGGTCACG GGTCGAATCC GCGCGGCGGA CCCCGACGGC GACCGTCTCA CCTATGCAGG CACCGCCACC ACGGGGACGG TGGCCGTCAC GTCGTGGGGA TCGTTCACCT ACACCCCGAG CGCCGCCGCC CGCCATGCCG CCGCGGCCAC CGACGTCACG CCGGTCAGGG CCACCGACAC CTTCGACATC ACCGTCAGCG ACGGGTGGGG CGGCATCGTC GCAGTCCCGG TGACCGTGCG GATCCGCCCG GCCAACAGCG CGCCCGCCTG GCTCAGATCC ACGGTGGCAG CGCCTGATCC GGCCACCGGG CAGGTGACCG GCCGGGTCAC CGCCACGGAC CGTGACGGGG ACGTCTTCAC CTACACCGCC GCCGCGCCGT CCGAGGGGGC GGTCACGGTC CATGCCGACG GCTCCTTCAC CTACCACCCG TCCGACGCGG CGCGCCGGCG GGCCGCGTCG ACCTGGTACA CCGACACCGA CAGTTTCAGA GTCCTGCTCG ACGACGGTCA CGGCGCAACC CGGGCGATCA CCGTCCGGGT GCGCATCGCG CCGCACAACT CCGCCCCGGT GTCGGGCACC CCGACGTACG GCCCACCCGA CCCGTCCACC GGCGCCATCG GCGGTTCGGT CACGGCGACC GATCCCGATG GCGACAGGAT CACCTACCGT CTCTCCGCCC CGCCGCATAG GGGCGCCCTG GTGGTCACCG GGGACGGACG GTTCACCTAC ACGCCGACCC CGGTGGCCCG GCACGCCGCC GCGACGGGCC GCGTCGATTC GACGACGGAT GCTGCCACCG TGGAGGCCGG CGACAGCCTC GGGGCGCTGA CCTCGATTCC GCTGACCTTC ACGATCCTGC CCGCCAATTC GGTCCCAACC AGCCTGAGAG CCACTGTGGG GCAGCCGGAT TCGACCACCG GAGTTGTGAC CGGCGCCGTC ACCGCCGACG ACGCCGATGG CGATCCGCTG ACCTACAGCG GTTCCACGGT GACGGTGAAG GGCGCGGTGA GCGTGGCCGC CGCCGGAACG TTCGTCTACA CGCCGACCGC CAACGCGCGT CAGAACGCGC AGGCGCCCGA CGCCACGGCG GCCGACCGCG CCGACTCGTT TGTCGTCACC GTCGACGACG GTCACGGCGG GATGGCCACG CTGCCGGTCA CCGTGGCCAT CGGGTCCGTG CCGGTCCCGG ATCCGCCCGA CCCCCCGCCA 
CCTCCCGGAG CGCTTCCCGC CTTTCCCGGC GCGGAAGGGT TCGGCAGCCT TGCCACCGGT GGCCGTGGCG GCAGCGTGGT CTACGTGACG AACACCAATG CCGCCGGGCC GGGCTCGCTC CAGTGGGCGA TCGACCAACC CGGGGCGAAG TACATCCTGT TCAAGGTCAG CGGGGTGATC GACACCCAGA TCCACCTGAC CAACGGTGAC GTGACCATCG CCGGCCAGAC CTCGCCGGGT GGCATCACCA TCCGCGGACT GGTCACCGAC GAGAGTCCCT ATCAGGACCA GGCGGTCCGG GCCCCGGCCG ACTTCGCCGA GAACTGGATT CTCCAGCACA TCCGTATCCG TCCGGGACTG AACGGACCCA GCGATGACGG GTTGCGCATC CGCTACACCC GCAATGCCGT CCTCGACCAC GTATCGATCG GCAACGCCAC CGACGAAGCG GTGGAGATCT CCTACTCGAA CAACGTCACG ATCCAGAACT CGATCATCGC GGAAACGCTT GGCGGCCACT CCTTCTACGG CGGCGTGCTG ATGAACTACT CCAACCCGGC GCACGGCTTC GGACTGGACA ACATCGCGCT GCACCACAAC GTCATCAACC GCATCGAGGG CCGCCTGCCC GAAGGCAGCC GGGAGTCGCT TGCCGCGGCG TACTCCACCA TGAATCTGGA GCTGTCCAAC AATCTCTACT GGGATCCGCG CTTCTTCATC GCGTTGGGCC CGAACACCAA CATCGTCACC GACAGCAGCG GCAACCCCTA TCCGATCTAC TGGAATCTCA ACGCCGTCAA CAACTACTTC CGAACCGGAC CACAGTTCCC CTACGGCATG TTCGACGACC AGATCCTGCG TGTCGTCGGC AACACGCTCT ACGTCAGCGG CAACCGGATG AGCAGCTATC CGAGCCGTTC GGACTACGAG CTGTTCTACT GCTGCAACGA CTTTGCCTCG GTCAGCAACC CCGACGACTC CTCGCACCGG GCCCAGAAGC TCAGCGCGCG GCATCCGTTC CCCGCCATCA CCTACACCCC GACCGAGATG CTGCGGGCCG TCCTGCGGGA CCGCGCCGGC GCCTGGCCCC GCGACCCCAT GGACATCCGT CTCCTCGAGT CCGTCGCCGG CGACACCATT TCTCCGGCCG ACCCCGCCAC CAATCCCGCC GGTGACGCCC TCCTGCCGCC GTACACCGGC GCCGCCCCCG CGGCCCCGCC GGACACCGAC GGCGACGGCA TGCCCGACGC CTGGGAAGTC GGCAAAGGGT TGAATCCGTT GTCCGCCAAC CACAACGCCA CGACACTCTC GCTGCTGGGC TACACGGACC TGGAGGTCTA CCTCCACGAG CTGTCCGCGA GTCTCGTAGA CCCTGCCCGC GCCCTCGGGT GA
|
Protein sequence | MVTNRRSGML EDSTAAGLPR RGRHRRGRPR RLQPYAWLGA GAVGFGLGAA ALTGAGSAHA DESAADPSSS SASSVSSSED ANESRAVSSG ADPDDATEDR DPPESEAPES QADTDDSATA EAAVPEETAT ETDIVDGGGR RAHPGSDAES RAAAVEDLQD DDAATVDLPT DVGGSAAANT EPTAVLSWQS SPGWLTGRVT GRIRAADPDG DRLTYAGTAT TGTVAVTSWG SFTYTPSAAA RHAAAATDVT PVRATDTFDI TVSDGWGGIV AVPVTVRIRP ANSAPAWLRS TVAAPDPATG QVTGRVTATD RDGDVFTYTA AAPSEGAVTV HADGSFTYHP SDAARRRAAS TWYTDTDSFR VLLDDGHGAT RAITVRVRIA PHNSAPVSGT PTYGPPDPST GAIGGSVTAT DPDGDRITYR LSAPPHRGAL VVTGDGRFTY TPTPVARHAA ATGRVDSTTD AATVEAGDSL GALTSIPLTF TILPANSVPT SLRATVGQPD STTGVVTGAV TADDADGDPL TYSGSTVTVK GAVSVAAAGT FVYTPTANAR QNAQAPDATA ADRADSFVVT VDDGHGGMAT LPVTVAIGSV PVPDPPDPPP PPGALPAFPG AEGFGSLATG GRGGSVVYVT NTNAAGPGSL QWAIDQPGAK YILFKVSGVI DTQIHLTNGD VTIAGQTSPG GITIRGLVTD ESPYQDQAVR APADFAENWI LQHIRIRPGL NGPSDDGLRI RYTRNAVLDH VSIGNATDEA VEISYSNNVT IQNSIIAETL GGHSFYGGVL MNYSNPAHGF GLDNIALHHN VINRIEGRLP EGSRESLAAA YSTMNLELSN NLYWDPRFFI ALGPNTNIVT DSSGNPYPIY WNLNAVNNYF RTGPQFPYGM FDDQILRVVG NTLYVSGNRM SSYPSRSDYE LFYCCNDFAS VSNPDDSSHR AQKLSARHPF PAITYTPTEM LRAVLRDRAG AWPRDPMDIR LLESVAGDTI SPADPATNPA GDALLPPYTG AAPAAPPDTD GDGMPDAWEV GKGLNPLSAN HNATTLSLLG YTDLEVYLHE LSASLVDPAR ALG
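The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the gene sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the strand is "-"), and it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons). All function names are hypothetical.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "ATGGTCACCA ACAGGCGCTC CGGAATGCTG"
print(translate(prefix))  # MVTNRRSGML, the first ten residues listed above
print(gc_fraction(prefix))
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content and Protein sequence rows.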