Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_5004 |
Symbol | |
ID | 4645067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 5352121 |
End bp | 5355240 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639808475 |
Product | hypothetical protein |
Protein accession | YP_955782 |
Protein GI | 120405953 |
COG category | [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.998418 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTATG CGCAGTTCGT GGGTCGGGTC GGGGCGTTGG CCGTCGCGCT CGGGGTGGGG GCGGCAACGG TTGCACTGCC CGGGGCGGCG TGGGCGGAGC CGAGCGACGC CGCGTCGTCG AGCAGTGCCG ACAACACCAA AGCCGAGGAC ACAGCCGAGG ATTCGACCGC CGCCGATGAC GCGCGGCCCG ACGACGCGGT TGACGACGAA GTCAATTCCG GCGAGGACCA CGACGAGGAG GCTCCGCAGT CCGAGGACGG CGGCGATGGC AGCGACGAGG ACGGCGGACA ACCCGAGAAA AAGACGCGCG GCTCGTATGG CAGCAATCGT TCAGAAGGCG AGCTAACCGG AGAGGACGCC GACGAGGCTC GGGCCGACCG GGAGGACACC GACACCGAGG CCGACGTCGA CGTGGTCGCC CCCGAAGCGG AGCCCACCGC TGACGAGGCC GACGACGAGG CCGACGGTGG GGATCCAGCC GAAGTGGCCG AGCCCGGAGT CGCAGACGAC GTCGCGGTGG AACAGCCCGC CGAGGTCTCC GACGATGCAC CCGAGCCCGT CGAATCACCG ATATCGGCGC CGTCCTCGAT CGTCACGGCG CTGTTCGCGC CGAAGTCGTG GGGTGACACC GCGCCGGCCG ATCCGGTCGA GTCGCCACTG CTGTGGACGC TGCTGGCGTT CGCGCGGCGC CAGTTCGGTC AACCGCGGAC CGAAATCGGT GACAGCGGTA CGCCGACGGG CACCACCGAA CTGGTGGACC CGGGCGCCGC CGCGGTGCCT GAGCCCGGCG AGGTCACCAA GGGGGACCCC GGGATTTTCA CCGGCACCGT CCGCGGACAG GTGAAGGCGA CCGATCCTGA CGGCGGTTGG CTCACGTACA GCGGATCGAC CAGCACGGAG AAGGGCACCG TCACCGTCAC CCCGTGGGGC ACGTTCCGGT ACACCCCGAG CGCCACGGCG CGCCACGCCG CTGCTGCAGA CGGGGCGTCC ACCGAGGCCA AGACCGACAC CTTCACGGTC ACGGTCAAAG ACGCCGCAGG CAACGCGGTC GAAGTCCCCG TCACCGTAGA CATCCTGGCG CGCAACGCCG ATCCGTTCGG CGCGCGCGCC CGCGCCGACA ACCCGAGCCT GACCACGGGC ACCGCGATCA TCAGGGTCAG TGCCTACGAC TTCGACCGGG ACCCGCTCGA CATCACCGGA CCGCTCTCGA CCGGTAAGGG TGAGCTCGTC GACAACGGCG ACGGGACGTT CACCTACACG CCGACCGCCG CGGCCCGGGA AGCCGCCAGC GACCCCGACG CACCGGACGA CGCCAGAATC GACACGCTGA CCTTCACGAT CAGCGACGGG CACGGCGGGA TCCGCACGGC CACGGTGGAT GTCATCGTCG CCGCCTACGC CGAGTCGAGC GCGTCGACTC CGGGGCGGGC GGCCGGCCCC GTCCTCGTCA GCTCCAACGG CACCATCTAT CAGGTCACGT ATGACCTCGA CTCGACCAAC AACCCGATCC GGACCCGCGT CAGCATCCTC GACGAGGACG GCCAGGTGCT CAAGACCACC GACGTCATCC CCGGATACCC GGTGGAACAA GCGCTTCCGG TCGTCCGGCC CGACGGCAGC CTCCTGGTGA CCACGTACAA GGCGTCCTCG AACACCTCCA CCGTCTCCAT CGTGGACGGC CAGGGTGAGG TGAAGACCCT CGGAACGGTG ATCGGGCAGC CGTCGGCGCC GATGACGGTC GCGCCGAACG GCGCAGTGTT CTTCAAGACC CGACAGTTCG CGTCCGGATC CGGCGACCGG CTGGTCCGCG TCTCGGCAAC GGGTGGGCTG CGCGTGTATC AGCTCGGGGT GGCCGCTGAC TCGCCGAGCG TGGCGCCCGA CGGCAGCGTT TACATCGTGT CCCGCTCGTT CGGCGTCACG TCGGTGCTGG CAGTCGGGCC GGGCGGCAAC TCGCGGCGGG TGTCACTACC CCTGGGCGCC GACACCGTCA ACGACGTCGT CATCGGCCAG GACGGCCGTG GCTATCTCAC CGTCGAGCGA AACGTATTCG GCACCAAGAC AACTCGCGTG TACACGTTCA CCGGGACGTC GAACACCGTG CGGGAGATCC CCGGCACTCC GGACGGCGCG AAGGTGATCA CCGCAGACGG TGTCTACCAG TACACCTACG ACGAGTCGAC CGGGAAGTCC TACATCTCCC GGATCACCGC GGACACAATC GAAACGTCCG ATCCCATCGA CGGGCGCGTC ATCAACCCGA TCAGCGTCAC CCCCGACGGC ACGGTGTACG TGGCGGTACG TAACTCCGCG ACCGGGACCG ACAGCGTGGC GATCATCAGC ACCTCCGGTG AGGTCACCAC GGTCGACATT CCCGGCACGA TCTTCCCGGT GCTGCCGAGT GTCAGCCCCG CCGTCACCGG CGATGACGCC AACCCGAACA TCGGCGATAA CGGCTACGTC GCCTACCAGT CCGGCGGCGT CAGCTATCTC GCGGTGGTGA ACCCCGACGG GACGATCGCG CGCACCGTCA CACTGCCCGC CGGCGCCGTC GTCGCCACCC CGGTCGACTT CGGCCCGGAC GGCGCGGCGT ATCAGGTCAT CGAGACACGC GACGAGCAGG GGCGGGTCAC TTCGCGGGCC GTGCTCGCGC TGTCCACCGA CACGGTCACG CCCATGCTGC CGGGTGCCCC GTTGCAGCCG AATTACCCGT CGATCCAGTT CGGCCCGGAC GGCAGTGGGG TGCTCATCAC CGTGGAGACC GGCCAGTCGC CGTTCGAGTA CCACTTCCTG CGGTTCGACC AGGACGGCGC GACGATCGCC ACCGCGGACC TCTCGGGGTT CCTCCAGTCG GCGCAGCAGG ATTACGTGTT CTGGCAGGAG GGAGTCGTGT TCGGACCCGA CGGCACCCCG TACGCCACAC TCACCGGTGC CGATCAAGGG GTCTGGGCAT TGACGTCGAC GGGTCCGGTC AAGGTCCTCG AGCTCGACCT CGGACAGGGC GAGCTCGTCG AGCCCGTGAA GTTCGGACCC GATGGCACCC CGTACGTGAC GGTGTCGGAA CGGGTCGACG GCAGCTACGT GACCACGGTG CACACCTTCA CGCCGGTCAC CATGCTGTAA
|
Protein sequence | MGYAQFVGRV GALAVALGVG AATVALPGAA WAEPSDAASS SSADNTKAED TAEDSTAADD ARPDDAVDDE VNSGEDHDEE APQSEDGGDG SDEDGGQPEK KTRGSYGSNR SEGELTGEDA DEARADREDT DTEADVDVVA PEAEPTADEA DDEADGGDPA EVAEPGVADD VAVEQPAEVS DDAPEPVESP ISAPSSIVTA LFAPKSWGDT APADPVESPL LWTLLAFARR QFGQPRTEIG DSGTPTGTTE LVDPGAAAVP EPGEVTKGDP GIFTGTVRGQ VKATDPDGGW LTYSGSTSTE KGTVTVTPWG TFRYTPSATA RHAAAADGAS TEAKTDTFTV TVKDAAGNAV EVPVTVDILA RNADPFGARA RADNPSLTTG TAIIRVSAYD FDRDPLDITG PLSTGKGELV DNGDGTFTYT PTAAAREAAS DPDAPDDARI DTLTFTISDG HGGIRTATVD VIVAAYAESS ASTPGRAAGP VLVSSNGTIY QVTYDLDSTN NPIRTRVSIL DEDGQVLKTT DVIPGYPVEQ ALPVVRPDGS LLVTTYKASS NTSTVSIVDG QGEVKTLGTV IGQPSAPMTV APNGAVFFKT RQFASGSGDR LVRVSATGGL RVYQLGVAAD SPSVAPDGSV YIVSRSFGVT SVLAVGPGGN SRRVSLPLGA DTVNDVVIGQ DGRGYLTVER NVFGTKTTRV YTFTGTSNTV REIPGTPDGA KVITADGVYQ YTYDESTGKS YISRITADTI ETSDPIDGRV INPISVTPDG TVYVAVRNSA TGTDSVAIIS TSGEVTTVDI PGTIFPVLPS VSPAVTGDDA NPNIGDNGYV AYQSGGVSYL AVVNPDGTIA RTVTLPAGAV VATPVDFGPD GAAYQVIETR DEQGRVTSRA VLALSTDTVT PMLPGAPLQP NYPSIQFGPD GSGVLITVET GQSPFEYHFL RFDQDGATIA TADLSGFLQS AQQDYVFWQE GVVFGPDGTP YATLTGADQG VWALTSTGPV KVLELDLGQG ELVEPVKFGP DGTPYVTVSE RVDGSYVTTV HTFTPVTML
|
| |