Gene Information |
Locus tag | Mvan_5042 |
Symbol | |
ID | 4644779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 5395300 |
End bp | 5398020 |
Gene Length | 2721 bp |
Protein Length | 906 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639808513 |
Product | hypothetical protein |
Protein accession | YP_955820 |
Protein GI | 120405991 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01965] VCBS repeat |
|
|
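The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5395300
end_bp = 5398020
gene_length_bp = 2721
protein_length_aa = 906

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which is not translated,
# so a CDS of N codons encodes N - 1 amino acids.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print("span (bp):", span)
print("leftover bases after whole codons:", remainder)
print("expected protein length (aa):", expected_protein_length)
print("matches record:", span == gene_length_bp and expected_protein_length == protein_length_aa)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.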
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.745587 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCTGA CGCGTGGTGG TCGTCCGGCG GGAAGCAGTC GACCCTTTCG GCCGACGCTG CGCTGGGCAC AGGTCGGGGT GGCCGGCACC GGGATGGCGT TTGCGCTGCT GGCCGCACCG GCTGTGGCCA CCGCCGACAC GGGCGCCGAC GAGGGAGGCG CCGACTCGTC GGCCACCACG GATGCGGCGT CGACGCCCTC ACCGCGTGCC GATTCACTTT CACGGCACAC CGCTGTCGGG GACAGCACCG GTGAATCCGA CGCTCGCTCC GCCGAAGACG AGGCAGAGCG GGTGCTCGGC ACCCTTCACC CGGAGACCTC GGAACGTGCC GTGTCCCCAC CAGGCCCGAG GACCGAGCAC GAGGCGCTGT CAGACGATCC GGAGCCCGCG GTAGAACCGG ACCCACCGGA GGCGGGCCCA GCGCTGGACA CCCCATCGCG GGACACAGCC CCCACCCCGG GGAAACCGGA ATCCACCCAC GATCCCCAGG ACGGTGACGC CGAGACCGCG GCTGACGGGG CCCGGGCAAC TAGCGCAACG TCGACACCGT CGGCCCGGAT GACCCTCCGT GGCCCGACCG CGCCCGGTTC CTCCACGGCC CCGCTGTCGG ACACCGCACA GATGGCGAAC GAAGCTGGAA GCCCGATGCT CTCACCGGAC CCGCAGGACG AATCCGTGAC TGAACCAGGG GGCGTGGAGG CGCTGCCGCT CGGCGCTCTG CACGCCGCTG CTTCTTCGAC CAGCTACCCG GCCTATCCGG CGCCGGTGGA CGCACCGGTG ACCTGGCGCT CGATCGTCTC CGACGCCCTG TCCTGGATCG GGCTGGGAAT GGCGACCGAC CAGCACATCC CAGATGGCCC GATCAATGAT CTGCTGGCCG GGCTGTGGGT TGGCATGCGC CGACTGCACT ACACGTTCTT CAACTCGTCA CCCGAGCTGG ACCACGGCGC CGCCACCGAA GATCCCGACA CCGGGATCAT CACCGGCGAC CTCGAAGCCC ATGACGCCGA CGGTGATGTC ATCACCTTCG TGCTGACCGG CGCACCCACT CACGGCACAG TGAGCTTCGC CGAAGACGGC CGGTACACCT ACATTCCCGA CACCGCCTTC GCCGCCACCG GCGGCACCGA CACCTTCACC GTCACCGCCA CGGACACCGG CGGCGCCAAC CCGTGGCACA CCAACCTTCC CCGCCTGCTC TGGTCGGCCC TGAGACCGTT GCTGTCCGCA CTCGGATTCA CCGCGCCTGT TGATTCCTCC AGCACGACCA CCGTCACCGT CACCGTCACC CCGCAGGCCT GCAGGACCGA CGGCGCCGGA GCTGAGTGCG CGGCGGCCAG GGCGCCAAAG ATCACCTTGC ACAACAACTC TGAGCACACG ATCTGGGTGT ACAACCTGCC GAGCTCCGGC GACTACAGCA TCGGCGCGGA CTTCACCCCG GTGTCCATCG CGAAGGGCGC CAGCGCACCG GTGACCCTCG CCGTCGGCAC CGGGTCACCC GGTTCACCCC AGAACCGGAT CTACATCGTC GAGGGCGAGA CCGGTTTCAC GCTGCCGGTC AGCTCGTCAT CCGGGGTGGA CGCATTCAAC CCGACCGCAC CGTCGGAAGG AAACTCCTTC CTGAACTACA ACTTCGTGGA GTACTACCTC TATCCCGATG GCGGCGGCGG ATACCAGTAC ACCATCGACA CCTCCTACAT CGACGAGTGG TCGCTGCCGA TCCAGTACAA ATTCACGCTC AACGGCGCCC GGTGGTCGGG GGCCGTTGAC GGACATACCT ACGGCTTCGA CGACTACGAC 
ACGGTGGTCA ATCAACTCAA TGCCGCCGGG GGGCCCTACA AGCATCTGGT TTGGGGCGGC GGCACACCGT GGGCCCCCCA GCCGCCGTCC ACGGTGCATC GAATCATCGG GCCCGACAAG GTCTGGACCG CACAGGCGAG TCAGCCGGCA AGCAATGTCA ACATGAACCA CGTCGGCTGG GTGCCCACCT CGTATCAAGA CTTCGTCCAA TACGACTCCC ACACGGAACC GGACGGACAC GTCGTCTACC CCTACGCGCA GAACGGCACG AAGTACTCTC GCGACGGCAA TTTCAGCTTC TGGAAGAACG AAGTGGATGC CCCGGCGTCC ACGCCCTATC CGATTGCCTT GCGCACGGCC GCCGTCCTCG ACGGCTTTCC CGCCAAGAAC GGTGTGTACG GATTCTTCAC CTATCCCAAT GACGAGACGG CTGGCCAGTT CACCAACATC CCCACGTCGG TGTCCCTCGA CATCTACGTT CACGGCTCCT CAGACGGGGT CAGCGACAGT GTGATCGAAG GGGGCAGCTG GTTCTACACC AGCACGACTT CACCGTCCGG GCGGGGGTTG GCGAATCGCC GGCACGTGGT CACCGGGTCG AGCGCCACCG ACACCTTCAT CCTGGATTCG GTGTTCACCC GCAGCCGAAC CGCACCGGTC GTCGTCGCCG AGGCCGTCCA GGGCGACATC GTGGTGATCG ACCGGACAGC TTTGGGGGCA ACCAGCTACG AAGTGGACGT CGTTGACCGC GCGTGGTTCC TCGGGGGCGG GCTCGCCAAG TACGACAGCC AGTTCGTCTA CGACCGCTCG ACCGGAATCC TGTACTACGA CCAAGATCCC GACCGGTTCG GCTACACCGG CGTCCTGGCC AACCTGTCGT GCAGCTCTGC CGACGCGGCC AGCGTGGTGT TCGTGCTCTG A
|
Protein sequence | MSLTRGGRPA GSSRPFRPTL RWAQVGVAGT GMAFALLAAP AVATADTGAD EGGADSSATT DAASTPSPRA DSLSRHTAVG DSTGESDARS AEDEAERVLG TLHPETSERA VSPPGPRTEH EALSDDPEPA VEPDPPEAGP ALDTPSRDTA PTPGKPESTH DPQDGDAETA ADGARATSAT STPSARMTLR GPTAPGSSTA PLSDTAQMAN EAGSPMLSPD PQDESVTEPG GVEALPLGAL HAAASSTSYP AYPAPVDAPV TWRSIVSDAL SWIGLGMATD QHIPDGPIND LLAGLWVGMR RLHYTFFNSS PELDHGAATE DPDTGIITGD LEAHDADGDV ITFVLTGAPT HGTVSFAEDG RYTYIPDTAF AATGGTDTFT VTATDTGGAN PWHTNLPRLL WSALRPLLSA LGFTAPVDSS STTTVTVTVT PQACRTDGAG AECAAARAPK ITLHNNSEHT IWVYNLPSSG DYSIGADFTP VSIAKGASAP VTLAVGTGSP GSPQNRIYIV EGETGFTLPV SSSSGVDAFN PTAPSEGNSF LNYNFVEYYL YPDGGGGYQY TIDTSYIDEW SLPIQYKFTL NGARWSGAVD GHTYGFDDYD TVVNQLNAAG GPYKHLVWGG GTPWAPQPPS TVHRIIGPDK VWTAQASQPA SNVNMNHVGW VPTSYQDFVQ YDSHTEPDGH VVYPYAQNGT KYSRDGNFSF WKNEVDAPAS TPYPIALRTA AVLDGFPAKN GVYGFFTYPN DETAGQFTNI PTSVSLDIYV HGSSDGVSDS VIEGGSWFYT STTSPSGRGL ANRRHVVTGS SATDTFILDS VFTRSRTAPV VVAEAVQGDI VVIDRTALGA TSYEVDVVDR AWFLGGGLAK YDSQFVYDRS TGILYYDQDP DRFGYTGVLA NLSCSSADAA SVVFVL
|
| |
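The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be cleaned and its GC fraction computed, the snippet below uses only the first three blocks of the gene sequence as a demo; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the full sequence would need to be pasted in to compare against the GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C; an empty sequence gives 0.0."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Demo input: the first three ten-base blocks of the gene sequence field.
demo_blocks = "GTGTCCCTGA CGCGTGGTGG TCGTCCGGCG"
demo_seq = clean_sequence(demo_blocks)

print("length:", len(demo_seq))
print("GC fraction: {:.3f}".format(gc_fraction(demo_seq)))
# The gene begins with GTG, an alternative start codon under translation table 11.
print("starts with GTG:", demo_seq.startswith("GTG"))
```

The demo covers only 30 bases, so its GC fraction is not expected to equal the whole-gene value.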