Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_1814 |
Symbol | |
ID | 4644070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 1926410 |
End bp | 1929418 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639805302 |
Product | hypothetical protein |
Protein accession | YP_952642 |
Protein GI | 120402813 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.693843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0180395 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCCCG CGGCGAGAAT GCCGAATCTG ACGCGTCGAA GCCGGGTAAT GATCGCCGTC GCCCTGGCCG TCGTGGTGCT GTTGCTGTTG GGCCCGCGGC TCGTCGACAC CTACGTCAAC TGGTTGTGGT TCGGGGAGCT CGGCTACCGA TCGGTGTTCA CCACCCAGAT CGTGACCCGG TTGCTCCTGT TCCTGGCGGT GGCCGTCGTC TTCGGTGCCG TCGTGTTCGC CGCAATGGCG TTGGCCTACC GCACCCGGCC GGTGTTCGTG CCGACCGCCG GGCCCAACGA TCCGATCGCG CGCTACCGCA CCGCGGTGAT GGCCCGGCTG CGGCTGGTCG GCATCGGGGT TCCGGTCGCC GTCGGCCTGC TGGCCGGCCT GATCGCCCAG AACTACTGGC AGCGTGTGCA GCTGTTCCTG CACGGCGGCA GCTTCGGGGT GTCCGACCCG CAGTTCGGCA TCGACCTCGG CTTCTACGCG TTCGACCTGC CGTTCTACCG CTTGATGCTG ACGTACCTGT TCGCCGCGAC GTTCCTGGCG TTCATCGCGA ATCTGCTGGG TCACTACCTG TTCGGGGGCA TCCGGCTGGC CGGGCGCAGC GGCGCGCTGA GCCGGGCGGC CCGCATCCAG CTGATCGCTC TGGTCGGGTT CCTGATGCTG CTGAAGGCGG TCGCCTACTG GCTAGACCGC TACGAGTTGC TCAGCCATAC CCGCGGCGGC AAGCCGTTCA CCGGAGCCGG GTACACCGAC ATCAACGCGG TGCTGCCGGC CAAGCTGATC CTGATGGTCA TCGCGGTGAT CTGCGCCGCG GCGGTGTTCT CCGCGATCGT GCTGCGCGAC TTGCGGATTC CCGCGATCGG TGTGGTGCTG CTGCTGCTGT CCTCGCTGAT CGTGGGTGCG GGCTGGCCGC TGGTGGTGGA ACAGATCAGC GTGCGCCCCA ACGCCGCGCA GAAGGAAAGC GAATACATCA GCCGAAGTAT CACCGCCACC AGACAGGCCT ACGGGCTGAC CGACGAGGCG GTGGAGTACC GCGACTACCC CGGTAACGCC ACGGCGACGG CGCAGCAGGT GGCCGCCGAC CGCGCCACGA CGTCCAACAT CCGGGTGCTC GACCCGAACA TCGTCAGCCC GGCGTTCACC CAGTTCCAGC AGGGTAAGAA CTTCTACTTC TTCCCCGACC AGCTGAACAT GGACCGCTAC CGCGACGAGG ACGGCAATCT GCGTGATTAC GTGGTGGCCG CCCGCGAGCT CAACCCGGAC CGACTGATCG ACAACCAGCG TGACTGGATC AACCGGCACT CGGTGTACAC CCACGGCAAC GGCTTCATCG CCTCGCCGGC CAACACCGTG CGCGGAATCG CCAACGACCC CAACCAGAAC GGCGGTTACC CGGAGTTTCT GGCCAGCGTC GTGGGCGCCA ACGGTGAGGT CGTCTCGCCC GGGCCGGCCC CGCTGGATCA GCCGCGCATC TACTTCGGCC CGGTGATCGC CAACACCCCC GCCGACTACG CGATCGTCGG CGAGAGCGGC ACCCCGCGCG AGTACGACTA CGAGACCAAC ACCGCCACCC GCAACTACAC CTACACCGGC AGCGGCGGCG TGCCGATCGG CAACTGGCTG ACCCGCAGCG TGTTCGCCGC CAAGTACGCC GAGCGGAACT TCCTGTTCTC GAACGTCATC GGCGAGAACA GCAAGATCCT GTTCAACCGT GACCCTGCCG ACCGGGTGGA GGCGGTCGCG CCGTGGCTGA CCACCGACAC CGCGGTCTAC CCTGCGATCG TCAACAAGCG CATCGTCTGG ATCGTCGACG GGTACACCAC GCTGGACAAC TACCCGTACT CGGAGTTGAT GTCGTTGTCG TCGGCCACCA CCGACTCCAA CGAGGTGGCG CTGAACCGGC TGCAGCCCGA CAAGCAGGTG TCCTACATCC GCAACTCGGT CAAGGCCACC GTCGACGCCT ACGACGGCAC CGTGACGCTG TACGCCCAGG ACGAGCAGGA CCCGGTGCTG CAGGCGTGGA TGAAGGTGTT CCCGGACACC GTCAAGCCCA AGGCTGACAT CACCCCCGAA CTGCAGGAGC ACCTGCGCTA TCCGGAGGAC CTGTTCAAGG TGCAGCGCGC GCTGCTGGCC AAGTACCACG TCGACGACCC GGTGACGTTC TTCTCGACGT CGGACTTCTG GGATGTCCCG CTCGACCCGA ACCCGACGGC CAGCAGCTAC CAGCCGCCGT ACTACATCGT CGCCAAAGAC CTTGCCGAGA ACAACAATTC GTCGTCGTTC CAGCTGACCA GTGCGATGAA CCGGTTCCGG CGCGACTTCC TGGCCGCCTA CATCAGCGCC AGCTCGGATC CCGAGACGTA CGGCAAGCTC ACCGTGCTGA CCATTCCCGG TCAGGTCAAC GGGCCCAAGC TGGCGTTCAA CGCGATCAGC ACCGACACCG CCGTCAGCCA GGACCTCGGT GTCATCGGCC GTGACAACCA GAACCGGATC CGCTGGGGCA ATCTGCTGAC GCTGCCGATG GGGCAGGGCG GATTGCTTTA TGTCGCACCG GTTTACGCCT CACCGGGCGC CAGCGACGCG GCATCGTCGT ATCCGCGTCT GATCCGCGTC GCGATGATGT ACAACGACCA GATCGGTTAC GGGCCGACCG TGCGCGACGC GCTGACCGAC CTGTTCGGCC CCGGCGCGGA TGCCACCGCG ACAGGACCTG CGGCGACGGA ACCGCCCGCC GGTCAGGCGC CGCAACCGCA GGGGAACAAC CAGCCGCCTG CCGCGGCACC GCCGAACCGG CCGGGACAGG CCCCGACGCC GCAACAGCCG GAGGTGCCGG TGGCGGTGCC GCCGACCGGG CCGACCCAGC TGTCCGCCGG GAAAGCTGCT GCGCTGCAGG ACGTCAACGC GGCACTGGAC GCGCTGCAGG ACGCGCAACG CAGCGGTGAT TTCGCGCAGT ACGGTGAGGC GCTGCAACGC CTCGACGACG CGGTGAACAA GTACCAGGCG ACGAACTAG
|
Protein sequence | MRPAARMPNL TRRSRVMIAV ALAVVVLLLL GPRLVDTYVN WLWFGELGYR SVFTTQIVTR LLLFLAVAVV FGAVVFAAMA LAYRTRPVFV PTAGPNDPIA RYRTAVMARL RLVGIGVPVA VGLLAGLIAQ NYWQRVQLFL HGGSFGVSDP QFGIDLGFYA FDLPFYRLML TYLFAATFLA FIANLLGHYL FGGIRLAGRS GALSRAARIQ LIALVGFLML LKAVAYWLDR YELLSHTRGG KPFTGAGYTD INAVLPAKLI LMVIAVICAA AVFSAIVLRD LRIPAIGVVL LLLSSLIVGA GWPLVVEQIS VRPNAAQKES EYISRSITAT RQAYGLTDEA VEYRDYPGNA TATAQQVAAD RATTSNIRVL DPNIVSPAFT QFQQGKNFYF FPDQLNMDRY RDEDGNLRDY VVAARELNPD RLIDNQRDWI NRHSVYTHGN GFIASPANTV RGIANDPNQN GGYPEFLASV VGANGEVVSP GPAPLDQPRI YFGPVIANTP ADYAIVGESG TPREYDYETN TATRNYTYTG SGGVPIGNWL TRSVFAAKYA ERNFLFSNVI GENSKILFNR DPADRVEAVA PWLTTDTAVY PAIVNKRIVW IVDGYTTLDN YPYSELMSLS SATTDSNEVA LNRLQPDKQV SYIRNSVKAT VDAYDGTVTL YAQDEQDPVL QAWMKVFPDT VKPKADITPE LQEHLRYPED LFKVQRALLA KYHVDDPVTF FSTSDFWDVP LDPNPTASSY QPPYYIVAKD LAENNNSSSF QLTSAMNRFR RDFLAAYISA SSDPETYGKL TVLTIPGQVN GPKLAFNAIS TDTAVSQDLG VIGRDNQNRI RWGNLLTLPM GQGGLLYVAP VYASPGASDA ASSYPRLIRV AMMYNDQIGY GPTVRDALTD LFGPGADATA TGPAATEPPA GQAPQPQGNN QPPAAAPPNR PGQAPTPQQP EVPVAVPPTG PTQLSAGKAA ALQDVNAALD ALQDAQRSGD FAQYGEALQR LDDAVNKYQA TN
|
| |