Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_4472 |
Symbol | |
ID | 4649088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 4801118 |
End bp | 4804573 |
Gene Length | 3456 bp |
Protein Length | 1151 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639807942 |
Product | RecB family-like nuclease |
Protein accession | YP_955253 |
Protein GI | 120405424 |
COG category | [R] General function prediction only |
COG ID | [COG2251] Predicted nuclease (RecB family) |
TIGRFAM ID | [TIGR03491] RecB family nuclease, putative, TM0106 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.836505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.104003 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCGTCG CGACCGACGC CGACGACGTT CGGGTCATCT ACAGCGCCTC CGACCTCGCC GCCGCGGCGC GCTGCGAGTA CGCGCTGCTG CGCTCCTTCG ATGCCCGGCT GGGCTGGGGC CCCGCCGTCT CGGCCGACGA CGAACTGCTG GCCCGCACCG CGACGCTCGG CGACGAACAC GAACAGCGCC ACCTCGACGA ACTGCGGCAC GACACCGAGG TCGCAGTCAT CGGCCGTCCC CGGTACTCCG TCTCGGGTCT GACCGCGGCG GCCGAGCAGA CGCTGCGCGC GATCGAGCGG CGCGTCCCCG TCGTCTACCA GGCGGCGATG TTCGACGGCC GCTTCGCCGG GTTCGCCGAC TTCCTGATCC TGGAGGACTC CCCCGACGGC CCGCGTTACC GGCTGCGCGA CACCAAGCTG GCCCGCTCGG TCAAGGTCGA GGCGCTGCTG CAGATGGCGG CCTACGTCGA AACACTCACC GCCGCAGGCG TTCCGGTGGG CCCCGAAGTC GACCTGGTCT TGGGTGACGG CACGGCGGTC AGCTATCCCG TCGACGAGCT GCTGCCGGTG TACCGTCCGC GACGGGCCGC GCTGCAGCGG CTCCTCGACG AGCATTTCGC CGGCGGCCGC GCGGTGGCGT GGGAGGACGA GCACGTGCGC GCCTGCTTCC GGTGCGCCGA ATGCGACAAG CAGGTCCGCG CGCACGACGA CCTGTTGCTC GTCGCCGGAA TGCGGGTCAG CCAGCGGGCC CGGCTGCTCG ACGCGGGCAT CACCACCGTG CAGGAGCTGG CCGCCCATGA CGGACCGGTG CCGGAGCTGT CGACGCGCGC CGTCACGGCG TTGAAGTCAC AGGCCCGGCT GCAACTCGCC GACCGGGTCA TCGAGAACGG CAACGTGAAA CCGCCCTACG AGCTCGTCGA CCCGCAACCG CTGATGGTGC TGCCCGACGT GGACAAGGGT GACCTGTTCT TCGACTTCGA GGGCGACCCG CTGTGGACCA CGGACGGTCA CGAGTGGGGG CTGGAATACC TGTGGGGCGT GCTCAGCGTC GCCGACGACT TCGAGCCGCT GTGGGCGCAC AACCGAAGCG AGGAACGGCA GGCGCTCAAG GACTTCCTCG CGCTGGTGCG CAAGCGCAAG CGCCGCTACC CCGGCATGCA CGTCTACCAC TACGCGGCCT ACGAGAAGAG CACACTGCTG CGGCTGGCGG GCCGCTACGG CGTCGGCGAG CACGAGGTCG ACGAGCTGCT GCGTGACGGC GTGCTCGTCG ACCTGTATCC CCTTGTGCGC AAGAGCATCC GGGTCGGCAC CGAGAACTAC AGCATCAAAT CGCTCGAACC GCTGTACATG GGCAACGAGC TGCGTGACGG CGAGGTCACC ACCGCCACCG CGTCGATCAC CGAATACGCC CGCTACTGCG CCCTGCGCGA CGAGGGCCGC ACCGACGAGG CCGCCACGGT CCTCAAGGAG ATCGAGGAGT ACAACCGCTA CGACTGCCGC TCGACCCGAC GGCTGCGGGA CTGGTTGATG GCCCGCGCGA TCGAATGCGG GGTCCCGCCG CGTGGCCCGG TCCCGGTCAC GGCCGGCCAG GAAGGGGCCG CCGCCGACTC CCCCGACCCG GTCGACCGCA AGCTGTTGAA GTTCGCCGGT GACGGCATCG AACCCCGCAC TCCGGAACAG GCCGCGGTCG CAATGCTGGC TGCCGCCAAG GGCTTTCACA AACGCGAGGA CAAGCCGTAC TGGTGGGGCC ACTTCGACCG GGTCAACAAC CCGGTCGACG AATGGGCCGA CGACGGTGGG GTGTTCGTCG CCGAGCGGCA CGAGGTCGTC GCCGACTGGC ACCAGCCGCC ACGGGCGCGC AAGCCGCAGC GCCACCTCCG GCTGTTCGGC GAGATAGCCA CGGGCGAGCT GGGCCGCGAG ATGTACGCGC TCTACGACCC GCCCTCGCCT GCCGGGCTGT CCGACGATCC GGACCGGCGG GCATTCGGCT CGGTGACCGT CACCGAATGC GACGACCCTG AGGCGCCCAC CGAAGTCGTC ATCGTCGAGC GGCAACCCAA AGGTGGTGAC GTCTTCCCAC AGGCGCCGTT CGCGCTGACC CCGGGCCCGC CGATCAGCAC CGCCCAACTC CAGGACGCCA TCGCCGACAC CGCCGCGCTG GTGGCCGCCG GGCTGCCGAA CCTGCCCGCC GACGGGCTGA CCGACATCCT GCTGCGACGC CCGCCCCGCA CCCGCAGCGG CGGCCCCCTG CCCCGCACCG GCGACGCCGT CGCCGACATC ACCGCCGCGC TGCTGGATCT GGACTCGTCG TATCTGGCCG TGCACGGGCC ACCCGGCACC GGCAAGACCC ACACCTCGGC ACAGGTGATC GCCACCCTGG TCGGCAGGCA CGGATGGACG GTCGGCGTGG TGGCGCAATC GCATGCGGTG GTGGAGAACC TGTTCACCGA CGTGATGCGG GCCGGGGTGG ACGGCACACG CATCGGCAAG AAGGCGCACA CGGCGCACAC GGTCAGTGGC GGCTGGACCG AACTGGACCG CGACGACTAT GCCGAGTTCC TCCGCCAGGA CGGCTGCGTC GTCGGCGGAA CGGCATGGGA TTTCGCCAAC GACAACAAGT TCACCCGCGG CTGTCTCGAC CTTCTGGTGA TCGAGGAGGC GGGCCAGTTC AGCCTGGCCA ACACCGTGGC GGTGTCGCGG GCCGCACGCA ACCTGCTGCT GCTGGGCGAC CCTCAGCAGC TGCCCCAGGT CAGCCAGGGC ACCCATCCCG AACCCGTCGA CGGCTCGGCG CTCGGCTGGC TCGTCGACGG CCACCACACG CTGCCGCCCG AGCGCGGGTA CTTCCTGGAC CGCTCCTACC GCATGCATCC CGACGTGTGC CGGGCGGTGT CCCGGTTGTC CTACGACGGG CGGCTGCTGT CCAACGAGCA CGTCACCGCC GCCCGCCGCC TCGACGGCGT CACCCCCGGG GTGCGGACGC TGGAGGTCGA TCACCTCGGC AACGCCACCG AAAGCCCGGA GGAGGCCGAC GCGATCGTCA CCGCGATCAC CGGCCTGCTC GCAACGCCGT GGACCGACGA GGGCGGCACC CGACCGCTGG CCCAGCGCGA CGTGCTCATC GTGACCCCAT ACAACGCGCA GGTGGTGCTG GTTCGGCGCC GCCTCGACGC GGCGGGGCTG ACCGAGGTGC GGGCGGGCAC CGTGGACAAG TTCCAGGGCC AGCAGGCGCC CGTGGTGTTC GTGTCGATGA CGGCATCCTC GATCGACGAC GTGCCCCGCG GAATCGCCTT CCTGCTCAAC AGAAACCGCC TCAACGTGGC GGTCAGCCGG GCCAAGTACC TGGCCGTGAT CGTGCGTTCG CAGCATCTGA CCGACTACCT GCCCGGAACC CCGGACGGGC TGGTGCAGCT GGGCGCGTTC CTGTCCCTGG CCACATGTGA TACACCCACG GAGTGA
|
Protein sequence | MFVATDADDV RVIYSASDLA AAARCEYALL RSFDARLGWG PAVSADDELL ARTATLGDEH EQRHLDELRH DTEVAVIGRP RYSVSGLTAA AEQTLRAIER RVPVVYQAAM FDGRFAGFAD FLILEDSPDG PRYRLRDTKL ARSVKVEALL QMAAYVETLT AAGVPVGPEV DLVLGDGTAV SYPVDELLPV YRPRRAALQR LLDEHFAGGR AVAWEDEHVR ACFRCAECDK QVRAHDDLLL VAGMRVSQRA RLLDAGITTV QELAAHDGPV PELSTRAVTA LKSQARLQLA DRVIENGNVK PPYELVDPQP LMVLPDVDKG DLFFDFEGDP LWTTDGHEWG LEYLWGVLSV ADDFEPLWAH NRSEERQALK DFLALVRKRK RRYPGMHVYH YAAYEKSTLL RLAGRYGVGE HEVDELLRDG VLVDLYPLVR KSIRVGTENY SIKSLEPLYM GNELRDGEVT TATASITEYA RYCALRDEGR TDEAATVLKE IEEYNRYDCR STRRLRDWLM ARAIECGVPP RGPVPVTAGQ EGAAADSPDP VDRKLLKFAG DGIEPRTPEQ AAVAMLAAAK GFHKREDKPY WWGHFDRVNN PVDEWADDGG VFVAERHEVV ADWHQPPRAR KPQRHLRLFG EIATGELGRE MYALYDPPSP AGLSDDPDRR AFGSVTVTEC DDPEAPTEVV IVERQPKGGD VFPQAPFALT PGPPISTAQL QDAIADTAAL VAAGLPNLPA DGLTDILLRR PPRTRSGGPL PRTGDAVADI TAALLDLDSS YLAVHGPPGT GKTHTSAQVI ATLVGRHGWT VGVVAQSHAV VENLFTDVMR AGVDGTRIGK KAHTAHTVSG GWTELDRDDY AEFLRQDGCV VGGTAWDFAN DNKFTRGCLD LLVIEEAGQF SLANTVAVSR AARNLLLLGD PQQLPQVSQG THPEPVDGSA LGWLVDGHHT LPPERGYFLD RSYRMHPDVC RAVSRLSYDG RLLSNEHVTA ARRLDGVTPG VRTLEVDHLG NATESPEEAD AIVTAITGLL ATPWTDEGGT RPLAQRDVLI VTPYNAQVVL VRRRLDAAGL TEVRAGTVDK FQGQQAPVVF VSMTASSIDD VPRGIAFLLN RNRLNVAVSR AKYLAVIVRS QHLTDYLPGT PDGLVQLGAF LSLATCDTPT E
|
| |