Gene Mvan_4472 details

Gene Information

Locus tag: Mvan_4472
Symbol:
ID: 4649088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4801118
End bp: 4804573
Gene Length: 3456 bp
Protein Length: 1151 aa
Translation table: 11
GC content: 70%
IMG OID: 639807942
Product: RecB family-like nuclease
Protein accession: YP_955253
Protein GI: 120405424
COG category: [R] General function prediction only
COG ID: [COG2251] Predicted nuclease (RecB family)
TIGRFAM ID: [TIGR03491] RecB family nuclease, putative, TM0106 family
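
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4801118
END_BP = 4804573


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the listed gene length (3456 bp) and protein length (1151 aa).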


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.836505
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.104003
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTCGTCG CGACCGACGC CGACGACGTT CGGGTCATCT ACAGCGCCTC CGACCTCGCC 
GCCGCGGCGC GCTGCGAGTA CGCGCTGCTG CGCTCCTTCG ATGCCCGGCT GGGCTGGGGC
CCCGCCGTCT CGGCCGACGA CGAACTGCTG GCCCGCACCG CGACGCTCGG CGACGAACAC
GAACAGCGCC ACCTCGACGA ACTGCGGCAC GACACCGAGG TCGCAGTCAT CGGCCGTCCC
CGGTACTCCG TCTCGGGTCT GACCGCGGCG GCCGAGCAGA CGCTGCGCGC GATCGAGCGG
CGCGTCCCCG TCGTCTACCA GGCGGCGATG TTCGACGGCC GCTTCGCCGG GTTCGCCGAC
TTCCTGATCC TGGAGGACTC CCCCGACGGC CCGCGTTACC GGCTGCGCGA CACCAAGCTG
GCCCGCTCGG TCAAGGTCGA GGCGCTGCTG CAGATGGCGG CCTACGTCGA AACACTCACC
GCCGCAGGCG TTCCGGTGGG CCCCGAAGTC GACCTGGTCT TGGGTGACGG CACGGCGGTC
AGCTATCCCG TCGACGAGCT GCTGCCGGTG TACCGTCCGC GACGGGCCGC GCTGCAGCGG
CTCCTCGACG AGCATTTCGC CGGCGGCCGC GCGGTGGCGT GGGAGGACGA GCACGTGCGC
GCCTGCTTCC GGTGCGCCGA ATGCGACAAG CAGGTCCGCG CGCACGACGA CCTGTTGCTC
GTCGCCGGAA TGCGGGTCAG CCAGCGGGCC CGGCTGCTCG ACGCGGGCAT CACCACCGTG
CAGGAGCTGG CCGCCCATGA CGGACCGGTG CCGGAGCTGT CGACGCGCGC CGTCACGGCG
TTGAAGTCAC AGGCCCGGCT GCAACTCGCC GACCGGGTCA TCGAGAACGG CAACGTGAAA
CCGCCCTACG AGCTCGTCGA CCCGCAACCG CTGATGGTGC TGCCCGACGT GGACAAGGGT
GACCTGTTCT TCGACTTCGA GGGCGACCCG CTGTGGACCA CGGACGGTCA CGAGTGGGGG
CTGGAATACC TGTGGGGCGT GCTCAGCGTC GCCGACGACT TCGAGCCGCT GTGGGCGCAC
AACCGAAGCG AGGAACGGCA GGCGCTCAAG GACTTCCTCG CGCTGGTGCG CAAGCGCAAG
CGCCGCTACC CCGGCATGCA CGTCTACCAC TACGCGGCCT ACGAGAAGAG CACACTGCTG
CGGCTGGCGG GCCGCTACGG CGTCGGCGAG CACGAGGTCG ACGAGCTGCT GCGTGACGGC
GTGCTCGTCG ACCTGTATCC CCTTGTGCGC AAGAGCATCC GGGTCGGCAC CGAGAACTAC
AGCATCAAAT CGCTCGAACC GCTGTACATG GGCAACGAGC TGCGTGACGG CGAGGTCACC
ACCGCCACCG CGTCGATCAC CGAATACGCC CGCTACTGCG CCCTGCGCGA CGAGGGCCGC
ACCGACGAGG CCGCCACGGT CCTCAAGGAG ATCGAGGAGT ACAACCGCTA CGACTGCCGC
TCGACCCGAC GGCTGCGGGA CTGGTTGATG GCCCGCGCGA TCGAATGCGG GGTCCCGCCG
CGTGGCCCGG TCCCGGTCAC GGCCGGCCAG GAAGGGGCCG CCGCCGACTC CCCCGACCCG
GTCGACCGCA AGCTGTTGAA GTTCGCCGGT GACGGCATCG AACCCCGCAC TCCGGAACAG
GCCGCGGTCG CAATGCTGGC TGCCGCCAAG GGCTTTCACA AACGCGAGGA CAAGCCGTAC
TGGTGGGGCC ACTTCGACCG GGTCAACAAC CCGGTCGACG AATGGGCCGA CGACGGTGGG
GTGTTCGTCG CCGAGCGGCA CGAGGTCGTC GCCGACTGGC ACCAGCCGCC ACGGGCGCGC
AAGCCGCAGC GCCACCTCCG GCTGTTCGGC GAGATAGCCA CGGGCGAGCT GGGCCGCGAG
ATGTACGCGC TCTACGACCC GCCCTCGCCT GCCGGGCTGT CCGACGATCC GGACCGGCGG
GCATTCGGCT CGGTGACCGT CACCGAATGC GACGACCCTG AGGCGCCCAC CGAAGTCGTC
ATCGTCGAGC GGCAACCCAA AGGTGGTGAC GTCTTCCCAC AGGCGCCGTT CGCGCTGACC
CCGGGCCCGC CGATCAGCAC CGCCCAACTC CAGGACGCCA TCGCCGACAC CGCCGCGCTG
GTGGCCGCCG GGCTGCCGAA CCTGCCCGCC GACGGGCTGA CCGACATCCT GCTGCGACGC
CCGCCCCGCA CCCGCAGCGG CGGCCCCCTG CCCCGCACCG GCGACGCCGT CGCCGACATC
ACCGCCGCGC TGCTGGATCT GGACTCGTCG TATCTGGCCG TGCACGGGCC ACCCGGCACC
GGCAAGACCC ACACCTCGGC ACAGGTGATC GCCACCCTGG TCGGCAGGCA CGGATGGACG
GTCGGCGTGG TGGCGCAATC GCATGCGGTG GTGGAGAACC TGTTCACCGA CGTGATGCGG
GCCGGGGTGG ACGGCACACG CATCGGCAAG AAGGCGCACA CGGCGCACAC GGTCAGTGGC
GGCTGGACCG AACTGGACCG CGACGACTAT GCCGAGTTCC TCCGCCAGGA CGGCTGCGTC
GTCGGCGGAA CGGCATGGGA TTTCGCCAAC GACAACAAGT TCACCCGCGG CTGTCTCGAC
CTTCTGGTGA TCGAGGAGGC GGGCCAGTTC AGCCTGGCCA ACACCGTGGC GGTGTCGCGG
GCCGCACGCA ACCTGCTGCT GCTGGGCGAC CCTCAGCAGC TGCCCCAGGT CAGCCAGGGC
ACCCATCCCG AACCCGTCGA CGGCTCGGCG CTCGGCTGGC TCGTCGACGG CCACCACACG
CTGCCGCCCG AGCGCGGGTA CTTCCTGGAC CGCTCCTACC GCATGCATCC CGACGTGTGC
CGGGCGGTGT CCCGGTTGTC CTACGACGGG CGGCTGCTGT CCAACGAGCA CGTCACCGCC
GCCCGCCGCC TCGACGGCGT CACCCCCGGG GTGCGGACGC TGGAGGTCGA TCACCTCGGC
AACGCCACCG AAAGCCCGGA GGAGGCCGAC GCGATCGTCA CCGCGATCAC CGGCCTGCTC
GCAACGCCGT GGACCGACGA GGGCGGCACC CGACCGCTGG CCCAGCGCGA CGTGCTCATC
GTGACCCCAT ACAACGCGCA GGTGGTGCTG GTTCGGCGCC GCCTCGACGC GGCGGGGCTG
ACCGAGGTGC GGGCGGGCAC CGTGGACAAG TTCCAGGGCC AGCAGGCGCC CGTGGTGTTC
GTGTCGATGA CGGCATCCTC GATCGACGAC GTGCCCCGCG GAATCGCCTT CCTGCTCAAC
AGAAACCGCC TCAACGTGGC GGTCAGCCGG GCCAAGTACC TGGCCGTGAT CGTGCGTTCG
CAGCATCTGA CCGACTACCT GCCCGGAACC CCGGACGGGC TGGTGCAGCT GGGCGCGTTC
CTGTCCCTGG CCACATGTGA TACACCCACG GAGTGA
 
Protein sequence
MFVATDADDV RVIYSASDLA AAARCEYALL RSFDARLGWG PAVSADDELL ARTATLGDEH 
EQRHLDELRH DTEVAVIGRP RYSVSGLTAA AEQTLRAIER RVPVVYQAAM FDGRFAGFAD
FLILEDSPDG PRYRLRDTKL ARSVKVEALL QMAAYVETLT AAGVPVGPEV DLVLGDGTAV
SYPVDELLPV YRPRRAALQR LLDEHFAGGR AVAWEDEHVR ACFRCAECDK QVRAHDDLLL
VAGMRVSQRA RLLDAGITTV QELAAHDGPV PELSTRAVTA LKSQARLQLA DRVIENGNVK
PPYELVDPQP LMVLPDVDKG DLFFDFEGDP LWTTDGHEWG LEYLWGVLSV ADDFEPLWAH
NRSEERQALK DFLALVRKRK RRYPGMHVYH YAAYEKSTLL RLAGRYGVGE HEVDELLRDG
VLVDLYPLVR KSIRVGTENY SIKSLEPLYM GNELRDGEVT TATASITEYA RYCALRDEGR
TDEAATVLKE IEEYNRYDCR STRRLRDWLM ARAIECGVPP RGPVPVTAGQ EGAAADSPDP
VDRKLLKFAG DGIEPRTPEQ AAVAMLAAAK GFHKREDKPY WWGHFDRVNN PVDEWADDGG
VFVAERHEVV ADWHQPPRAR KPQRHLRLFG EIATGELGRE MYALYDPPSP AGLSDDPDRR
AFGSVTVTEC DDPEAPTEVV IVERQPKGGD VFPQAPFALT PGPPISTAQL QDAIADTAAL
VAAGLPNLPA DGLTDILLRR PPRTRSGGPL PRTGDAVADI TAALLDLDSS YLAVHGPPGT
GKTHTSAQVI ATLVGRHGWT VGVVAQSHAV VENLFTDVMR AGVDGTRIGK KAHTAHTVSG
GWTELDRDDY AEFLRQDGCV VGGTAWDFAN DNKFTRGCLD LLVIEEAGQF SLANTVAVSR
AARNLLLLGD PQQLPQVSQG THPEPVDGSA LGWLVDGHHT LPPERGYFLD RSYRMHPDVC
RAVSRLSYDG RLLSNEHVTA ARRLDGVTPG VRTLEVDHLG NATESPEEAD AIVTAITGLL
ATPWTDEGGT RPLAQRDVLI VTPYNAQVVL VRRRLDAAGL TEVRAGTVDK FQGQQAPVVF
VSMTASSIDD VPRGIAFLLN RNRLNVAVSR AKYLAVIVRS QHLTDYLPGT PDGLVQLGAF
LSLATCDTPT E
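
The gene and protein sequences above are related through translation table 11, which shares its codon-to-amino-acid assignments with the standard code but permits alternative start codons such as GTG. The sketch below is illustrative only: the helper names are hypothetical, the start-codon set is reduced to ATG, GTG and TTG as a simplification, and only the first line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; table 11 uses the same assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Simplified set of initiator codons (assumption; table 11 allows more).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out 10-letter blocks."""
    return "".join(text.split()).upper()


def translate_cds(dna: str, first_codon_is_start: bool = True) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and first_codon_is_start and codon in START_CODONS:
            protein.append("M")  # initiator codons are read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


first_line = clean_sequence(
    "GTGTTCGTCG CGACCGACGC CGACGACGTT CGGGTCATCT ACAGCGCCTC CGACCTCGCC"
)
print(translate_cds(first_line))  # MFVATDADDVRVIYSASDLA
```

The printed residues match the first two blocks of the protein sequence above. Applying `clean_sequence` and `gc_fraction` to the full gene sequence would give the value to compare with the listed GC content.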