Gene Mvan_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1107 
Symbol 
ID4648517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1175999 
End bp1179364 
Gene Length3366 bp 
Protein Length1121 aa 
Translation table11 
GC content66% 
IMG OID639804607 
Producthypothetical protein 
Protein accessionYP_951950 
Protein GI120402121 
COG category[R] General function prediction only 
COG ID[COG2251] Predicted nuclease (RecB family) 
TIGRFAM ID[TIGR03491] RecB family nuclease, putative, TM0106 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.186279 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCG CACCGGCGCG GGAACGCCTG CTGACACCGT CGAAGGTGAC CGCCTGGCTG 
GACTGTCCGC ACTATCTCGC CCTGTCGGCC CGGGTCGAAG ACGGCACCAT GCCGCGGCCG
GAGCTGCGGT TCGGGTCATT CGCAGAGCTT CTGCTGAACA AGGGTCTCGC TCACGAGCAG
GACTGTCTGG CCGAGTACCG TCGCCAGGAG CGGCGCATCC TGGAGGTGCC GGCCAAAGCC
AAGGGGCAGA CGTTCGCGTC ATGGGTGGCC GAGACCGGCA ACCCGCTGGA CGGCGTGCAC
GACGTCGTCT ACCAGATGCC GTTCATCCAC AACGGCATTC GCGGCGTCGC CGACTTCGTG
GTGCGGGTGC AGGACCCGGA CACCGGGGCG GTCAGCTACG AACCGGTGGA CGCCAAGCTC
ACCCGTGTCG ACGCCAAGCC GGGCCATGTG CTGCAGTTGT GCTTCTACGC CGATGCCATC
GAAGCGCTGA CCGGCAGGCG TCCTGAGCAC ATGCACATCT GGCTGGGTTC CGGGCGCATG
GAGACGCTGC GCGTCAGCGA CTTTCAGCCT TACTGGCGGC GGCTACAAGG CCAGCTCGCG
GCGGCGCTGG CCGGGGGGCC CGCTGAGGGC ACGGTCGCCG AGCAGTGCGC GCACTGCGAA
TTCTGTGAAT TCCAGCCCAT CTGTGAAGCG CAGTGGCGGG ACGCTGATTC GCTGATCTAT
GTGGCCGGCA TCCGTAAACC CGACATCGCC ACGCTGGTCG AGGCGGACAT CGCCACCCTG
ACCGCTCTGG CCACCAGCGA CGGCCCGGTG GACATCCTGG CTCCTGACCG CTTCACCCGA
CTCCGGGGAC AAGCGGCACT GCAACTGGCG GCACGGGAAC AGAGCGACGC GCGGCCGCCA
TTCGAACTGA TCGAACCCGG CGACGAGCCC TGGGGGCACG GCTTCGAAAC GCTGCCCGAA
CCCGATGCCG GTGATGTCTT CCTCGACTTC GAAGGCCATC CGTTCTGGCG CGCCGACACT
GGCCTGTTCT TCCTGTTTGG GCTCATCGAA CAGTCGGAGG ACCGGTGGCG GTACCGCTCC
TGGTGGGCGC ATGACCCAGA CCGAGAGGCG GTAGCGGTCG ACGAGCTCGT CGACTACCTT
GCTCGTCGGC GTGAGCAGTT CCCCGGCATG CACGTCTACC ACTACAACCA CACCGAACGT
TCCGCCCTGC AACGCATGAC AGAGACCCAC GGTGTCGCAG AGGTCGAACT GGCCCAACTG
ATCGACACCG GCGCGTTCGT GGACCTGCTG CTGGTGGCGC GCAACAGTAT TCAGGTGGGC
ACCGAGTCCT ACGGGTTGAA GCACCTGGAA CGCCTCACCG ACTTCGAACG CAGCCACGAG
ATCGACCAGG GCGCCGGAGC GGTCGTCCAG TACGAGCACT ATATGGCCGA ACCCAACCAG
GACGATCTCG ACGCGATCGC CGCGTACAAC GAAGACGACG TCCGGGCCAC CCTCGCACTG
CGCGACTGGC TGGTCGGGCA CCGCCCACCC GGGTTGCCGT GGCGGCCCGC CGTCACCGAG
CCCAACCCCG AACAACCGGA ACTCGACGAG CTCGTCGTCC GCCTCCACGA ATTTCCCTCC
GGCACCGACG AACACAACCT CGGCGATCTG CTGCGCTACT GGCTCGACGA ATGGCGCGCT
TACATCGCGC CGAAGAAGGT GAAGCTGGCC GCGGATCCGC TCGACCTGCT CGACGACGCT
GAGGTCATCG CCGACCTCGG CGGCGTGGCG TTGATCGAGC GTCTGGGGGT CCGGGGCACA
CCGATCACAC CGGCGATGCG ATTCACCTTC CCCGCGCAGA ACATCGACCG GTTTCCCAGC
AGCGGCGGCA AGGTGATGAT TGCCGCTCTC CCCGAGGAAA GGCGCTCCGT AGAGATCGTC
CACCTCGACC GCGACGCTCT GACTATCGAT GTGGTGTGGA ACAAGGACCT CCAGGACGCC
GACTGGCAGC CGCGGGTGGC CGTGCTCGAT GACTGGGTGA ACACGCAACC GAAGCCCGCG
GCGCTGCAAG CCTTCGCCGA GGATCTGCTG GAAGCACGCG GCGCCAACCC GGTGACGCTC
GCCCTCCTGC GTCGCGACCT ACCGCGATTC ACCGATCAAC CCCGCACCGC GTTCGCGGAC
GATCTCGACG AGATGGCGGG GTGGGTCACC CGGCTCGACC ACAGCGTCGT CGCGGTGCAA
GGCCCACCGG GCACCGGAAA GACCTACCGC GCAGCGCGAT TGATTCGAGC CCTAGTGTGC
GCCGGCCAGC GGGTGGGCGT CACCGCTCTC AGCCACCACG CCATCGCCAA CGTGCTCGAA
GGTGTGGTCA AAGCGTTCAC CGAAACCGGA GAACTCGAGC TGCTGCACGC AGTGTGCAAT
GCGGGCACCA GCTCGGTGCA GCGAGTTCCC GGCGTCACTT ACGGCGACAA CGGCAAGTGC
GCCCGCGACG AGTTCAACGT CGTCGCCGGC ACCACTTGGC TGTTCTCCAA CGCGCTGATG
CGCAATGCTC CCGTGGATGT CCTGCTGATC GATGAAGCCG GACAGTTGGC GCTCGCCGAC
GCGTTGGCGG CCTCGGGGGC GGCGCACAAC CTCGTGCTGC TCGGGGACCC GCTTCAGCTG
CCGCAAGTCG CGCAAGCCAA ACACCCCGGT ATTTCCGGTC GCAGCGTGCT GGACCATGTC
GTGGGCGACG ACGTGCTGCT GCCGCCGGAC CGAGGTGTCT TCCTCCACGA AACCCGGCGC
ATGCATCCGG ATGTGTGTGA GTTCATCTCC ACCCAGATCT ACGACGGGCG CCTGCACAGC
TTTCCGGACT GCGGCCGACA GTCGACGGTC GCGGGAACCG GGCTGCGCTG GCTGCGGGTG
GATCACGCAG GAAACCGCAC GTTCTCGGTG CAGGAAGCCG ATGCGATCGC CCAGGAGCTT
TCCCGGCTGA TCGACACACC GTGGACCAAC CACAAGGGCG AAACAGAGCG GCTACAGGCA
GGCGATTTCA TGGTCGTCGC GCCGTACAAC CTGCAGGTCA ATACGACTCA CGCGCGACTG
GCTCAGGACG CGGCGCTGCG TGACGTTCCG GTAGGCACGG TCGACAAGTT CCAGGGCCGC
GAAGCCGCGG TGGTGTTCTT CAGCATGGCC GCTTCGAGTG GGGAGGACAT CACCAGGGGA
GTGGAGTTCC TGTTCTCCCG CAACCGACTC AACGTCGCAG TCAGCCGCGC CCGCTGCCTC
GCCTACCTCG TCTGCACCGA TGCGTTGCTG GACACCCGTG CCCGCACGGT CGAGGAAATG
CGGCTCATCT CCACCCTCAA CGCGTTCGTC GACACGGCGG CACTGCACGA AAGTCGGGAG
GTGTGA
 
Protein sequence
MTSAPARERL LTPSKVTAWL DCPHYLALSA RVEDGTMPRP ELRFGSFAEL LLNKGLAHEQ 
DCLAEYRRQE RRILEVPAKA KGQTFASWVA ETGNPLDGVH DVVYQMPFIH NGIRGVADFV
VRVQDPDTGA VSYEPVDAKL TRVDAKPGHV LQLCFYADAI EALTGRRPEH MHIWLGSGRM
ETLRVSDFQP YWRRLQGQLA AALAGGPAEG TVAEQCAHCE FCEFQPICEA QWRDADSLIY
VAGIRKPDIA TLVEADIATL TALATSDGPV DILAPDRFTR LRGQAALQLA AREQSDARPP
FELIEPGDEP WGHGFETLPE PDAGDVFLDF EGHPFWRADT GLFFLFGLIE QSEDRWRYRS
WWAHDPDREA VAVDELVDYL ARRREQFPGM HVYHYNHTER SALQRMTETH GVAEVELAQL
IDTGAFVDLL LVARNSIQVG TESYGLKHLE RLTDFERSHE IDQGAGAVVQ YEHYMAEPNQ
DDLDAIAAYN EDDVRATLAL RDWLVGHRPP GLPWRPAVTE PNPEQPELDE LVVRLHEFPS
GTDEHNLGDL LRYWLDEWRA YIAPKKVKLA ADPLDLLDDA EVIADLGGVA LIERLGVRGT
PITPAMRFTF PAQNIDRFPS SGGKVMIAAL PEERRSVEIV HLDRDALTID VVWNKDLQDA
DWQPRVAVLD DWVNTQPKPA ALQAFAEDLL EARGANPVTL ALLRRDLPRF TDQPRTAFAD
DLDEMAGWVT RLDHSVVAVQ GPPGTGKTYR AARLIRALVC AGQRVGVTAL SHHAIANVLE
GVVKAFTETG ELELLHAVCN AGTSSVQRVP GVTYGDNGKC ARDEFNVVAG TTWLFSNALM
RNAPVDVLLI DEAGQLALAD ALAASGAAHN LVLLGDPLQL PQVAQAKHPG ISGRSVLDHV
VGDDVLLPPD RGVFLHETRR MHPDVCEFIS TQIYDGRLHS FPDCGRQSTV AGTGLRWLRV
DHAGNRTFSV QEADAIAQEL SRLIDTPWTN HKGETERLQA GDFMVVAPYN LQVNTTHARL
AQDAALRDVP VGTVDKFQGR EAAVVFFSMA ASSGEDITRG VEFLFSRNRL NVAVSRARCL
AYLVCTDALL DTRARTVEEM RLISTLNAFV DTAALHESRE V