Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_1107 |
Symbol | |
ID | 4648517 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 1175999 |
End bp | 1179364 |
Gene Length | 3366 bp |
Protein Length | 1121 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639804607 |
Product | hypothetical protein |
Protein accession | YP_951950 |
Protein GI | 120402121 |
COG category | [R] General function prediction only |
COG ID | [COG2251] Predicted nuclease (RecB family) |
TIGRFAM ID | [TIGR03491] RecB family nuclease, putative, TM0106 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.186279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCG CACCGGCGCG GGAACGCCTG CTGACACCGT CGAAGGTGAC CGCCTGGCTG GACTGTCCGC ACTATCTCGC CCTGTCGGCC CGGGTCGAAG ACGGCACCAT GCCGCGGCCG GAGCTGCGGT TCGGGTCATT CGCAGAGCTT CTGCTGAACA AGGGTCTCGC TCACGAGCAG GACTGTCTGG CCGAGTACCG TCGCCAGGAG CGGCGCATCC TGGAGGTGCC GGCCAAAGCC AAGGGGCAGA CGTTCGCGTC ATGGGTGGCC GAGACCGGCA ACCCGCTGGA CGGCGTGCAC GACGTCGTCT ACCAGATGCC GTTCATCCAC AACGGCATTC GCGGCGTCGC CGACTTCGTG GTGCGGGTGC AGGACCCGGA CACCGGGGCG GTCAGCTACG AACCGGTGGA CGCCAAGCTC ACCCGTGTCG ACGCCAAGCC GGGCCATGTG CTGCAGTTGT GCTTCTACGC CGATGCCATC GAAGCGCTGA CCGGCAGGCG TCCTGAGCAC ATGCACATCT GGCTGGGTTC CGGGCGCATG GAGACGCTGC GCGTCAGCGA CTTTCAGCCT TACTGGCGGC GGCTACAAGG CCAGCTCGCG GCGGCGCTGG CCGGGGGGCC CGCTGAGGGC ACGGTCGCCG AGCAGTGCGC GCACTGCGAA TTCTGTGAAT TCCAGCCCAT CTGTGAAGCG CAGTGGCGGG ACGCTGATTC GCTGATCTAT GTGGCCGGCA TCCGTAAACC CGACATCGCC ACGCTGGTCG AGGCGGACAT CGCCACCCTG ACCGCTCTGG CCACCAGCGA CGGCCCGGTG GACATCCTGG CTCCTGACCG CTTCACCCGA CTCCGGGGAC AAGCGGCACT GCAACTGGCG GCACGGGAAC AGAGCGACGC GCGGCCGCCA TTCGAACTGA TCGAACCCGG CGACGAGCCC TGGGGGCACG GCTTCGAAAC GCTGCCCGAA CCCGATGCCG GTGATGTCTT CCTCGACTTC GAAGGCCATC CGTTCTGGCG CGCCGACACT GGCCTGTTCT TCCTGTTTGG GCTCATCGAA CAGTCGGAGG ACCGGTGGCG GTACCGCTCC TGGTGGGCGC ATGACCCAGA CCGAGAGGCG GTAGCGGTCG ACGAGCTCGT CGACTACCTT GCTCGTCGGC GTGAGCAGTT CCCCGGCATG CACGTCTACC ACTACAACCA CACCGAACGT TCCGCCCTGC AACGCATGAC AGAGACCCAC GGTGTCGCAG AGGTCGAACT GGCCCAACTG ATCGACACCG GCGCGTTCGT GGACCTGCTG CTGGTGGCGC GCAACAGTAT TCAGGTGGGC ACCGAGTCCT ACGGGTTGAA GCACCTGGAA CGCCTCACCG ACTTCGAACG CAGCCACGAG ATCGACCAGG GCGCCGGAGC GGTCGTCCAG TACGAGCACT ATATGGCCGA ACCCAACCAG GACGATCTCG ACGCGATCGC CGCGTACAAC GAAGACGACG TCCGGGCCAC CCTCGCACTG CGCGACTGGC TGGTCGGGCA CCGCCCACCC GGGTTGCCGT GGCGGCCCGC CGTCACCGAG CCCAACCCCG AACAACCGGA ACTCGACGAG CTCGTCGTCC GCCTCCACGA ATTTCCCTCC GGCACCGACG AACACAACCT CGGCGATCTG CTGCGCTACT GGCTCGACGA ATGGCGCGCT TACATCGCGC CGAAGAAGGT GAAGCTGGCC GCGGATCCGC TCGACCTGCT CGACGACGCT GAGGTCATCG CCGACCTCGG CGGCGTGGCG TTGATCGAGC GTCTGGGGGT CCGGGGCACA CCGATCACAC CGGCGATGCG ATTCACCTTC CCCGCGCAGA ACATCGACCG GTTTCCCAGC AGCGGCGGCA AGGTGATGAT TGCCGCTCTC CCCGAGGAAA GGCGCTCCGT AGAGATCGTC CACCTCGACC GCGACGCTCT GACTATCGAT GTGGTGTGGA ACAAGGACCT CCAGGACGCC GACTGGCAGC CGCGGGTGGC CGTGCTCGAT GACTGGGTGA ACACGCAACC GAAGCCCGCG GCGCTGCAAG CCTTCGCCGA GGATCTGCTG GAAGCACGCG GCGCCAACCC GGTGACGCTC GCCCTCCTGC GTCGCGACCT ACCGCGATTC ACCGATCAAC CCCGCACCGC GTTCGCGGAC GATCTCGACG AGATGGCGGG GTGGGTCACC CGGCTCGACC ACAGCGTCGT CGCGGTGCAA GGCCCACCGG GCACCGGAAA GACCTACCGC GCAGCGCGAT TGATTCGAGC CCTAGTGTGC GCCGGCCAGC GGGTGGGCGT CACCGCTCTC AGCCACCACG CCATCGCCAA CGTGCTCGAA GGTGTGGTCA AAGCGTTCAC CGAAACCGGA GAACTCGAGC TGCTGCACGC AGTGTGCAAT GCGGGCACCA GCTCGGTGCA GCGAGTTCCC GGCGTCACTT ACGGCGACAA CGGCAAGTGC GCCCGCGACG AGTTCAACGT CGTCGCCGGC ACCACTTGGC TGTTCTCCAA CGCGCTGATG CGCAATGCTC CCGTGGATGT CCTGCTGATC GATGAAGCCG GACAGTTGGC GCTCGCCGAC GCGTTGGCGG CCTCGGGGGC GGCGCACAAC CTCGTGCTGC TCGGGGACCC GCTTCAGCTG CCGCAAGTCG CGCAAGCCAA ACACCCCGGT ATTTCCGGTC GCAGCGTGCT GGACCATGTC GTGGGCGACG ACGTGCTGCT GCCGCCGGAC CGAGGTGTCT TCCTCCACGA AACCCGGCGC ATGCATCCGG ATGTGTGTGA GTTCATCTCC ACCCAGATCT ACGACGGGCG CCTGCACAGC TTTCCGGACT GCGGCCGACA GTCGACGGTC GCGGGAACCG GGCTGCGCTG GCTGCGGGTG GATCACGCAG GAAACCGCAC GTTCTCGGTG CAGGAAGCCG ATGCGATCGC CCAGGAGCTT TCCCGGCTGA TCGACACACC GTGGACCAAC CACAAGGGCG AAACAGAGCG GCTACAGGCA GGCGATTTCA TGGTCGTCGC GCCGTACAAC CTGCAGGTCA ATACGACTCA CGCGCGACTG GCTCAGGACG CGGCGCTGCG TGACGTTCCG GTAGGCACGG TCGACAAGTT CCAGGGCCGC GAAGCCGCGG TGGTGTTCTT CAGCATGGCC GCTTCGAGTG GGGAGGACAT CACCAGGGGA GTGGAGTTCC TGTTCTCCCG CAACCGACTC AACGTCGCAG TCAGCCGCGC CCGCTGCCTC GCCTACCTCG TCTGCACCGA TGCGTTGCTG GACACCCGTG CCCGCACGGT CGAGGAAATG CGGCTCATCT CCACCCTCAA CGCGTTCGTC GACACGGCGG CACTGCACGA AAGTCGGGAG GTGTGA
|
Protein sequence | MTSAPARERL LTPSKVTAWL DCPHYLALSA RVEDGTMPRP ELRFGSFAEL LLNKGLAHEQ DCLAEYRRQE RRILEVPAKA KGQTFASWVA ETGNPLDGVH DVVYQMPFIH NGIRGVADFV VRVQDPDTGA VSYEPVDAKL TRVDAKPGHV LQLCFYADAI EALTGRRPEH MHIWLGSGRM ETLRVSDFQP YWRRLQGQLA AALAGGPAEG TVAEQCAHCE FCEFQPICEA QWRDADSLIY VAGIRKPDIA TLVEADIATL TALATSDGPV DILAPDRFTR LRGQAALQLA AREQSDARPP FELIEPGDEP WGHGFETLPE PDAGDVFLDF EGHPFWRADT GLFFLFGLIE QSEDRWRYRS WWAHDPDREA VAVDELVDYL ARRREQFPGM HVYHYNHTER SALQRMTETH GVAEVELAQL IDTGAFVDLL LVARNSIQVG TESYGLKHLE RLTDFERSHE IDQGAGAVVQ YEHYMAEPNQ DDLDAIAAYN EDDVRATLAL RDWLVGHRPP GLPWRPAVTE PNPEQPELDE LVVRLHEFPS GTDEHNLGDL LRYWLDEWRA YIAPKKVKLA ADPLDLLDDA EVIADLGGVA LIERLGVRGT PITPAMRFTF PAQNIDRFPS SGGKVMIAAL PEERRSVEIV HLDRDALTID VVWNKDLQDA DWQPRVAVLD DWVNTQPKPA ALQAFAEDLL EARGANPVTL ALLRRDLPRF TDQPRTAFAD DLDEMAGWVT RLDHSVVAVQ GPPGTGKTYR AARLIRALVC AGQRVGVTAL SHHAIANVLE GVVKAFTETG ELELLHAVCN AGTSSVQRVP GVTYGDNGKC ARDEFNVVAG TTWLFSNALM RNAPVDVLLI DEAGQLALAD ALAASGAAHN LVLLGDPLQL PQVAQAKHPG ISGRSVLDHV VGDDVLLPPD RGVFLHETRR MHPDVCEFIS TQIYDGRLHS FPDCGRQSTV AGTGLRWLRV DHAGNRTFSV QEADAIAQEL SRLIDTPWTN HKGETERLQA GDFMVVAPYN LQVNTTHARL AQDAALRDVP VGTVDKFQGR EAAVVFFSMA ASSGEDITRG VEFLFSRNRL NVAVSRARCL AYLVCTDALL DTRARTVEEM RLISTLNAFV DTAALHESRE V
|
| |