Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_2284 |
Symbol | |
ID | 4644470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | + |
Start bp | 2436327 |
End bp | 2439185 |
Gene Length | 2859 bp |
Protein Length | 952 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639805768 |
Product | transcriptional regulator |
Protein accession | YP_953104 |
Protein GI | 120403275 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.195552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCACCG AATTGCGGCT GCTCGGCGAC GTCGAGGTGC TCGTCGACGG ACGACGCCTC GACGTCGGTC ACGCCCGCCA GCGCTGCGTT CTGGTGGCAC TTCTGGCGGA TGTGAACCAG CCGGTTCCCG CGGAGCAGCT CATCGACCGG GTCTGGGCCG GAGACCCGCC CCATCGTGTC CGAAACGCCC TGGCCGGTTA TCTGTCCCGG CTGCGTGCCC TGTTCGCCGG CTCCGATGAG GTGACGATCA CCCGCGAGCC GGGGGGCTAC ATGTTGTCGA CGGATCCCTC GGCGGTGGAC CTCCACCGGT TCCGCCGCCT CGTCGCCGAC GCCCGTTCCA GCGCCGAACC CGCACGGGCA GCGGATCTGT TTGACGAGGC GCTGTCGTTG TGGCGTGGGG AACTGTGCAC CACGCTGGAC ACTCCCTGGG TGAACGAGCT GCGCACCGCC CTCGAGGTGG AGCGGCTCTC CATCGTTTCG GAGCGCAACG ACGCCGCGTT GAACGCGGGA CGGCACGCAG AGTTGCTCGC CGACCTGGTG GCCGCATCGC GTGCGCACCC GCTCGACGAG CGGTTGGCCG GTCAGTTGAT GCTGGCGCAG TACGGCAGTG GGCGGCAAGC CGAGGCGCTG GACACTTACC GTCGAACGCG TCAGCGGCTC GTCGACGAGC TCGGCGTGGA CCCCAGCCCC ACCTTGCGGG CGGCGTATCA ACGCATTCTC GACGGCGACT CCGACCGGGC CCCGGCGACG CCGGCGGTGG GAGCGCAGGG GATTCCGCCG GCGGATTCAC TGCCCCGGCG CGTGACGAGT TTCATCGGGC GTCGGCAGGA ACTGGCCCAC ATCGCAGCCG CTTTGGGGCA GGGTCCGCTG CTCACCCTGA CCGGTGTCGG CGGGGCCGGC AAAACCCGGC TGGCCCTCGA GGCGGCGACG CGTCACAAGG CCCGATTCGG CGACGGTGTC TGGTGGTGCG AATTGGCGGC CCTGGCCGAC GACGCGGCGG TCGGCCACGC GGTGGCGGGC GCGCTGCGCC TGCAGCAGAG GCAGGGGCTC GACATCGACG CGACGGTGAT CGAGTACCTC GCCACGCGGG AGCTTCTGCT CGTGATCGAC AACTGCGAGC ACCTGCTCGA CGCCGCCGCG CAGCTGATCG ACCGCATCGT TGCGCGGTGC CCGGGGGTCA CCGTGCTGGC CACCAGCCGG GAAGCGCTCG GAGTCGCCGG AGAGCGGATC ATGCCGGTGC CGCCGCTGCC GCCGGACGAG GCCAGCGCAC TGTTCGCCGA TCGCGCCAGG GCGGGTCGCC CTGATTTCGA CCTCGACCGT GAGCCGGTTG GCGCCGTGGC CGAGATCTGT CGTCAACTCG ACGGTCTGCC GCTGGCGATC GAGTTGGCGG CAGCCCGGAT CCGTGTCATG GGCAGCCTCG ACCTGGCGCG CCGGCTCGAC GGGTTGCGTC TGCTCAGCGG CGGAGCGCGC GGCGCTTCGC CCCGCCAGCA GAGCTTGGCC GCCACCATCG ACTGGTCCTA CCGGCTGCTC TCCGAGTCCG AACAGCAGTT GTTCGCTCGG CTGTCGGTCT TCGCCGGTGG TTTCGACCTT GCCGGGGCGC ACGGGGTGTG CGCCGAGGAT GCGGCAGGTG AGGAGGACAC CCTCGCGCTC CTCACCGGCC TGGTCGAGAA ATCCATCGTC GTGCTCCGCC CCGGCACCGG CTGGACGCGG TACAGCCTGC TGGAGACGTT GCGCGCATAC GGGCGAAACC TCTTGCGCGA AAACGCAATC GAACAGGTGT ACGCGCGCCG GCATGCGGTG TATTTCACCG GACTGGCCGA ACGTGCCGCG GCGGGAATGC ACACTGTGGA CGAGGGCGCC TGGGTCGACC GGATGCTGCC CGACTACGAC AACCTTCGGG TGGCCTTCGA TCGCGCGATG GCCGACGGGG ACGTCGATCT CGCGATGCGG CTGGTCACCT CGCTGTCGGA GTTCGGACAT CTGCGGGTGG GCTACGAGGC GTCGGAGTGG GCTGAGAGGG CGTTCGCGGT CACCGGTCCA GACCATCCTC TGTTCGCGGC GGCTGTCGGG TTCGCCGCAC GCGGCGCGTG GAATCGCGGC GAGGACAACC GCGTCCGGTC GCTGGCGGCT CTGGCCGGCG GCCGTAGTCC CCAGCGCGGA AACGGTCGGG TGGCTTACCC CGGTGACGTG CTCGCCGATG TGGCGCTCTA TGAGGGCCGT CCCGATGTCG CGCTGGCGCA TTACACCGCC GAAATGGAGC GCGCGCGCCG TGAGGCCGAT CCCATCCGGC TGGTGTGGAC GCTGTTCTAC GTGGCGATCT GCTACGCCGC ACTGCGCACC CCGGAAGCCG GACTGCACGC CGCGCAGGAG GCGGTCCAGG TTGCGGACAC GACCGCCAAC CCGACGGCGC GCTCGATGGC GGGTTACGCC CTCGGCCTGG TGCTCAAGAA ATGCGAGCCC GAGCAGGCGC TGGCGCTGTT CGACGAGGCG GCACAGTCGG CCGCGTCGGT GCGGAACTTC TGGTGGCAGG GCATCGCGAT GATGGAAGCC GCCGCCACCC GCGCCGTGCA CGGTGATTCG GCCAGAGCCG CAGGCGAATT CATCGCAGTG CTGGAGCACT GGGACAGGGT GGGGGACTGG AGCCAGCAGT GGCTCAACCT GCGGTACGTG ACGCGCCTGC TGGTCCGGTT GGGCGCCACC GAGGACGCCG CCGCGTTGCA CTGTGCACTC GTCAAGGCGG GCAAGCCGTC TCCTTTGACC GACACCGCGG TGGCTGACCT CGGCCGGCCC GCGGCCGACG GGCTCAGTGG GGTCGACGCC GTCAAACGCG CCTACTCGGC CCTTGCCCGG TATCGCTGA
|
Protein sequence | MATELRLLGD VEVLVDGRRL DVGHARQRCV LVALLADVNQ PVPAEQLIDR VWAGDPPHRV RNALAGYLSR LRALFAGSDE VTITREPGGY MLSTDPSAVD LHRFRRLVAD ARSSAEPARA ADLFDEALSL WRGELCTTLD TPWVNELRTA LEVERLSIVS ERNDAALNAG RHAELLADLV AASRAHPLDE RLAGQLMLAQ YGSGRQAEAL DTYRRTRQRL VDELGVDPSP TLRAAYQRIL DGDSDRAPAT PAVGAQGIPP ADSLPRRVTS FIGRRQELAH IAAALGQGPL LTLTGVGGAG KTRLALEAAT RHKARFGDGV WWCELAALAD DAAVGHAVAG ALRLQQRQGL DIDATVIEYL ATRELLLVID NCEHLLDAAA QLIDRIVARC PGVTVLATSR EALGVAGERI MPVPPLPPDE ASALFADRAR AGRPDFDLDR EPVGAVAEIC RQLDGLPLAI ELAAARIRVM GSLDLARRLD GLRLLSGGAR GASPRQQSLA ATIDWSYRLL SESEQQLFAR LSVFAGGFDL AGAHGVCAED AAGEEDTLAL LTGLVEKSIV VLRPGTGWTR YSLLETLRAY GRNLLRENAI EQVYARRHAV YFTGLAERAA AGMHTVDEGA WVDRMLPDYD NLRVAFDRAM ADGDVDLAMR LVTSLSEFGH LRVGYEASEW AERAFAVTGP DHPLFAAAVG FAARGAWNRG EDNRVRSLAA LAGGRSPQRG NGRVAYPGDV LADVALYEGR PDVALAHYTA EMERARREAD PIRLVWTLFY VAICYAALRT PEAGLHAAQE AVQVADTTAN PTARSMAGYA LGLVLKKCEP EQALALFDEA AQSAASVRNF WWQGIAMMEA AATRAVHGDS ARAAGEFIAV LEHWDRVGDW SQQWLNLRYV TRLLVRLGAT EDAAALHCAL VKAGKPSPLT DTAVADLGRP AADGLSGVDA VKRAYSALAR YR
|
| |