Gene Mvan_2284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2284 
Symbol 
ID4644470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2436327 
End bp2439185 
Gene Length2859 bp 
Protein Length952 aa 
Translation table11 
GC content70% 
IMG OID639805768 
Producttranscriptional regulator 
Protein accessionYP_953104 
Protein GI120403275 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.195552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCACCG AATTGCGGCT GCTCGGCGAC GTCGAGGTGC TCGTCGACGG ACGACGCCTC 
GACGTCGGTC ACGCCCGCCA GCGCTGCGTT CTGGTGGCAC TTCTGGCGGA TGTGAACCAG
CCGGTTCCCG CGGAGCAGCT CATCGACCGG GTCTGGGCCG GAGACCCGCC CCATCGTGTC
CGAAACGCCC TGGCCGGTTA TCTGTCCCGG CTGCGTGCCC TGTTCGCCGG CTCCGATGAG
GTGACGATCA CCCGCGAGCC GGGGGGCTAC ATGTTGTCGA CGGATCCCTC GGCGGTGGAC
CTCCACCGGT TCCGCCGCCT CGTCGCCGAC GCCCGTTCCA GCGCCGAACC CGCACGGGCA
GCGGATCTGT TTGACGAGGC GCTGTCGTTG TGGCGTGGGG AACTGTGCAC CACGCTGGAC
ACTCCCTGGG TGAACGAGCT GCGCACCGCC CTCGAGGTGG AGCGGCTCTC CATCGTTTCG
GAGCGCAACG ACGCCGCGTT GAACGCGGGA CGGCACGCAG AGTTGCTCGC CGACCTGGTG
GCCGCATCGC GTGCGCACCC GCTCGACGAG CGGTTGGCCG GTCAGTTGAT GCTGGCGCAG
TACGGCAGTG GGCGGCAAGC CGAGGCGCTG GACACTTACC GTCGAACGCG TCAGCGGCTC
GTCGACGAGC TCGGCGTGGA CCCCAGCCCC ACCTTGCGGG CGGCGTATCA ACGCATTCTC
GACGGCGACT CCGACCGGGC CCCGGCGACG CCGGCGGTGG GAGCGCAGGG GATTCCGCCG
GCGGATTCAC TGCCCCGGCG CGTGACGAGT TTCATCGGGC GTCGGCAGGA ACTGGCCCAC
ATCGCAGCCG CTTTGGGGCA GGGTCCGCTG CTCACCCTGA CCGGTGTCGG CGGGGCCGGC
AAAACCCGGC TGGCCCTCGA GGCGGCGACG CGTCACAAGG CCCGATTCGG CGACGGTGTC
TGGTGGTGCG AATTGGCGGC CCTGGCCGAC GACGCGGCGG TCGGCCACGC GGTGGCGGGC
GCGCTGCGCC TGCAGCAGAG GCAGGGGCTC GACATCGACG CGACGGTGAT CGAGTACCTC
GCCACGCGGG AGCTTCTGCT CGTGATCGAC AACTGCGAGC ACCTGCTCGA CGCCGCCGCG
CAGCTGATCG ACCGCATCGT TGCGCGGTGC CCGGGGGTCA CCGTGCTGGC CACCAGCCGG
GAAGCGCTCG GAGTCGCCGG AGAGCGGATC ATGCCGGTGC CGCCGCTGCC GCCGGACGAG
GCCAGCGCAC TGTTCGCCGA TCGCGCCAGG GCGGGTCGCC CTGATTTCGA CCTCGACCGT
GAGCCGGTTG GCGCCGTGGC CGAGATCTGT CGTCAACTCG ACGGTCTGCC GCTGGCGATC
GAGTTGGCGG CAGCCCGGAT CCGTGTCATG GGCAGCCTCG ACCTGGCGCG CCGGCTCGAC
GGGTTGCGTC TGCTCAGCGG CGGAGCGCGC GGCGCTTCGC CCCGCCAGCA GAGCTTGGCC
GCCACCATCG ACTGGTCCTA CCGGCTGCTC TCCGAGTCCG AACAGCAGTT GTTCGCTCGG
CTGTCGGTCT TCGCCGGTGG TTTCGACCTT GCCGGGGCGC ACGGGGTGTG CGCCGAGGAT
GCGGCAGGTG AGGAGGACAC CCTCGCGCTC CTCACCGGCC TGGTCGAGAA ATCCATCGTC
GTGCTCCGCC CCGGCACCGG CTGGACGCGG TACAGCCTGC TGGAGACGTT GCGCGCATAC
GGGCGAAACC TCTTGCGCGA AAACGCAATC GAACAGGTGT ACGCGCGCCG GCATGCGGTG
TATTTCACCG GACTGGCCGA ACGTGCCGCG GCGGGAATGC ACACTGTGGA CGAGGGCGCC
TGGGTCGACC GGATGCTGCC CGACTACGAC AACCTTCGGG TGGCCTTCGA TCGCGCGATG
GCCGACGGGG ACGTCGATCT CGCGATGCGG CTGGTCACCT CGCTGTCGGA GTTCGGACAT
CTGCGGGTGG GCTACGAGGC GTCGGAGTGG GCTGAGAGGG CGTTCGCGGT CACCGGTCCA
GACCATCCTC TGTTCGCGGC GGCTGTCGGG TTCGCCGCAC GCGGCGCGTG GAATCGCGGC
GAGGACAACC GCGTCCGGTC GCTGGCGGCT CTGGCCGGCG GCCGTAGTCC CCAGCGCGGA
AACGGTCGGG TGGCTTACCC CGGTGACGTG CTCGCCGATG TGGCGCTCTA TGAGGGCCGT
CCCGATGTCG CGCTGGCGCA TTACACCGCC GAAATGGAGC GCGCGCGCCG TGAGGCCGAT
CCCATCCGGC TGGTGTGGAC GCTGTTCTAC GTGGCGATCT GCTACGCCGC ACTGCGCACC
CCGGAAGCCG GACTGCACGC CGCGCAGGAG GCGGTCCAGG TTGCGGACAC GACCGCCAAC
CCGACGGCGC GCTCGATGGC GGGTTACGCC CTCGGCCTGG TGCTCAAGAA ATGCGAGCCC
GAGCAGGCGC TGGCGCTGTT CGACGAGGCG GCACAGTCGG CCGCGTCGGT GCGGAACTTC
TGGTGGCAGG GCATCGCGAT GATGGAAGCC GCCGCCACCC GCGCCGTGCA CGGTGATTCG
GCCAGAGCCG CAGGCGAATT CATCGCAGTG CTGGAGCACT GGGACAGGGT GGGGGACTGG
AGCCAGCAGT GGCTCAACCT GCGGTACGTG ACGCGCCTGC TGGTCCGGTT GGGCGCCACC
GAGGACGCCG CCGCGTTGCA CTGTGCACTC GTCAAGGCGG GCAAGCCGTC TCCTTTGACC
GACACCGCGG TGGCTGACCT CGGCCGGCCC GCGGCCGACG GGCTCAGTGG GGTCGACGCC
GTCAAACGCG CCTACTCGGC CCTTGCCCGG TATCGCTGA
 
Protein sequence
MATELRLLGD VEVLVDGRRL DVGHARQRCV LVALLADVNQ PVPAEQLIDR VWAGDPPHRV 
RNALAGYLSR LRALFAGSDE VTITREPGGY MLSTDPSAVD LHRFRRLVAD ARSSAEPARA
ADLFDEALSL WRGELCTTLD TPWVNELRTA LEVERLSIVS ERNDAALNAG RHAELLADLV
AASRAHPLDE RLAGQLMLAQ YGSGRQAEAL DTYRRTRQRL VDELGVDPSP TLRAAYQRIL
DGDSDRAPAT PAVGAQGIPP ADSLPRRVTS FIGRRQELAH IAAALGQGPL LTLTGVGGAG
KTRLALEAAT RHKARFGDGV WWCELAALAD DAAVGHAVAG ALRLQQRQGL DIDATVIEYL
ATRELLLVID NCEHLLDAAA QLIDRIVARC PGVTVLATSR EALGVAGERI MPVPPLPPDE
ASALFADRAR AGRPDFDLDR EPVGAVAEIC RQLDGLPLAI ELAAARIRVM GSLDLARRLD
GLRLLSGGAR GASPRQQSLA ATIDWSYRLL SESEQQLFAR LSVFAGGFDL AGAHGVCAED
AAGEEDTLAL LTGLVEKSIV VLRPGTGWTR YSLLETLRAY GRNLLRENAI EQVYARRHAV
YFTGLAERAA AGMHTVDEGA WVDRMLPDYD NLRVAFDRAM ADGDVDLAMR LVTSLSEFGH
LRVGYEASEW AERAFAVTGP DHPLFAAAVG FAARGAWNRG EDNRVRSLAA LAGGRSPQRG
NGRVAYPGDV LADVALYEGR PDVALAHYTA EMERARREAD PIRLVWTLFY VAICYAALRT
PEAGLHAAQE AVQVADTTAN PTARSMAGYA LGLVLKKCEP EQALALFDEA AQSAASVRNF
WWQGIAMMEA AATRAVHGDS ARAAGEFIAV LEHWDRVGDW SQQWLNLRYV TRLLVRLGAT
EDAAALHCAL VKAGKPSPLT DTAVADLGRP AADGLSGVDA VKRAYSALAR YR