Gene Mvan_4868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4868 
Symbol 
ID4643846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5211873 
End bp5212964 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content71% 
IMG OID639808339 
ProductMarR family transcriptional regulator 
Protein accessionYP_955647 
Protein GI120405818 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.440159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.306915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCGGC CCGCACTGTT CACCCCGTCG GCGTCGGACA CCGCCGCGCT GGAAGCCCTG 
ACCGTCGGAC GCACCGACCT GCTGGACACG TTGACCCAGC GCATCCGATC ATCGGCGCGC
GACGGCTCCC GGCCCCATAC GCTACTGGTG GCGCCCCGCG GGGCGGGCAA GACACATACG
CTGCGTGTCG TTCTCGGCCG CACGCTTTCC GATGCGGCAA CAGCGAGACA TGTTCTGCCG
GTGATGGTTT CGGAGGATTC GCTGGCGATC GGCTCCTATG CCGACCTGCT CGTCGAGGCG
GCGCGCGCGA TCGGGCCGCA GGTGGCCGAG CAGGCACGGC AGCTGCGCCG CGCGCGGGAC
CCGGTGGGCA TCGAAGCGGC AATCATCGAG GCCGCCGCGG GCAGGATGAT CCTGCTGGCG
ATCGAGAACC TGGACCGCGT GTTCGACGCC ATCGGCGCGA CAGGGCAGGG CAGCCTGCGG
GCGTGGGTCG AAACCTCGAC GACCGTACTG GTTCTCGGGA CATCGCCGGA GCTGTTTCCC
GGTGTCTCGT CGCGGGAGTA CCCGTGGTAC GGCTCGTTCA TGATCGAGAC GTTGCCGGAG
CTGACGGCGC AGGACGCCGG CGAATTGCTG CGCGGCGCGG CGCGGCGACG CGGTGACACC
GACTTCCACA GGTTCCTCCA GTCCCCGGAC GGCCGGGCCC GGGTCGCGGC CGTGCACAAG
ATCATCGGCA GCACCCCACG GATGTGGCAT CTGCTCGCCG AATGCGCCGA CGCGCCCAGC
CTCGACGCGG TCACGCCCGC CGTCAACGCC CTGCTCGACC GTCTCGCACC GCAATACCAG
CAGCGGCTGT GGCAGCTGCC CGCGGGCGAA CAGCGTCTGG TCGTCGAGCT GGCCCGGGGT
TCGGAACCCC GTTCGGTGTC CGAACTCGCC GAGGCCGTGG GGGTTTCGAA TCAATCCGCG
TCGGCGGCAC TCGGGCGGCT GGCCGCCGAG GGCTGGGTGA CGTCGTCGAA GGCCGCCGGT
GATCGGCGCA CGTCCTGGTA CGACCTCACC GACCCGCTGC TGCGCAGCTA CCTGCAGTAC
CGCGGCGGCT GA
 
Protein sequence
MTRPALFTPS ASDTAALEAL TVGRTDLLDT LTQRIRSSAR DGSRPHTLLV APRGAGKTHT 
LRVVLGRTLS DAATARHVLP VMVSEDSLAI GSYADLLVEA ARAIGPQVAE QARQLRRARD
PVGIEAAIIE AAAGRMILLA IENLDRVFDA IGATGQGSLR AWVETSTTVL VLGTSPELFP
GVSSREYPWY GSFMIETLPE LTAQDAGELL RGAARRRGDT DFHRFLQSPD GRARVAAVHK
IIGSTPRMWH LLAECADAPS LDAVTPAVNA LLDRLAPQYQ QRLWQLPAGE QRLVVELARG
SEPRSVSELA EAVGVSNQSA SAALGRLAAE GWVTSSKAAG DRRTSWYDLT DPLLRSYLQY
RGG