Gene Mvan_5523 details

Gene Information

Locus tag: Mvan_5523
Symbol:
ID: 4648660
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 5903912
End bp: 5905072
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 68%
IMG OID: 639808995
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_956295
Protein GI: 120406466
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
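
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed gene length but is not stated in the record. The function names are illustrative only.

```python
# Values copied from the record; coordinates are ASSUMED 1-based and inclusive.
START_BP = 5903912
END_BP = 5905072


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1161
print(protein_length(length_bp))  # 386
```

Both results agree with the Gene Length and Protein Length fields in the record.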


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCCCG GGGGCTTCTC GTGGCCGATC GTCGTGAGCA TCGACGTGAC CCTGTTCATC 
GTCTCTGCGG TCGCCATCCT GCAACGCCCC GAGTCACAGT GGTGGATCGC AATCCTCGGC
CTCGCCGTGG CGTACTCGCC GTACCTGGTG TTCTTCGCCG TCGACATCAG TGTGTCGCCG
GCCGTCGAGG CGTTGGCGCT GGGACTGGCG TGGATGCTGG CCATTTCGAT CTGGCTGTTC
GGCATGTCGG CACCGATCGA GGGTGATTTC GCCCCGGTCC TGCTCGGACT GGTCACCGGG
GTCATCGGCT CGATGACGTC GATGCGGGGC GGAATGCTGG CCGCCGTGAC GGCGGCGGGC
ATACTCGGCG CCGCGGTCGC GATGGATCGT GTCGACACGC CCTGGCTGTA CCTGAGCTTC
ATCGGGATCG GTTGGCTGGT CGGCTATCTG ATGCGGATCC AGCAGGAGTT GCTCCTCGAA
CAGAGAGAGG CCCAGCAGAA GCTCGCCGAG CATGCGGCGG TCGACGAACG GCGGCGCATC
GCGCGCGAGG TGCACGACGT GATCGCGCAC TCGCTGACCG TGACCCTGCT GCATGTCACC
GGGGCGCGTC GCGGTCTGCA GGAGGACCGC GACGTCGACG AGGCGGTGGA GGCACTGGAG
CAGGCCGAAC GACTGGGCCG CCAGGCCATG GCCGACATCC GACGCACCGT CGGATTGCTC
GACGACGCCG TCGGCAAGGT CACCCCGGAG CCGGGTGTCG GTGACATCGC GATGCTGGTC
GACGATTTCG TGCGCGCCGG GCTGGCGGTG AGGTTCGAGT CGAGCGGTCG GCACGACCGG
ATCTCCGGGG CTGCCGGTCT CGCGCTCTAC CGCATCGCGC AGGAATCGCT GGCCAATATC
GCCAAGCATG CGCCCGACGC AGTGTCGGAG TTGTCGCTGA CGGTCTCGCC GTCCGAGGCG
GTGCTGACCG TCACGAACCG GCTGCCGGTA CCGGTCGCCG CGGCGAACTG TGCGGACGGG
CGCGGGATGC GCGGGATGCG TCAGCGCGTC CAACAGCTCG GCGGCACGGT CAATGCCGGC
CCGGACGGTG ACGGCTGGGT GGTGCACGCG ACGATCCCGC TCGACGACGA ATCGTGCGTG
TTGCGAGGGT GCGGCGGATG A
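
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any calculation. The following is a minimal sketch of how a GC percentage like the one in the record might be computed. The helper names are hypothetical, and the exact method used by the database (rounding, handling of ambiguous bases) is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input, not the gene itself: 2 of 4 bases are G or C.
print(gc_content("AT GC"))  # 50.0
```

Passing the full gene sequence text to `gc_content` gives a value to compare against the GC content field.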
 
Protein sequence
MMPGGFSWPI VVSIDVTLFI VSAVAILQRP ESQWWIAILG LAVAYSPYLV FFAVDISVSP 
AVEALALGLA WMLAISIWLF GMSAPIEGDF APVLLGLVTG VIGSMTSMRG GMLAAVTAAG
ILGAAVAMDR VDTPWLYLSF IGIGWLVGYL MRIQQELLLE QREAQQKLAE HAAVDERRRI
AREVHDVIAH SLTVTLLHVT GARRGLQEDR DVDEAVEALE QAERLGRQAM ADIRRTVGLL
DDAVGKVTPE PGVGDIAMLV DDFVRAGLAV RFESSGRHDR ISGAAGLALY RIAQESLANI
AKHAPDAVSE LSLTVSPSEA VLTVTNRLPV PVAAANCADG RGMRGMRQRV QQLGGTVNAG
PDGDGWVVHA TIPLDDESCV LRGCGG
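
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds the codon table from the standard NCBI ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it allows, which this sketch does not model. The function name `translate` is illustrative.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGATGCCCG GGGGCTTCTC GTGGCCGATC"))  # MMPGGFSWPI
```

The output matches the first block of the protein sequence. A codon containing an ambiguous base such as N would raise a `KeyError` in this sketch.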