Gene Mvan_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2071 
Symbol 
ID4646746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2207325 
End bp2209502 
Gene Length2178 bp 
Protein Length725 aa 
Translation table11 
GC content71% 
IMG OID639805554 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_952892 
Protein GI120403063 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.249392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.37167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGACCC GCTTCCTGGA ACGCGTCGTG CGGTCGCCGG CGCGGCCACT CGGCCTCGGG 
ATCGCAGTGG CGGCCGGGTT CTTGGCGGCC GAGGTGCTCG CCGTGTTCGC ACTGAAGAAG
ATCGCGCCCG AGAACGCGTT CGGCGCGCTG CTCCTGCTGG GCGTGCTGGT GGTGTCGGCC
GGCTGGGGAT TCGGATTGTC GATCGCCACC TCGCTGGCCA GCGCCGCCGT CTACGCCTAC
CTCCATCTGG AGGGCCGCGA CAGCCTCGCC CCTGCCCTGA TCATCTTCCT CACGCTCGCG
CTTCTCACCA ATGCCCTTGT CGGGCAAGCA CGCCTGCGTG CCGCCGAAGC CGAGCAGCGG
CGGCGGGAAG CCGACCTCTC TGCGGACCTG GCCCGCGTCA TGCTGCGGGC ACCGGCGCTG
GGGCCCGCGC TGGAGGACGC GGGGCGGCGC TTCGCCGACG TGCTCGGATT GCCGTACGCG
ACGCTGACGG TCGGTGGCGG CGCGACCGGC GCCGACGAGA TGGCAATCGA CCTGGTCGAC
GGGGCCGAAC ACACCGGAAC CCTGCTCGTG CCGAGGGATC TGCCTTTGGC GTCGGTTCGG
CGGATCCACC GGATGGTGCC GTCCCTGCAG GCTCTGCTGG CTGCCGCATG CGACCGCGAA
GACATCACCG CGGAGCTCGA AGCCAGCCGC CGCGAGCTGG AACGGTTCTT CGCCGTGGCG
TCGGATCTGC TGTTCATCGG CACGTACGTC GACGGCGTCG CACAGCTGAC ACGGGTGAAC
CCGGCGTTCG AGCGGGCACT CGGATACTCG GCGGCAGAAC TGGCGGCACG GCCACTGACC
GAGTTCATCC ATCCCGACGA CCGCGACGGC ACGGCCGCCG CGCTCGACTC GGTGCCACAG
ACGGACGGCG CCACGCAGTT CGAGAACCGC AGTCTGCGTC GGGACGGCGG TGTGCGCTGG
CTGGAGTGGA ACGTCGTCTC CGACCGCGGT GTGCTGCTCG GCGGCGCCCG CGACGTCACC
GAACGCAGGC GGGAACAGGA CCGGCTGCGA ATGGCAAGGA CTCAGCAGGC GGCGCTGCGT
AGAGTCGCGA CGTTGGTGGC GCGCGGAGCC CCGCTGTCCG AGGTCTACGA CGTGGCGGTC
ACCGAGCTGG CGCACAGCCT CGGCGTCAAC CATGTCACCC TGTTGGCCTT CGAGGCCGAT
GACCACGCCG TGGTCCGGGC GGCGCTGAAG TCCGCGCATC AGCCCGGTTT CGCTGTGGGA
GATCGGCTCT CGCTCGACGG TGGAAGCATC AGCGAGCAGG TGCACCGCAC GGGCCTACCG
GCCCGCATCG ACGACTACAG CGAGGTGCCC GGGCGGATCG CCACCCGTCT GCGCGAGCTC
GGGATCCGCT CCGCCGCGGG CTCCCCGCTG ACCGTCGACG GCGGGACCCG AGGCGTGCTG
GTGGTGGGAT CACATGCCGT GCAGGGCGTT CCGGATGGCA CCGAGGCCCA CATCGGCGAC
TTCGCCGATC TCATCTCGAC GGCGATCGCC AACGCCGAGA CCCGCGCCGA GCTCACCGCG
TCACGCGCCA GGATCGTGGC GGCCGCCGAC GAGGCCCGAC GCGGCTTCGA ACGCGATCTG
CACGACGGCG CCCAGCAGCG GATCGTGTCG TTGAGCCTGC AACTGCGCGA GGCCGAGGCG
GCCGTCGAGG GAGACGAGGC GCTGCGGACC CAGCTGTCGA CTGTGGTCAA CGGTCTGGCC
GGGCTGCACT CCGATCTGCA GGAGCTGTCC CGCGGACTGC ATCCGGCGGT GCTGTCGCGC
GGCGGCCTGA AGCCGGCGAT GCGCAACCTG GCGCGCCGCT CCACGGTGCC GGTCGAGTTG
ACCGTCGACA TCGACCGCCG TCTGCCGGAG CCGGTCGAAG TCGCGGCGTA TTACGTTGTC
GCAGAATCGT TGACGAATGT CGCCAAGCAC GCTCAAGCCG ACTCGGTGAC GGTGGAGATC
GGGCTCGACG ACGCCCCGGA CGGCGGCACG CTGCTGCGGC TGTCGGTCAC CGACGACGGC
ACCGGCGGCG CGTCGGCCGA CGGCGGATCC GGCCTGGTCG GCCTGCGCGA CCGGGTCGAG
GCGCTGTCGG GTCAGCTGAC GGTGACCAGC CGACCCGGTG ACGGGACGAG CATCAGCGCG
ACAATCCCGG TCGACTAG
 
Protein sequence
MPTRFLERVV RSPARPLGLG IAVAAGFLAA EVLAVFALKK IAPENAFGAL LLLGVLVVSA 
GWGFGLSIAT SLASAAVYAY LHLEGRDSLA PALIIFLTLA LLTNALVGQA RLRAAEAEQR
RREADLSADL ARVMLRAPAL GPALEDAGRR FADVLGLPYA TLTVGGGATG ADEMAIDLVD
GAEHTGTLLV PRDLPLASVR RIHRMVPSLQ ALLAAACDRE DITAELEASR RELERFFAVA
SDLLFIGTYV DGVAQLTRVN PAFERALGYS AAELAARPLT EFIHPDDRDG TAAALDSVPQ
TDGATQFENR SLRRDGGVRW LEWNVVSDRG VLLGGARDVT ERRREQDRLR MARTQQAALR
RVATLVARGA PLSEVYDVAV TELAHSLGVN HVTLLAFEAD DHAVVRAALK SAHQPGFAVG
DRLSLDGGSI SEQVHRTGLP ARIDDYSEVP GRIATRLREL GIRSAAGSPL TVDGGTRGVL
VVGSHAVQGV PDGTEAHIGD FADLISTAIA NAETRAELTA SRARIVAAAD EARRGFERDL
HDGAQQRIVS LSLQLREAEA AVEGDEALRT QLSTVVNGLA GLHSDLQELS RGLHPAVLSR
GGLKPAMRNL ARRSTVPVEL TVDIDRRLPE PVEVAAYYVV AESLTNVAKH AQADSVTVEI
GLDDAPDGGT LLRLSVTDDG TGGASADGGS GLVGLRDRVE ALSGQLTVTS RPGDGTSISA
TIPVD