Gene Mkms_1400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1400 
Symbol 
ID4614231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1501440 
End bp1502930 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content70% 
IMG OID639791075 
Productsignal transduction histidine kinase 
Protein accessionYP_937402 
Protein GI119867450 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCC TCGGTGACCT GCTCGCCGAG CACACCGTGC TGCCCGGCAA CGCCGTCGAC 
CATCTGCATG CGGTGGTCGG GGAGTGGCAG CTGCTGGCCG ACCTGTCCTT CGCCGATTAC
CTGATGTGGG TACGCCGCGA CGACGGGGTG CTGGTCTGTG TGGCGCAGGT CAGGCCCAAC
ACCGCCGCCA CCGTCCTGTT GGCCGACGCG GTGGGGACCA CTGCGATCCC CGAGACCATG
CCGTTGGTCA CGGCCACCTT CGAGTCGGGC ATCATCGGCC GGGAAGGTCC TGCCGCGCAG
AACGACGCGC CCGGCCTGAA CGTCGAAGCG GTGCCGGTGC GCTACCGCGA AGACGTCGTC
GCGGTGCTGA CCCATCAGAC CGCCCTGGCG GCCCGGCGGA CCGCGAGTCC GCTGGAGGTC
GCCTATCTGG ACTGCGCCGG CGATCTGCTG ATCATGCTCT CCGAGGGGAC CTTCCCCAAC
GTCGGTGACC TGGCCATGTC ACGTTCCAGC CCGCGGGTGG GGGACGGCTT CATCCGGTTG
GACGGCGCGG GCAACGTGGT GTTCGCCAGC CCCAACGCGA TCTCGGCGTA TCACCGGATG
GGGCTGACCG CCGACCTCGA GGGCCACAAC CTGGTGGCGG TGACGCGTCC GCTCATCTCC
GACCCGTTCG AGGCGCAGGA ACTGGCCAAC CATGTGCGGG ATTCGCTCGC CGGTGGGTCC
AGCATGCGGA TGGAGGTCGA TGCCGGCGGT GCGGCGGTGC TGCTGCGCAC GATGCCGCTC
GTGGTGCACG GCGCGGCGGT GGGCGCGGCC GTGCTGATCC GCGACGTGAC CGAGGTGAAG
CGGCGCGACC GTGCGCTGCT GTCGAAGGAC GCGACGATCC GTGAGATCCA TCACCGGGTC
AAGAACAATC TGCAGACGGT GGCCGCGCTG CTGCGTCTGC AGGCCCGGCG CACGAACAAC
GCCGAGGGGC GCGAGGCGTT GATGGAGTCC GTGCGCCGGG TGTCGTCGAT CGCCCAGGTG
CACGACGCGC TGTCGATGTC GGTGGACGAG GAGGTCAACC TCGACGAGGT CGTCGACCGC
ATCCTGCCGA TCATGAACGA CGTCGCCAGC GTCGGCCCGC CGATCCGGAT CAAGCGCGAA
GGTGACCTCG GCGTGCTCGA CGCCGATCGG GCGACGGCGC TGATCATGGT GATCACCGAG
GTGGTGCAGA ACGCCATCGA GCACGCCTTC GATGCCAGCA CCGCGCAGGG CAGCGTCACG
ATCCGCGCCG AGCGCTCGGC GCGGTGGCTC GACGTGGTGG TGCACGACGA CGGGCGCGGT
CTACCGACCG GCTTCAGCCT GGAGAAGTCC GACCGGCTGG GGTTGCAGAT CGTGCGGACG
CTGGTGTCGG CGGAGTTGGA CGGATCGCTC GGGATGCATG ACGTGCCCAG TGGGGGGACG
GATGTGGTGC TGCGGGTCCC GATCGGTCGC CGCGGCCGAG CCGCGCAGTA G
 
Protein sequence
MSTLGDLLAE HTVLPGNAVD HLHAVVGEWQ LLADLSFADY LMWVRRDDGV LVCVAQVRPN 
TAATVLLADA VGTTAIPETM PLVTATFESG IIGREGPAAQ NDAPGLNVEA VPVRYREDVV
AVLTHQTALA ARRTASPLEV AYLDCAGDLL IMLSEGTFPN VGDLAMSRSS PRVGDGFIRL
DGAGNVVFAS PNAISAYHRM GLTADLEGHN LVAVTRPLIS DPFEAQELAN HVRDSLAGGS
SMRMEVDAGG AAVLLRTMPL VVHGAAVGAA VLIRDVTEVK RRDRALLSKD ATIREIHHRV
KNNLQTVAAL LRLQARRTNN AEGREALMES VRRVSSIAQV HDALSMSVDE EVNLDEVVDR
ILPIMNDVAS VGPPIRIKRE GDLGVLDADR ATALIMVITE VVQNAIEHAF DASTAQGSVT
IRAERSARWL DVVVHDDGRG LPTGFSLEKS DRLGLQIVRT LVSAELDGSL GMHDVPSGGT
DVVLRVPIGR RGRAAQ