Gene Namu_2567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2567 
Symbol 
ID8448178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2815732 
End bp2816868 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content72% 
IMG OID645041667 
Producthistidine kinase 
Protein accessionYP_003201911 
Protein GI258652755 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0076552 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00389148 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGATTC GGATGCCCCT GCGGTTGTTC TTGTCCTACG CCCTCGTGGC TTCCGTCGGA 
GCCGTCGTGG CGTACCTGAC GGTGTGGCTG CTTGCGCCGG CCCTGTTCGA TCGCCAGGTC
GGCATGATGA ACAACGGCGG CATGGGCTCG GGGTCGGGGG GCGGAGCGCA GGCCGGTTTG
CGTGACGCGT TCCGGTTCGC CCTCACCACG GCCCTGGCCG TCGGGCTCAC CGCCAGCGTG
GTGGCCGCCG TCGTCGTGGC CTGGTTCGTC ACGCGCCGGC TGATGCGCCC GCTGCACGCG
GTGCGGGCGG CCACCCGGCG GATCGCGGCG GGTGACTACC GAGTCAGCGT GCCGGTGCCG
AGGGAGCCGG AACTGGCCGC GCTGGCCACC GATGTGAACA CCCTGGGCTC CGCGCTGGCC
GACACCGAGG CCCGCCGCAC TCGGCTGCTC GGCGATGTCG CGCACGAACT GCGCACGCCG
TTGACCGCCC TGGACGGATA TGTCGAGGGC CTGATCGATG GAGTGTTCGC GCCGACTCCG
GACATCCTTG GGTCGCTCAG TGACGAGCTG CGCCGGCTGC ACCGGTTGGC CGAGGACCTT
TCCAGTCTGT CCCGGGCCCA GGAGCAGGGC CTGGATCTGC ATCCGGTCGA CGCGGACCTC
GCCGACCTTG CTCGTCGTGC GGCCGCCCGG CTGGCCCCGC AGTTCCGGGA CGCGCAGGTC
ACGTTGACCG TCCAGGCCGA CCAGATGGTG CCGGTCACCG CCGATCCCGA CCGGATCATC
CAGGTGCTGA CGAACCTGCT CGGCAACGCG CTGCTCGCCA CCGCGGCGGG CGGGGCCGTG
ACGATTGCCG CCCGCGCCGG CGCCCGGAGC GGCGAGGTCT CGGTGACCGA CACCGGGGTG
GGTCTGGCCG AGGCGGACCT CGAACGGGTC TTCGAGCGCT TCTACCGGGC GCCGGGCCAG
CCTCGCCGGT CGGCCGGGTC GGGCATCGGG CTGACCATCG CCCGGGACAT CGCCCGCGGA
CACGGGGGAA ACGTGACCGC CTCCTCACCC GGCCCCGGTC AGGGCGCCCG CTTCACGGTG
ACCATTCCGC TGCGTGCGCA TCGGCGCTCC GAAATCGCCG AGCACCTTCA GAGCTGA
 
Protein sequence
MTIRMPLRLF LSYALVASVG AVVAYLTVWL LAPALFDRQV GMMNNGGMGS GSGGGAQAGL 
RDAFRFALTT ALAVGLTASV VAAVVVAWFV TRRLMRPLHA VRAATRRIAA GDYRVSVPVP
REPELAALAT DVNTLGSALA DTEARRTRLL GDVAHELRTP LTALDGYVEG LIDGVFAPTP
DILGSLSDEL RRLHRLAEDL SSLSRAQEQG LDLHPVDADL ADLARRAAAR LAPQFRDAQV
TLTVQADQMV PVTADPDRII QVLTNLLGNA LLATAAGGAV TIAARAGARS GEVSVTDTGV
GLAEADLERV FERFYRAPGQ PRRSAGSGIG LTIARDIARG HGGNVTASSP GPGQGARFTV
TIPLRAHRRS EIAEHLQS