Gene Namu_4605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4605 
Symbol 
ID8450233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5125270 
End bp5126481 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content75% 
IMG OID645043646 
Producthistidine kinase 
Protein accessionYP_003203873 
Protein GI258654717 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.506334 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGGGT CCGATACCGT GGGCGCGGTG CCCGTGTCTC ACCTGAGCCC CGTGTTCGTG 
GGGCTGCGCA CCGGGCTGCA CGTGCTGGTC GCCGGCCTGG CCCTGTTCGT GGTCGCCCAG
GCCTGGTCGG CCGGCCGCCC GTCCGCCGTC GCGATCACGC TCGTCGCCGC CGCCTTCCTG
GTGGTGTACG CGGCCGGCTT CGTCACCCGC GCCGACCGCA ACCGGATCCG GTTCTGGAGT
CGCATCTGGG TGGCCGCACT GTCGGTGCTG TGGATGGCGC TGATCTGGCT GACCCCGGAG
GCGGCCTACC TGGCGTTCCC GTTGTTCTTC CTCTACCTGG AGGTCATGCC CGGCTGGCGC
GGGCCGTTCA CCGTGGCGCT GGCCACCGTG GTGGCGATCC TGGGCAGCGG CTGGGCCGGC
GGCTGGACCG CCGGCGGCAT CCTCGGCCCG ATCATCGGCG CGGCGGTGGC GATCCTGATC
GGTGCCGGCT ACCACTCGCT GCAACGGGAG GCGGCCGAGC GGGAGCAGCT GGTCGCCGAA
CTCGTGGCCA CCCGGGCCGA GCTGGCCCAC CAGGAACGGG CGGCCGGCGC AGCCCAGGAG
CGGGCCCGGC TGGCCCGGGA CATTCACGAC ACCCTGGCCC AGGGCCTGTC CAGCATCCAG
ATGCTGCTGC AGGCCGCCGA GCGGGCCGAC CCGACCGGTC CGTCGGCCGG CCACGTCCGG
CTCGCCCTGG CCACCACCGC CGACAACCTG GCCGAGGCCC GCACTCTCGT GCGTGAACTC
ACCCCGGCCC CGCTGGCCGA CGCCGGCCTG GTCGCCGCAC TGCGCCGGCT CGCCCAGACC
CAATGGCAGC GACCGGATCT GGCCGTGACC ATCGAGGCCG ACGAGGCGCT GGACCTGCCG
ATGGCGACGC AGACGGCGCT GCTGCGCCTC GCCCAAGGGG CGATGGCCAA CGTGCTGCAG
CACGCCCGGG CGACCCGGGC CACCATCGGC ATCAGCCGCC GGCCGGCGGC GGTGCGGCTG
ACGGTGCGCG ACAACGGGAT CGGTTTCGAC GTCGACCACT GGCAGCCACC CGCCGGCGGG
CCGGATTCGT TCGGCCTGGC CGCCGCCCGC ACCCGGGTCG AGCAGCTCGG CGGCACCTTG
GAACTGACCA CCGCGCCGGG GCGGGGCACC ACCGTCACCG TCGAGCTGCC CGTCGCGATG
GTGACGGCAT GA
 
Protein sequence
MNGSDTVGAV PVSHLSPVFV GLRTGLHVLV AGLALFVVAQ AWSAGRPSAV AITLVAAAFL 
VVYAAGFVTR ADRNRIRFWS RIWVAALSVL WMALIWLTPE AAYLAFPLFF LYLEVMPGWR
GPFTVALATV VAILGSGWAG GWTAGGILGP IIGAAVAILI GAGYHSLQRE AAEREQLVAE
LVATRAELAH QERAAGAAQE RARLARDIHD TLAQGLSSIQ MLLQAAERAD PTGPSAGHVR
LALATTADNL AEARTLVREL TPAPLADAGL VAALRRLAQT QWQRPDLAVT IEADEALDLP
MATQTALLRL AQGAMANVLQ HARATRATIG ISRRPAAVRL TVRDNGIGFD VDHWQPPAGG
PDSFGLAAAR TRVEQLGGTL ELTTAPGRGT TVTVELPVAM VTA