Gene Namu_4222 details


Gene Information

Locus tag: Namu_4222
Symbol:
ID: 8449848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4673366
End bp: 4674667
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 72%
IMG OID: 645043271
Product: histidine kinase
Protein accession: YP_003203500
Protein GI: 258654344
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
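
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4673366
end_bp = 4674667
gene_length_bp = 1302
protein_length_aa = 433

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons = span // 3

assert span == gene_length_bp            # 1302 bp
assert span % 3 == 0                     # no partial codon
assert codons - 1 == protein_length_aa   # 434 codons minus one stop codon = 433 aa

print(span, codons)
```

Under these assumptions the reported gene length, coordinates, and protein length agree with each other.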


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0052969
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0674921
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACGCC GGGTTCTGGT GTCGATCCTG CTGGTCATCG CGGCGACCGT TCTGACCCTG 
GGCGTCCCGC TCTCGATCGT CTCCTGGCGA GTCGTCGACG ACCTGATGCG CAGCGACCTG
AACAGTCGCC TGGATTCGAT CGCCGCCTCG ATCGCCAACC AGTCCAGCTC GTCGGCCATC
GATCTGGGGC AGTTGGCCGC GGCGGTGCCC TCCGGCGGCC GGCTGGACAT CACCATGGCC
GGCCGGATCG ACCAGTCCAT CGGGACCACC TCAACCAGCC AGGTCTACTC CGAGCAACTG
GCGATGGCCG GTGGCGGCAC CCTGCGGTTG TCGGTGCCAG AGAGCTATCT GCGGGCCGAG
CAGTGGAAGG CGTTGGCGTT GGTCGCGCTG GCCATCGCGC TGTCCGTGGT GGTCGGTACT
GGGGTCGCGG TGCTGACCGC TCGCCGCCTG GCGACCCCGT TGACCGATGT CGCGCGCCGG
GCCGCCCGGC TGGGCTCGGG CGACTTCCGG ACCTTCCGGC GCCGGTACGA CATTCCGGAG
CTGGACCGGG TCGCGGACGT CCTGGACTCC TCGGCGCACG ACATCTCCGC GCTGATCGCC
CGGGAGCGGG ACCTGGCCGG TGACATCTCG CACCAGCTGC GGACCCGGCT CACCGGGTTG
CGGTTGCGGC TGGAGGAGCT CGCCGAGTAC CCGGACGCCG ACGTGCAGCA GGAGGTGCAG
GAGGCCCTGG AACAGACCGA TCGCCTGGTC ACGGTGGTCG ACGACCTGCT GGCCAATGCC
CGGTCGCAGC GCGCGGCCGG GGCCAGCGAG TTGGAGCTGT TCGACGAGCT GGCCGAGATC
GAGGCGGAGT GGGGGCCGGC GCTCACCGCG GCGGGCCGCA CGCTGACGGT GCGCTGCGGC
CGGGACGTGC GGGTGCACGC CACTCCGGGG CGGCTGCGCG AGGCCATCGG GGTGTTGGTG
GAGAACTCAC TGCGGCACGG CGCGGGCACG GTCGGGGTGC TGGTCCGGCC GGCCGGCCGG
GGCTCGGGCG GCATGGTGGT GCTGGAAGTC AGCGACGAGG GGCCGGGCAT TCCCGAGGCA
CTGGTCGCGC ACATCTTCGA TCGGGGGGTG TCGACCGCGT CGTCCACCGG GATCGGGCTG
GGGCTGGCCC GGGCGTTCGT CGAGGCCGAC GGGGGACGGT TGGAGCTGCG GCGCGCCGTC
CCGCTGACCT TCGCCATCTT CCTGGTGGTC AGCGAAGAAC CGTCCACGCC GGATGCGGAG
GGCGACCGGG TGCCGGGGCC GGCCGGCGCC CCCGTCGGGT AG
 
Protein sequence
MQRRVLVSIL LVIAATVLTL GVPLSIVSWR VVDDLMRSDL NSRLDSIAAS IANQSSSSAI 
DLGQLAAAVP SGGRLDITMA GRIDQSIGTT STSQVYSEQL AMAGGGTLRL SVPESYLRAE
QWKALALVAL AIALSVVVGT GVAVLTARRL ATPLTDVARR AARLGSGDFR TFRRRYDIPE
LDRVADVLDS SAHDISALIA RERDLAGDIS HQLRTRLTGL RLRLEELAEY PDADVQQEVQ
EALEQTDRLV TVVDDLLANA RSQRAAGASE LELFDELAEI EAEWGPALTA AGRTLTVRCG
RDVRVHATPG RLREAIGVLV ENSLRHGAGT VGVLVRPAGR GSGGMVVLEV SDEGPGIPEA
LVAHIFDRGV STASSTGIGL GLARAFVEAD GGRLELRRAV PLTFAIFLVV SEEPSTPDAE
GDRVPGPAGA PVG
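
The sequences above are displayed in groups of 10 residues, 60 per line, so they need to be cleaned before any downstream use. The following is a minimal sketch of that cleanup together with a GC-fraction helper. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence from this record.
first_line = "ATGCAACGCC GGGTTCTGGT GTCGATCCTG CTGGTCATCG CGGCGACCGT TCTGACCCTG"
last_line = "GGCGACCGGG TGCCGGGGCC GGCCGGCGCC CCCGTCGGGT AG"

first_clean = clean_sequence(first_line)
last_clean = clean_sequence(last_line)

# Each full display line holds 60 bases; the gene starts with ATG and ends with TAG.
assert len(first_clean) == 60
assert first_clean.startswith("ATG")
assert last_clean.endswith("TAG")

print(len(first_clean), len(last_clean))
```

Passing the complete cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content reported in the Gene Information section.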