Gene Amir_5034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_5034 
Symbol 
ID8329232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5993160 
End bp5994656 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content70% 
IMG OID644945470 
Producthistidine kinase 
Protein accessionYP_003102702 
Protein GI256379042 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTACC GCGACCGCTG GCCCGCGAGC AGGCTGCTGG CCGCGGCCGT GGCCATCCTG 
ATCCTCGTCT CGGGCGGGGC GGTCACCGCG ATCGGCTTCG CGCTCAACTC GCTGGACAAG
CAGCGCGCGG TGGTGCTCGA CCAGATCGGG CCCGCGAGGC GGTTGGTGAT CGAGATGGAC
GCGGCCCTGG TCAACCAGGA GACCGGGCTC CGCGGGTACG CGCTGTCGGC GTCCTCGGAC
TTCCTGACGC CGTACACGCG CGGCATCGAG GCCGAGGAGA ACGCGGCGCG GGGCGTGACG
TCGGCGCTGC GGACCACGCG GCCGGACCTG CTGGCGCGGC TGGCCGAGAC GCGCGGCATC
GCGCAGGACT GGCGCGAGCA GTACGCCGAG CGGCTGATCG CGCAGGTCAC CGAGAACGGG
CAGCCGCAGC CGGGCGTGGG CGGCACGACC GTGGGCAAGG AGCTGTTCGA CCAGGTCAGG
CGGTCGGTGA ACCAGCTCCA GGCGGACATG ACCCGCGACG TGGACGCGGC GCGGGCCGAG
CTGGACCTCA CGGCCGACCG GCTGCTGTGG CTGTGCATCG CGCTGGGGAC GCTGCTGGCG
GTGATCATCG CGGGGGTCGC GGTGGTGCTG CACCGCATCC TGATCCGGCC GCTGGCCACG
CTCGCCGCGC AGGTGCGGGA GGTGTCCGAG GGCGACTACG GGCACGCGGT GGAGACCAGC
GGGCCGCGCG AGACGGTCAT GCTGGCCGAG GACGTGGACG CGATGCGCAG GCGGATCGTG
TCCGACCTGA AGGACCTGCA GCGGTCCAAC GCGGAGCTGG AGCAGTTCGC GTACGTGGCC
TCGCACGACC TGCAGGAGCC GCTGCGGAAG GTCGCGAGCT TCTGCCAGCT GCTGGAGCGG
CGCTACTCGG GGCAGCTGGA CGCGCGCGGC GAGCAGTACA TCCAGTTCGC GGTCGACGGG
GCCAAGCGGA TGCAGGTGCT GATCAACGAC CTGCTGGCGT TCTCCAGGGT CGGGCGGATC
ACCCGCGAGC AGACGATGGT GGACTGCGGC GAGCTGGTGG ACCAGGTGGT GGACAGCTAC
TCGGAGGTGA TCACCAAGAC GAGCGCGGTG GTCACGCACA GCGGGTTGCC GACCGTGCGC
GGGGAGTCGT CGCTGCTCAG CGGGGTGTTC GGGAACCTGA TCAGCAACGG GCTGAAGTTC
CACGGGGAGC AGCCGCCGAG GATCGACATC GGGGTGGAGC GAACGGGTAA GTTCTGGACG
TTCACCGTGA CCGACAACGG GATCGGCATC GATCCGGAGT ACGCTGAGCG GATCTTCGTG
ATCTTCCAGA GGTTGCACCA CCGGGACGAC TACCCTGGCA CGGGGATCGG GCTCTCGATG
TGCCGCAAGA TCGTCGAATA CCACGGCGGG ACGATCTGGC TGGAGACGGC CGAGAGCCCT
GGCACGACCT TCAAGTTCAC CCTGCCCGTC GTAGAGGAGA CCGGGGACAA GCGATGA
 
Protein sequence
MNYRDRWPAS RLLAAAVAIL ILVSGGAVTA IGFALNSLDK QRAVVLDQIG PARRLVIEMD 
AALVNQETGL RGYALSASSD FLTPYTRGIE AEENAARGVT SALRTTRPDL LARLAETRGI
AQDWREQYAE RLIAQVTENG QPQPGVGGTT VGKELFDQVR RSVNQLQADM TRDVDAARAE
LDLTADRLLW LCIALGTLLA VIIAGVAVVL HRILIRPLAT LAAQVREVSE GDYGHAVETS
GPRETVMLAE DVDAMRRRIV SDLKDLQRSN AELEQFAYVA SHDLQEPLRK VASFCQLLER
RYSGQLDARG EQYIQFAVDG AKRMQVLIND LLAFSRVGRI TREQTMVDCG ELVDQVVDSY
SEVITKTSAV VTHSGLPTVR GESSLLSGVF GNLISNGLKF HGEQPPRIDI GVERTGKFWT
FTVTDNGIGI DPEYAERIFV IFQRLHHRDD YPGTGIGLSM CRKIVEYHGG TIWLETAESP
GTTFKFTLPV VEETGDKR