Gene Sare_4857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4857 
Symbol 
ID5707596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5512596 
End bp5513921 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content73% 
IMG OID641274253 
Producthistidine kinase 
Protein accessionYP_001539598 
Protein GI159040345 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0991376 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTGGG CGCTCAACCG GCTGGCGCTG GCCATCACGT CGATGGTGGC ACTGGCCTTC 
CTCGTGCCGC TCGCGGTGGT GACCCGCCAG CTGGCGCACG ACAGGGCCAT CGGTGATGCC
CGCCAGCAGG CTGCCGCGAT GGTGGCGGCG CTCGCCGTGG ACGAGGATCC GCACCTGCTG
ACGCGCGCGG TGATGAGCAC CACCGCCGGC AGCGAGGGAC GGCTCGCCGT GCACCTGCCC
GACGTGGCCC CGGTCGGGGT GGTGCACGCC ACCGCCACCG ACGTGGCGCT CGCCGCCGGA
TACCGGCGTC CGGTCACCGC GGACACGAGC GGCGGTCTGG CCTACCTGCT GCCGACGGTG
ATCAGCGACG GGCAGACCGC GGTGATCGAG GTACACGTGC CCCGCGAGGA TATGGAGCGC
GGGGTATGGC GTTCCTGGCT GGCCCTCGCG GGCCTCGCCG TCATCCTGGT CGGTGGCTCC
ACACTCGTTG CCGATCGACT GGGTAGCCGG ATCGTGCGGT CCACCCGCCG GCTCGCCGGT
GCCGCCCGGC AACTCGGCAC CGGCGACCTG ACCGCCCGGG TCGCCCCCGA CGGCCCGGCG
GAGTTGCACG ACGCCGCACA GGCGTTCAAC GGGCTCGCCC AGGACATGCG GCGGCTGATC
GACGCCGAGC GGGAGATAGC CGCCGACCTG TCGCACCGCC TGCGTACGCC GCTGACGGCA
CTGCGCCTCG ACGTCGAGGC CATGCCGCCC GGGCCCGTCG GGGAGCGGAT GCGGCAAGCC
TGCGACCTCC TCGACGAGGA GTTGGAGGCC ATCATCACGG GGGCGCGGAG CAGCGTGGGC
GAGCGCGACA CCGAGTGCAC CGACCTCGTC GAGGTGCTGG CCGACCGGCT GGCGTTCTGG
GCTGTCCTGG CCGAGGACCA GCAGCGGCCC TGGACGGTGG TCGGCGGCGA TCGGCAGGTG
CCGCTGCCGG TGCCACGCGG TGATCTGATC CTGGCGGTGG ACGCCCTGCT CGGCAACGTG
TTCGCGCACA CCCCGGAGGG GTCGGCGTTC CAGGTCACCG TCTCACCGGA CGCGCTCGTC
GTCGACGATG CCGGTCCCGG CATCGCCGAC CCCGCCGCAG CCGTCCGACG CGGCACGAGC
GGAGCCGGTT CGACCGGGCT TGGCCTGGAC ATCGTGCAGC GAATAGCCAT CGCCGCTGGC
GGTCGGCTGC ACATCGGCAC CGGGTCGTTG GGTGGCGCCC GGGTGGCACT GGTCCTGGCG
GCGGGCACAG CATCCGACCT GCCGTCTGTG CCGAACGAGC GCCGTTGGCA CGCCGACCGC
AGCTGA
 
Protein sequence
MRWALNRLAL AITSMVALAF LVPLAVVTRQ LAHDRAIGDA RQQAAAMVAA LAVDEDPHLL 
TRAVMSTTAG SEGRLAVHLP DVAPVGVVHA TATDVALAAG YRRPVTADTS GGLAYLLPTV
ISDGQTAVIE VHVPREDMER GVWRSWLALA GLAVILVGGS TLVADRLGSR IVRSTRRLAG
AARQLGTGDL TARVAPDGPA ELHDAAQAFN GLAQDMRRLI DAEREIAADL SHRLRTPLTA
LRLDVEAMPP GPVGERMRQA CDLLDEELEA IITGARSSVG ERDTECTDLV EVLADRLAFW
AVLAEDQQRP WTVVGGDRQV PLPVPRGDLI LAVDALLGNV FAHTPEGSAF QVTVSPDALV
VDDAGPGIAD PAAAVRRGTS GAGSTGLGLD IVQRIAIAAG GRLHIGTGSL GGARVALVLA
AGTASDLPSV PNERRWHADR S