Gene Sare_1783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1783 
Symbol 
ID5706594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2054486 
End bp2055778 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content72% 
IMG OID641271286 
Producthistidine kinase 
Protein accessionYP_001536661 
Protein GI159037408 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000910041 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACACCG CGGCGGGGAC GCACTGGCGG CGGCCCGGGC CGACCCCGAC ACAGCAACGG 
CGGGACCTCA TCGTGGGGCT GGCCGTGACC GTGCTGGCCA TCACGAGCCT CACCCTCACC
CGCGGCACGG GCGTGTTCCT GCTCGGCCCG CCACCATCGT TCGTGGAACA GCTGCTGTGG
GCCGGCGCGG TCGGCCTGCC GCTGATCTGG CGGCGACGGT TCCCCGTGAC GGCGACGCTG
GTCATCTCGG TCGCCTTCGT CGCGGCACAA GCCCGCTCCG TACCGGAGAC GCGGGTCACC
ACCGGAGTAC TGTTCGCGGC CATCTACACC CTCGGCGCCT GGGGACAGGA CCGCCGCCTG
GCCCGTCGGG TTCGGATCGG CGTGATCGCC GCCATGTTCG GGTGGCTCGG GATCTGGTAC
TCGGTCAATC TGGGCAACCT GCTGGCGGAC CCGCCGCCCG ACGCGTTCGC CGACGCCGCG
GGACCGATAC CCCCGCTGGT CGCCGCGCTG CTCAGCGACG CGCTCTTCAA CATCCTGTTC
TTCGGATTCG CCTACTTCTT CGGCGAGATC GCCTGGCTGG CCGCCCGACG CGAGCACGAA
CTCCAGGCGC AGGCCGAAGA ACTACGCCGG TCGCAGGCCG AGGCCAGAGA GCACGCCGTG
GTCGGCGAGC GGGTTCGGAT CGCCCGGGAA CTGCACGACG TCGTCGCCCA CCACGTCTCG
GTGATGGGCG TGCAGGCCGC CGCCTGCCGC CGGGTCCTCG ACCGGGATCC GGCCAAGGCC
CGCACCGCAC TGGCCGCGGT CGAAGAGTCG GCCCGCACGG CGGTGGACGA GCTACGCCGG
ATGCTCGGCG TCCTGCGTGC CCGCAACCAG GAACCGGACA CCGTCGAACC GCCCGCCGGG
ATCAGCCGGG TCGCCACCGT GATCGAACGG GCGCGCGAGA TCGGGCTACG AGCCACGCTG
GGGGTGTACG GCGACCCGGT GCCGATCCCG GAATCGATCT CCCAGGCGGC GTACCGGATC
GTGCAGGAAG CAGTGACGAA CACGCTCAAA CACGCCGTCG CGGCGACCAC ACTGGACGTG
CGGATCCGCT ACCTGGCGCA CGAGGTGGAG GTGGACGTGA CCGACGACGG GCGCAGCAGC
GGCACGATGA ACGCGGACGG GGTGGGCCTG GTCGGCATGC GAGAGCGGGT CGCCGCCCAT
GGCGGCACAC TGGAGGTCGG CCCCCGCGCC GCCGGCGGCT GGCGGGTCCG GACCCGCCTC
CCGCTACCGC TGACCGAGCG GCCGGCGGCA TGA
 
Protein sequence
MDTAAGTHWR RPGPTPTQQR RDLIVGLAVT VLAITSLTLT RGTGVFLLGP PPSFVEQLLW 
AGAVGLPLIW RRRFPVTATL VISVAFVAAQ ARSVPETRVT TGVLFAAIYT LGAWGQDRRL
ARRVRIGVIA AMFGWLGIWY SVNLGNLLAD PPPDAFADAA GPIPPLVAAL LSDALFNILF
FGFAYFFGEI AWLAARREHE LQAQAEELRR SQAEAREHAV VGERVRIARE LHDVVAHHVS
VMGVQAAACR RVLDRDPAKA RTALAAVEES ARTAVDELRR MLGVLRARNQ EPDTVEPPAG
ISRVATVIER AREIGLRATL GVYGDPVPIP ESISQAAYRI VQEAVTNTLK HAVAATTLDV
RIRYLAHEVE VDVTDDGRSS GTMNADGVGL VGMRERVAAH GGTLEVGPRA AGGWRVRTRL
PLPLTERPAA