Gene Sare_1289 details

Gene Information

Locus tag: Sare_1289
Symbol:
ID: 5706251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1500051
End bp: 1502456
Gene Length: 2406 bp
Protein Length: 801 aa
Translation table: 11
GC content: 69%
IMG OID: 641270802
Product: histidine kinase
Protein accession: YP_001536183
Protein GI: 159036930
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
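
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a single stop codon that is not counted in the protein length. Both points are assumptions, not statements from the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(1500051, 1502456))  # 2406
print(protein_length(2406))           # 801
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.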


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.130589
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00742101
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGGTCTC GCAGTACGAA TCTGCGTGCG AAGATTGTTG CTCTCCTGGC CTCGCTGTTC 
GCGCTGTGGG CGTTCGCCGC CTGGGTGACA GTGAGCGACG GCGTGAACCT GCTCGGCGTC
CAGACGATCG ACAGCCGGAT CGTGGGCCCG AGTGAGCCCC TGCTGCTGGA ACTCCAGGTC
GAGCGTCGAC TGTCCACCGC GGAGCTCGGT TCAGCCAGCC CCGCTCGCCA GGAGATCCTC
GCCGCCAGTC GGAAGCGGGT CGACGCAGCC ATCGCCGACT TCACCTCGAC CGCCCAGAGC
TGGTCGGCCC GGCTAGTTGC CAACGCCGAG GCCGAGCGGA CGGTCAGCGA CGCCATCAGT
GCGCTCGACA GCCTGCCCGA GGCCCGGCGA TCGATCGACA ACCGTAGTAT CAGTCGGGCC
CACACCGCGC AGGTCTACAC AGACGTGATC AGCTCCCTGT TCAAGCTCTA CGACGTGGTC
GGTGGCCTCG ACGACAGGGA CATCGAGGCG GACACCCGCA ACCTGATCGA GCTGTACCGG
ATCCGTGAGC TGATTTCCCA GGAGGACGCC CTGCTCTCCG GGGCGCTCGC GGCCGGTCAG
GTCACCGCGG CGGAGAACAT GCGGTTCACC GAGTTCGTCG CCGCGCGGCG GTTCCTCACC
GAACGGGCCG TCGAACAGCT GCGTACGCCG GAACGCACCC AGTTCGAGCA GGTGGTCGCC
GGGCGAGCAT TCACCCAGCT GCGCACCCTG GAGGAGCGCA TTCTCACCGG GCAACGGGGG
ACCTCCAGCC CGCCCATCCC GATCCAGGAA TGGGATGGCA GCGTCGAGGC CGCCCTGACC
GAGTTGCAGG AAATGGTGCT CGACGCCGGC GATGCCCTGG TGGACCGGGC GGCTCCGGTC
GCCACCTGGG TGATCGTTCG GCTGGTGCTG GCCGCCGGTC TGGGCCTGCT GGCCGTCGTC
GCCTCGATCG TTGTGTCCGT GACCACGGCC CGTGCGCTGG TCGCACAGTT GCAGCGCCTG
CGGGCCGCCG CGTACCAACT GGCCGACAAG CGGTTGCCGG GTGTGGTCGA ACGGCTCGGC
CGCGGGGAGA AGGTCGACGT CGCCGCCGAA GCCCCGCCGC TGAACTTCGG CGACGACGAG
ATCGGCCAGG TGGGGCAGGC GTTCAACAGG GTCCAGGAAA CCGCCATCCG CACTGCGGCA
GAGCAGGCCG AGCTTCGCCG CACCGTCCGG GACGTGTTCC TCAGCCTTGC TCGCCGTACC
CAGGTCCTCG TCCACCGACA GGTCAGCCTG CTCGACGCGA TGGAGCGTCG GGAACACGAC
GCCGAGGAAC TCGAGGACCT CTTCCGCGTC GACCACCTCG CCGTCCGGAT GCGCCGTAAC
GCCGAAAACC TCGTCGTGCT CTCCGGGGCG ACCTCAGGGC GCGCCTGGCG TCGCGATGTA
CCCATGGTCG ACGTCGTTCG CGGCGCCGTC CAGGAAGTCG AGGACTACGC CCGAGTCACC
GTCATGCCCC TCCCTCCCGT CGGGCTCGCC GGCCGCGTCG TCGGCGACGT GAGTCACCTC
CTCGCGGAAC TGATCGAGAA CGCGCTGTCC TTCTCGCCCC CACAAACCGA GGTACGGGTG
CGGGGAGAGA GCGTCGGGCA CGGCTTCGCC ATCGAGGTGG AGGACCGAGG GCTCGGGATG
GACGCGGAAG AACTGGCTGC CGCCAACGAA CAGATCGCCA GCGATCAGCA GTTCCAGCTG
GAGAACGCCA ACAGGCTCGG ACTGTTCGTG GTCAGTCGGC TGGCCGCGCG ACACAAGCTC
GGAGTGCACC TGAAGGCGTC ACCGTACGGC GGCACCACCG CCATTGTTCT GATTCCTGCG
GATCTCATCA CCAGCGCGGA CGTCCGTACC ACCGTGCCGA CTCCGGACCG GCCGGCTGTG
CCCGCCGCCT CCGCCGGCGC TGGGGCGTAC GGAAGCCGGG CGCCGAGGCC CGCGGCCGGG
AGCCCGCTCG ACGCTCCCAC AGTGCCCGTC TTGACGGTAC CGACGCCCAG GTTCGGCGTG
CCCTCCGTAC GTCAGGACCC GCCACGCGTG ACCCGGCCGG CGGATGCCGC GACGACTCCC
CGACCGGCAG CCGGGGCGTC CGTCGGGGCG GAGGGGCAGT CGCAGCAGTC CCACACGCTC
GGTGGGCTAC CCATTCGTGT GCGCCAGAAC AACCTGGCAC CGCAACTGCG TGCCAATTCC
AGCACGACTG AGGACAGGGG CGACAGCGAA CCTGTCCGTT CCCCCGACCA GGTGCGACGC
ATGATGACGT CCTACCAGAG CGGCAGCCTG CGCGGCCGGG CCGACGCTGA GCGCCTGAGT
GACAAGGACA CTGGCGCGAA CGCCGCGAAT ACCGATGAAT CGACCGTAAC CGAGACTGAT
CGCTAG
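
The record reports a GC content of 69%. The sketch below shows one way such a figure could be computed from a sequence displayed in space-separated blocks of ten, as it is here. It assumes the value is the fraction of G and C bases over the gene sequence; the record does not say how it was derived. The function name `gc_fraction` is hypothetical, and the example call uses only the first block of the sequence.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    # Remove the block and line separators used in the displayed sequence.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First ten-base block of the gene sequence: 3 G + 3 C out of 10 bases.
print(gc_fraction("ATGCGGTCTC"))  # 0.6
```

Passing the full gene sequence as one multi-line string works the same way, since all whitespace is removed before counting.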
 
Protein sequence
MRSRSTNLRA KIVALLASLF ALWAFAAWVT VSDGVNLLGV QTIDSRIVGP SEPLLLELQV 
ERRLSTAELG SASPARQEIL AASRKRVDAA IADFTSTAQS WSARLVANAE AERTVSDAIS
ALDSLPEARR SIDNRSISRA HTAQVYTDVI SSLFKLYDVV GGLDDRDIEA DTRNLIELYR
IRELISQEDA LLSGALAAGQ VTAAENMRFT EFVAARRFLT ERAVEQLRTP ERTQFEQVVA
GRAFTQLRTL EERILTGQRG TSSPPIPIQE WDGSVEAALT ELQEMVLDAG DALVDRAAPV
ATWVIVRLVL AAGLGLLAVV ASIVVSVTTA RALVAQLQRL RAAAYQLADK RLPGVVERLG
RGEKVDVAAE APPLNFGDDE IGQVGQAFNR VQETAIRTAA EQAELRRTVR DVFLSLARRT
QVLVHRQVSL LDAMERREHD AEELEDLFRV DHLAVRMRRN AENLVVLSGA TSGRAWRRDV
PMVDVVRGAV QEVEDYARVT VMPLPPVGLA GRVVGDVSHL LAELIENALS FSPPQTEVRV
RGESVGHGFA IEVEDRGLGM DAEELAAANE QIASDQQFQL ENANRLGLFV VSRLAARHKL
GVHLKASPYG GTTAIVLIPA DLITSADVRT TVPTPDRPAV PAASAGAGAY GSRAPRPAAG
SPLDAPTVPV LTVPTPRFGV PSVRQDPPRV TRPADAATTP RPAAGASVGA EGQSQQSHTL
GGLPIRVRQN NLAPQLRANS STTEDRGDSE PVRSPDQVRR MMTSYQSGSL RGRADAERLS
DKDTGANAAN TDESTVTETD R