Gene Sala_2089 details

Gene Information

Locus tag: Sala_2089
Symbol:
ID: 4080063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2193821
End bp: 2195500
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 71%
IMG OID: 638010464
Product: signal transduction histidine kinase
Protein accession: YP_617131
Protein GI: 103487570
COG category: [T] Signal transduction mechanisms
COG ID: [COG3920] Signal transduction histidine kinase
TIGRFAM ID:
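
The coordinate and length fields above are consistent with one another, and the relationship can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that contributes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(2193821, 2195500)
print(length)                     # 1680
print(protein_length_aa(length))  # 559
```

Under these assumptions, 1680 bp corresponds to 560 codons, of which one is the stop codon, giving the listed 559 aa.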


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.268373
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.985504
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCCTT CGTCATGGGC AGACCGGCTG CTCGCGCTCT CGGCCCGCTT CGTCGGGTTC 
GTCATCCTCA TCGCCACGCT CGCCGCCGCG GCGGGCAGCT TCTTTTACAA TCGCGCCGAC
GAGCGCGCCC GCGCCGAGCG CGTCGCGATG CAGATCGACA CGCGGCTGCG CGATCATGTC
GCGCTGCTCG AAGGGGTGCG CGCGCTCTAT CGGTCGGACA GCCGGTCGAG CGGTCCCGGC
GTCCGCGCCT ATCTCAACGC GCTGCAGCCG CAGGTCCATG CGCCGGGGCT GGAGGGGATC
GGCATCGCGG TCGCGATGCG CCAGCGGACC CCCGCCGCGG CCGAGGCGAT GCTGCGCGAA
AATTACGGCC GCGACATTGC GGTCTGGCCG GTCAGCGACC AGCCCATCGG CTTTCCGATC
GTCCTGCTCG AACCTCCGAC GCCGCGCCGC GACCAGGCGC TGGGGTATGA CATGTACAGC
GAGCCGGTAC GCCGCGCGGC GATGCGCCGG GCCTGGCAGA CGGGAGCGCC CGCGGCGAGC
GGCATCGTCG AGCTGGTGCA GGAAGGGGCG GCGACGCGGC GCCAGGCGGG GTTCCTCATC
TATGTCCCCG TCTATGCCGA CCAGCCGGCG CCCGGCGTCG GCGCCCCTGC GCTCCCCGGC
GACCCGCCGC GCGCCGCGAC CCGTGCCAGC GCGCCGGGCG CGCGGCCGAT CGAGGCGTTC
GTCTATGCGC CCTTCCGCAT CGACGACCTG ATGACCGCAA TCCTCGGCGC GCAGCTCGAC
ACGATCGACG GGATCGAAAT CCGTGCCGGC GAAGGACCGG CGGCGCCGCT CGCCTATCGT
CACGGCACGA TGGGCTGGGA CCCGCACGAA CAGGTGCTGC GCGTCGCCGA CCGCCAGTGG
ACGATGCGCA TTTCCTACAG CCGCCTGTTC GAGCGGCTCG GCCGGCCGGT CGCCATCTTC
CTCTTCGGCT TTGCGATCAT GCTGCTCGCA ATGCAGCTCA TGCGGCTCCA GCACCGCCGC
GTCGACGCCT TTCGCGCGCT CGCCGACGAA CAGGCGCTGC GCGCCGCAGA CCGCGAGCTG
ATGATCGGCG AAATGGCGCA CCGGATGAAG AACGCCTTTG CGCGCATCGG CGCGCTCGCG
CGCATAACCT TGCGCGAATC GGCCAGCCTC GACGAGTTCG AGGCGAAGTT CGACGGCCGG
ATGCGCGCGC TGTCCGATGC CAAGCAGATG CTCGTCACCG GCGCGGTCGA CACGGTCGAA
CTCGAACGCA TCGTCCACCG CGAGCTCGAC CTCGCCGGGG TTCCACCCGA CCGGCTTGCC
GCCATCACCG GCCCCGCGGT GCGGCTCGAT GACGAGGGCG CGCAGGCGAT CTCGCTCGCG
GTCCACGAAT TCGTCACCAA CAGCATCAAA TATGGCGCGC TGGCGGGCGA GGGCGAGCTC
AGCGTCGGCT GGCACCGCGA GGATGGTCAG GTGACGCTCG ACTGGACCGA AAGCGGGCTG
CCCGAAACCC CGAATATCGA GCAGGAAAGT TTCGGCACCC GGTTCATCCG CACGCTGATC
GAACGTCAGC TCAAAGGGAG ATGGACGCGC ACCGCCGCCG CGGGGCGGCT CTCGATCGTC
ATCCGCTGGC CCGATGCCGC CTCGGCGACC GGCGCGACCG CAGCGTCGCC CGCACGTTGA
 
Protein sequence
MRPSSWADRL LALSARFVGF VILIATLAAA AGSFFYNRAD ERARAERVAM QIDTRLRDHV 
ALLEGVRALY RSDSRSSGPG VRAYLNALQP QVHAPGLEGI GIAVAMRQRT PAAAEAMLRE
NYGRDIAVWP VSDQPIGFPI VLLEPPTPRR DQALGYDMYS EPVRRAAMRR AWQTGAPAAS
GIVELVQEGA ATRRQAGFLI YVPVYADQPA PGVGAPALPG DPPRAATRAS APGARPIEAF
VYAPFRIDDL MTAILGAQLD TIDGIEIRAG EGPAAPLAYR HGTMGWDPHE QVLRVADRQW
TMRISYSRLF ERLGRPVAIF LFGFAIMLLA MQLMRLQHRR VDAFRALADE QALRAADREL
MIGEMAHRMK NAFARIGALA RITLRESASL DEFEAKFDGR MRALSDAKQM LVTGAVDTVE
LERIVHRELD LAGVPPDRLA AITGPAVRLD DEGAQAISLA VHEFVTNSIK YGALAGEGEL
SVGWHREDGQ VTLDWTESGL PETPNIEQES FGTRFIRTLI ERQLKGRWTR TAAAGRLSIV
IRWPDAASAT GATAASPAR
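
The gene and protein listings are printed in blocks of ten letters, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. All function names are hypothetical. The codon assignments are the standard ones used by translation table 11; the sketch does not handle alternative start codons (this gene starts with ATG) or ambiguous bases such as N.

```python
# Standard codon assignments used by NCBI translation table 11,
# listed in T, C, A, G order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Drop the spaces and line breaks used for the ten-letter display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna):
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing letters other than A, C, G, T.
    """
    seq = clean_sequence(dna)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCGCCCTT CGTCATGGGC AGACCGGCTG"))  # MRPSSWADRL
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the listed GC content and protein sequence to be compared against the recomputed values.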