Gene Sala_1274 details

Gene Information

Locus tag: Sala_1274
Symbol:
ID: 4082525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1330045
End bp: 1332366
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 64%
IMG OID: 638009634
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_616321
Protein GI: 103486760
COG category: [T] Signal transduction mechanisms
COG ID: [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation
TIGRFAM ID: [TIGR00229] PAS domain S-box
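
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon. Neither convention is stated in the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1330045
end_bp = 1332366

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 2322 773
```

Under these assumptions the results agree with the Gene Length (2322 bp) and Protein Length (773 aa) fields.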


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.18473
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACAGG AATGTCGTAA TGAAGCAACA ATGACGCCTG TGGCTGCCCC GACCTTCGAC 
ATGGATTCGG ACAGATCGAA CGCGCGTGCG GGTTTCTGGC GCTTCGGCGT TCCCGAATAT
TATGCGCTCG CGGCCATGGT CAGCATCGCG ATCGCCACCT ATATTTTCGT GACCGGCGAC
GCGCAGAGCG AGCGGCTTCT CACGCCCGCA CTGGTGGCCG CGATCATGGT CGCCAATCTG
GTTCCGGCGA TGGCGTTGAT CGTACTGATC GGCAGCCGCG TGGCGCGCAC GCGCGCTGTG
CGGTCGATGG CGGGAGGCAA CGGGCGCCTG CACGTTCGCC TCGTCGCCCT CTTTTCGCTG
ATTGCGGCAA CGCCGACATT GCTCGTCGTG ATTTTCGCTT CGCTGCTCTT CCAGTTCGGT
GTCGATTTCT GGTTTTCGGA CCGCTCGCGC GGGATGTTCG AGAATGCGGC GAACCTGGCC
GAGGGTTATT ATCAGGAAAA TCAGCGTCAG GTCGGCGCAA ACACCTTTGC GATGGCGACC
GATCTCGGCA TGCGGCTGCG GCAATATCCG ATCGATGCAC CGGGATTTAA CGACTATTAT
TTTCAACAGG TTGTCGTCCG TAGTTTGAAT GAATCGGCGA TTATCGAGAT CGGACGCGAT
GGCGTTGCGC GGACGGCGAC GGCGATCGAC CCAGACGAAC GGCCCGCGAC GAGCCGCCTC
ACTCCGCAAA TGATCGCGCG TCTCGACGCG GGCGAAGATG TCGTTGTCGT GCGCCGTTCG
GACCGTATTG AGGCTGCGGC CCGCTTGCCG GGCACCGACC GCGCCTTTGT TTATGCGTCG
CGCGACACCA ATGTACCGGG TTTTCAGCAA TCGGCGCGGG CCAGCGCGGT GCTGGCCGAT
TACAATCAGC TCTTTGATCG CTCGCAGATG CTTCAGTTGC AGTTCAACGG CGCGCTTTAC
CTCGGCTCGC TGCTCCTGCT CGCGCTGGCC GTGATCGCTG CGATCGTGGT CGCCGACCGC
ATCGTCCGGC CGCTCGGCAC GCTGATCGGC GCGACGCGGA CCGCGGCGGG CGGCGACCTG
TCGGTTCGCG TAACGCCCCC CGAGCGCGAC GACGAGATTT CGGTGCTGAC GCGCGCATTC
AACCGGATGA CCGAGCAGCT CGAGCGGCAA ACGCGTGCGC TCGTCAATGC GAACGAGCAG
CTCGAGGCGC GGCGCAGCTT TATCGAGGCG GTGCTGAGCG GCGTGTCTTC GGCGGTGGTA
TCGGTCGATG CCGACCGCCG CATCCTGCTC GCCAATGCCG CGGCGGAAAG GCTCATCGGT
CGCTCGGCCG ACGAGCTGAC GGGACTTCTG CTGGACGAGG TCGCGCCCGA GCTGGCGCAA
TTGCTGACCG GGAACGAGCG CGAGGCGATC GTCCAGCTTG TGCGCAAGGA CAGCGAACCG
GCGACACTCG CGGCGAAGGC GGTGGCGCAG GGTGACGGCT TCGTCCTCAG TTTTGAAGAC
ATAACGCAGC AATTGCTCGA TCAGCGTCGC GCCGCCTGGT CGGACGTTGC GCGCCGGATC
GCGCACGAGA TCAAGAATCC GCTTACCCCG ATCCAGCTTG CCGCCGAGCG GTTGCAGCGC
CGTTTCGGCG ACCGTGTCGA GGGCGATGCG CCGACGTTCC GCAAGCTGAC CGACACGGTG
ATCCGCCAGG TTCACGACAT GCGCCGCATG GTCGACGAGT TTTCCAGCTT TGCGCGAATG
CCCAAGCCGA CCTTCGGCGT CGAGGATGTA CGAGACATCT TGCGGCAGGC GGTGTTCCTG
TTCGAGGTTG CCAAGCCCGA CATCGCCTTC ACGGTCAGGA CGCCTGCCGA GGTCGAACCG
CTGGTGTGCG ACCGTCGCCT GCTGTCGCAA GCCATAACGA ACATCGTCAA GAACGCTGTC
GAAGCAATTG AAGAAAAATA TAAAAACTCT GACGCAACGG CTGTTGGCTC GATCCGGGCG
GAACTGGAGA TTGGCGCGCA CGATGCGATC GTGATCCGCG TCACCGACGA CGGTATCGGG
CTACCCGAAG CGCGCGACGC GATCGCCGAA CCCTATATGA CGACGCGGCA GGGCGGGACG
GGCCTCGGCC TCGCCATCGT CAAAAAGATC GTCGAGCAAC ATTATGGCGA ACTGGAGTTC
GCCGACAATC CGGCGGGGCA GGGGACGTGC GTAACCCTGA CGCTGCACCC CGAACGGCTG
CGGCCGCTGG CCGGGCAGGG CGACGACAAT GGTCCGGGCA AGAGCGAGAC GGTTCCGGGA
CGCATCCGCA ACCGGAAGGA CAGGGAAGAC CATGGCGCTT GA
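
A GC content figure such as the one in the Gene Information fields can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the function name `gc_percent` is hypothetical, and it assumes the sequence is pasted as text with the space-separated 10-base blocks used in this record.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to lay out the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example input: the first row of the gene sequence in this record.
first_row = "ATGCAACAGG AATGTCGTAA TGAAGCAACA ATGACGCCTG TGGCTGCCCC GACCTTCGAC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of a single row gives the value for the whole gene, which can then be compared with the GC content field.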
 
Protein sequence
MQQECRNEAT MTPVAAPTFD MDSDRSNARA GFWRFGVPEY YALAAMVSIA IATYIFVTGD 
AQSERLLTPA LVAAIMVANL VPAMALIVLI GSRVARTRAV RSMAGGNGRL HVRLVALFSL
IAATPTLLVV IFASLLFQFG VDFWFSDRSR GMFENAANLA EGYYQENQRQ VGANTFAMAT
DLGMRLRQYP IDAPGFNDYY FQQVVVRSLN ESAIIEIGRD GVARTATAID PDERPATSRL
TPQMIARLDA GEDVVVVRRS DRIEAAARLP GTDRAFVYAS RDTNVPGFQQ SARASAVLAD
YNQLFDRSQM LQLQFNGALY LGSLLLLALA VIAAIVVADR IVRPLGTLIG ATRTAAGGDL
SVRVTPPERD DEISVLTRAF NRMTEQLERQ TRALVNANEQ LEARRSFIEA VLSGVSSAVV
SVDADRRILL ANAAAERLIG RSADELTGLL LDEVAPELAQ LLTGNEREAI VQLVRKDSEP
ATLAAKAVAQ GDGFVLSFED ITQQLLDQRR AAWSDVARRI AHEIKNPLTP IQLAAERLQR
RFGDRVEGDA PTFRKLTDTV IRQVHDMRRM VDEFSSFARM PKPTFGVEDV RDILRQAVFL
FEVAKPDIAF TVRTPAEVEP LVCDRRLLSQ AITNIVKNAV EAIEEKYKNS DATAVGSIRA
ELEIGAHDAI VIRVTDDGIG LPEARDAIAE PYMTTRQGGT GLGLAIVKKI VEQHYGELEF
ADNPAGQGTC VTLTLHPERL RPLAGQGDDN GPGKSETVPG RIRNRKDRED HGA
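
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is illustrative only: `translate` and `CODON_TABLE` are hypothetical names, and the codon assignments used are those of the standard code, which table 11 shares. Table 11 additionally allows alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to lay out the 10-base blocks.
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCAACAGG AATGTCGTAA TGAAGCAACA"))  # prints: MQQECRNEAT
```

The printed residues match the first block of the protein sequence, and the same function can be applied to the full gene sequence to compare against all 773 residues.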