Gene Saro_0487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0487 
Symbol 
ID3918616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp528858 
End bp530525 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content65% 
IMG OID640443217 
Productsignal transduction histidine kinase 
Protein accessionYP_495769 
Protein GI87198512 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAAGC TTCTTTCGGG AAAGAAGCTC ACGCCGAGAT GGTACGAGCG CTTTCCGCGC 
GGCGTGCCGC TGGGCATTTT CGGGCTGACG ATGGCCGTGA CCATGCTCAG CGTGTTCGCC
ATCGAGAACG CCGAGTATCG CCGGCAGACC GCGCAGACGG CGCAGACCGC GCAAGCCGTC
GCATCGGGCC TCGAGCGCCG GTCGAACGCC AATGCCGCCT ACCTGAGGTC GAGCGCCGCG
CTCTTCGCCA CGCAGCAGAT GGTGGAAGCG CCGCTGTTCC GGACCTTCAT CCGCCAGCTC
CATCTCGATG GGCGCTATGT CGGGTCCGAT GGCATCGGCT GGGCGATGAA GGTCTATCGC
GACGATATTC CCACCGTCGA GGCGATCATG CGCGACGGGG GCAACCCCGG TTTCGCGGTG
CGCCCGACAC CCGAACCGGA ACGGCCATTT GTCGTGCCGG TCATGTTCCT TGAACCCGAT
ACGCCGCGCA ACCGGCGGGC CATCGGCTTC GACATGTTCT CCGAAGGCGT ACGCCGCACC
GCGATGAAGG CGGCCGAGCG CACCGGCCAG CCCACAGCGT CCGGCCATGT CGTGCTGCAG
CAGGAAGGCC GCCCGGATGG CCTGCCGGGC TTCCTTGTCT ACATGCCGGT CTACACCCTT
CAGGAAGACG GCCCGCGCAA GCTTCGCGGC TTTGTCTATT CGCCGTTCAA TGCGCAGCGC
TTTCTCGAAT CGTCGATCGA TGTGAACCAT CTCGATGGTG CCGGCGTGCG GCTCTATGAC
GAAGATGCCG GCGGACTGGT CGTACTTGCC TCTGTTGCGC CCGCAGAGGT TTCCGGGCGG
GTGATCCGGC GTCCGATCGA CATATCGGGA CATCGCTTCA TGCTCGAGAT CGAGGCGCCG
GCGACTGCGA TGCTGTCGGT GATGTCGGTG ATGACGCTGC TGTTCGGCCT GCTGGTGGCG
ACCTTGCTGC TGGTGCTGGC GCGTCTGCTT ACCCAGCAGG CAGTGGAGGA CCGAATCGCG
CTGGCGTGGT TCGAGCAGCA GTCCTCGATC CGCAATTCGC TGACCCGCGA ACTGAACCAC
CGCGTGAAGA ACACGCTGGC CAACGTGCTT TCGATCATTT CGCTCACGCG GCGCCGGGCC
ACGAGCCTGC CCGAGTTCGC GGACAGTCTA GAAGGGCGCA TCCGTGCGCT CTCGGCGACG
CATGACCTGC TGACCCAGTC GGATTGGGGA ACGACGCCGA TCGAACTGGT CATCCGGGCC
GAGCTTGCCC CTTATGCCGG GGATACGCAG CGGCATGTGG AGATGGGAGG GCCGGAAGTG
GAGCTGGCGC CCAACGACGC GCTATCGCTG GGTCTGGCGA TCCACGAACT GGCCACGAAC
GCAGCCAAGT ACGGGGCGCT CAGCGTCGAG AAAGGGCGCG TATCGGTCCG CTGGGAACTG
GCCGGCGAAG GGCAGGCGCG GATCGAATGG GTCGAGCGCG GCGGTCCTCC GATCGATCAG
GAGACAAAGC GCAAGCGCGG CTTCGGGACC GAGCTGATCG AGAAGATCGT GGCGCACGAA
TTGCGCAGTC CGGTGGACCT GCGGTTCGAG ACCGAAGGTG TGCGCTGCAA ACTGCTGGTG
CCGGTCCGCC GGAAAAGCGA CTTCGCCATA CGACAAGGCC GCGCCTGA
 
Protein sequence
MEKLLSGKKL TPRWYERFPR GVPLGIFGLT MAVTMLSVFA IENAEYRRQT AQTAQTAQAV 
ASGLERRSNA NAAYLRSSAA LFATQQMVEA PLFRTFIRQL HLDGRYVGSD GIGWAMKVYR
DDIPTVEAIM RDGGNPGFAV RPTPEPERPF VVPVMFLEPD TPRNRRAIGF DMFSEGVRRT
AMKAAERTGQ PTASGHVVLQ QEGRPDGLPG FLVYMPVYTL QEDGPRKLRG FVYSPFNAQR
FLESSIDVNH LDGAGVRLYD EDAGGLVVLA SVAPAEVSGR VIRRPIDISG HRFMLEIEAP
ATAMLSVMSV MTLLFGLLVA TLLLVLARLL TQQAVEDRIA LAWFEQQSSI RNSLTRELNH
RVKNTLANVL SIISLTRRRA TSLPEFADSL EGRIRALSAT HDLLTQSDWG TTPIELVIRA
ELAPYAGDTQ RHVEMGGPEV ELAPNDALSL GLAIHELATN AAKYGALSVE KGRVSVRWEL
AGEGQARIEW VERGGPPIDQ ETKRKRGFGT ELIEKIVAHE LRSPVDLRFE TEGVRCKLLV
PVRRKSDFAI RQGRA