Gene Saro_2666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2666 
Symbol 
ID3918440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2904622 
End bp2905962 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content66% 
IMG OID640445443 
Productsignal transduction histidine kinase 
Protein accessionYP_497936 
Protein GI87200679 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.653198 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTGGCCG AAGCGCATCT CTCTGCCATA ATACAATCAT CTGATGACGC GATCATAAGC 
AAGGACCTCT CGGGCACGAT CCTGAGCTGG AACCCCGCCG CCACGCGCAT CTTCGGATTC
TCCGAAGCGG AGATGATCGG CCATTCCGTC CGCCGCCTCA TTCCGGCGGA GCGGCAGGCG
GAAGAGGACG ACATCCTCGC GCGCATCGCC CGTGGCGAGC GGGTGAAGAG CTTCGACACG
ATGCGGCAGC GAAAGGACGG GGTCCAGATC GCGGTCTCGA TCACCGTCTC GCCGGTCTAC
GACAAGGCGG GCCGCATCGT CGGGGCCAGC AAGATTGCCC GCGACATCAC GTCGCGCGAG
GAAGGCCAGC GAGCCCTGCG CGAGAGCGAG GCCCGCTTTC GCATGCTGGC CGACAACATC
TCGCAGCTCA CTTGGGTGGC CGACCGCACG GGCGCCATCG GCTGGTATAA CAAACGCTGG
TACGACTACA CCGGGGTGCC GCACGGTTCG ACCGATGGCT GGGGCTGGGA TCGCGTGCAC
CATCCCGACC ATCTCGAACG GGTGCGCGAG CATTTTGCCG AGAGCATTGC TGCGGGACGC
GAATGGGAGG ACACCTTCCC GCTTCTCGGC CGCGACGGGA CCTACCGCTG GTTCCTGTCG
CGCGCGAAGC CGATCCGGGG CGAGGATGGC GGGATCGTCT ACTGGTTCGG CACCAATACC
GACGTGACCG AGATGCTCGA GAAGGAAGAG CAGATCCGCG TCCTGCTGAT GGAAGTGAAC
CACCGCTCGA AGAACCTGCT CTCGGTCGTC CAGGCGCTGG CCCGGCGGTC TGGCGGAGGC
GATCCCGAGT TCCTGCGCCG TTTCGAGAAC CGTCTCGCCA GCCTTTCTGC CAACCAGGAC
CTGCTGGTGC GGCGCGGTTG GTCGACGATC ATGATGGACG AGCTGGCCGA CGCCCAGCTC
GCGATCCTCG GCCGCGACAG CCGCGAACAG GTCCTGACGC AAGGCCTGTC CCTGGCCCTG
AGCCCCCGCA GCGCCGAGAT CATCGGCATG GCGCTGCACG AGCTGGCAAC CAACGCGCTC
AAGTACGGGG CGCTCAGCGT GCCGACCGGC CGCGTTTCGC TGTCATGGGA GGAGACACCG
GACGGGCATT TCCAGATCGA CTGGCGCGAA AGCGGCGGCC CAGCCGTGCG CGACCCGAAG
CAGCACGGCT TCGGGACAAC GCTCATCCGC CATATTCCGG CGCGCAGCCT CCACGCAGAC
GTCACGCTCG ACTACGCGCC CGCAGGCCTG CGCTGGCAAT TGCGCTGCAC CAGCGCGACG
GCGCGGACCC TTTCGAGTTA G
 
Protein sequence
MLAEAHLSAI IQSSDDAIIS KDLSGTILSW NPAATRIFGF SEAEMIGHSV RRLIPAERQA 
EEDDILARIA RGERVKSFDT MRQRKDGVQI AVSITVSPVY DKAGRIVGAS KIARDITSRE
EGQRALRESE ARFRMLADNI SQLTWVADRT GAIGWYNKRW YDYTGVPHGS TDGWGWDRVH
HPDHLERVRE HFAESIAAGR EWEDTFPLLG RDGTYRWFLS RAKPIRGEDG GIVYWFGTNT
DVTEMLEKEE QIRVLLMEVN HRSKNLLSVV QALARRSGGG DPEFLRRFEN RLASLSANQD
LLVRRGWSTI MMDELADAQL AILGRDSREQ VLTQGLSLAL SPRSAEIIGM ALHELATNAL
KYGALSVPTG RVSLSWEETP DGHFQIDWRE SGGPAVRDPK QHGFGTTLIR HIPARSLHAD
VTLDYAPAGL RWQLRCTSAT ARTLSS