Gene Saro_1664 details

Gene Information

Locus tag: Saro_1664
Symbol:
ID: 3918773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1743718
End bp: 1745310
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 67%
IMG OID: 640444405
Product: histidine kinase
Protein accession: YP_496938
Protein GI: 87199681
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
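
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are inclusive coordinates on the replicon and that the gene length includes the terminal stop codon. Both assumptions are consistent with the numbers listed but are not stated in the record itself. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1743718
end_bp = 1745310
gene_length_bp = 1593
protein_length_aa = 530

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, so the protein
# has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```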


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.586003
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGCGC TGCCGCGCTT CGCCGCCCTG CCATCGCGTG GCGTTCCCAT CGCCGCGGCA 
TGGGCGGCGC TGGCGGTGGC CCTTCTGATC GTGGACGCCG TGGTGCCCGG AGAGAGCCTT
GCGCTTCACG GCGCTACTCT CACCAGCTTC TGCATCTTCG TTCTCACCTG CATTCGCCAG
GCGCGGCGGG ACGACGACAG CCAAGACACG GACCGCGAAG CCCTGCGCCA GGAGCTTGAC
GCTGCCGAAT GCGCCAGCGC TGCGAAGAGC CGGTATCTTG CCAGCGTCAG CCACGAGATC
CGCTCGCCGC TCAATGCGAT CTATGGCTAT GCGCAGCTCG TCGAGCGCGA AGGCACGGTC
GATCCGCGCG ATGCGGCCAG GGTCATCCGT CGCAGCGCCG AACATCTCAC TAACCTCGTG
GAAGGCCTGC TCGACATCGC CTCGATCGAA CAGGGCGTAG TGCGCATCGA CAGCACGGTT
GCACGCCTCG ATGCGCTGGT CGAGCAGGTG GCGGAAATGT TCCGGCCGCT CGCGGTGCAG
AAGGGCCTCG CGTTCCGCTG CGATCTTCCC GCGCGTCTGC CCGAGTTCGT GCGGATGGAC
GAACGTCGGG TGCGGCAGGT GTTGATCAAC CTTGTCTCCA ACGCGGTGAA GTTCACCCAG
GCGGGCGAAG TGGTCCTGGC GGTGCGCTGG AGCGGCGAAA TAGCGACATT CGAAGTGCGC
GATACCGGCC CGGGCATTTC CCCGGCCCAT CAGGAGACGG TATTCTCGCC CTACCAGACT
GGTGGGGTCG AATGTGGCGG CGGTGCCGGA CTGGGACTGG CGATCACGCG TGCGATAGTC
GACATGCTGG GCGGCGACCT GCGGCTGGAA AGCCGGCTGG GCGAGGGATC GCTGTTCCGC
GTCGTGCTGA TGATGCCGCA TGTCTCCGGC ATGGTGGACT GTGCCGCGCC GCGCCCTCGA
CCGGTGGGGT ATCGGGGGGC AAGGCGGTCG CTGTTGCTCG TCGATGACGA TGCCGACCAT
CTTGCCGTGC TGCGCTGCAC GCTCGAATCC TGTGGGTTCG ACGTTTCGCT GGCGCCCGAC
GGCGCGGCGG CGCTTGCTCT GGCTCACGCG CGCGCCTTCG ATGCTGTCGT CCTGGACATT
GCGATGCCGG GATTGTCGGG TTGGGAGGTC GCGGAAAGGC TGCGTGCCGC GCACGGGCAG
TCATTCAGGC TGGTCATGCT TTCCGCCAAT GCCGAAGAGC GGCACGGGCC ACGGGGCAAG
GAGCCTGACC ACGACCTGTT CCTGATGAAG CCGGTTGAAC TGTCCGCGCT GGTCGACTCG
CTGGGCAAGC TGCTGGGGCT GGAGTGGATA CTCTCCGAAG GCGGCGGCGA CACCGTGCTG
GCCCAGCCGC GGATCGACGT GTCGGACAGC GCGCGGACAC ATGTCGATCG TCTGAAATCG
TTGGCGAGGA TCGGCCACTT GCGAGGGTTG GAGGCGGAAA TTCGCAGCAT GCAGGAAACG
GACACCGGGA CGGCGCCACT TGCCGCGCGC CTCTTCGATT GCCTCGACCG GTGCGACCTC
GTGGCGATGC GGCGGGTGTT GGAGGGCATA TGA
 
Protein sequence
MIALPRFAAL PSRGVPIAAA WAALAVALLI VDAVVPGESL ALHGATLTSF CIFVLTCIRQ 
ARRDDDSQDT DREALRQELD AAECASAAKS RYLASVSHEI RSPLNAIYGY AQLVEREGTV
DPRDAARVIR RSAEHLTNLV EGLLDIASIE QGVVRIDSTV ARLDALVEQV AEMFRPLAVQ
KGLAFRCDLP ARLPEFVRMD ERRVRQVLIN LVSNAVKFTQ AGEVVLAVRW SGEIATFEVR
DTGPGISPAH QETVFSPYQT GGVECGGGAG LGLAITRAIV DMLGGDLRLE SRLGEGSLFR
VVLMMPHVSG MVDCAAPRPR PVGYRGARRS LLLVDDDADH LAVLRCTLES CGFDVSLAPD
GAAALALAHA RAFDAVVLDI AMPGLSGWEV AERLRAAHGQ SFRLVMLSAN AEERHGPRGK
EPDHDLFLMK PVELSALVDS LGKLLGLEWI LSEGGGDTVL AQPRIDVSDS ARTHVDRLKS
LARIGHLRGL EAEIRSMQET DTGTAPLAAR LFDCLDRCDL VAMRRVLEGI
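
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling, of how one might strip the block formatting, compute GC content, and translate the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard NCBI compact listing. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. The sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
# Codon table in NCBI compact order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGATCGCGC TGCCGCGCTT CGCCGCCCTG"
print(translate(first_blocks))  # MIALPRFAAL
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow comparison against the GC content and Protein Length fields.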