Gene Saro_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1600 
Symbol 
ID3918708 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1663284 
End bp1664690 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content69% 
IMG OID640444340 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_496874 
Protein GI87199617 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.298704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCGCG GGATGCTGCG CTCGACGACC GTGCGGTTCG CCGCGCTCGT GTTCCTGCTG 
CAGGTCGTGG CAGCGGCGTT CATGCTGGGC GGCCTTGGCG CCGTAATGCG GCAGCAGAGC
CGCGCGCAGG CCCTCGATAC CGTGGAAACC CTGCGCGACG ACCTGATGGC GACGACGGCA
CAGGGCGGCG AGCGGCAATT GGTCGAGGCG ATCAGGCTGC GGCTGGCGAA CGAGGTCGGC
CGGGGTGTCG TGGTGGCGCT GGTGGACCCG TCCGGACGTC TGGTGGAGGG CAATCTGGCT
CGCATGCCCG ACGACGGTTT CGCCGTGCAC CTGAACAGGG TCGCCAGCGT CGTCAACGTG
CGACGGCGCA ATCACGCGGC TGACGAGGCA GCTCTCATCG TAGCTGCGCG CCTGCCCGGC
GGCCAACTAC TGCTCGCCGG AACGGTGGTG GAAAGCGACA GGCAATTCCT GGCGCTGCTC
GAACGCGCCA GCATTGCGAC GCTGGCGTTG TCCCTTCTTC TGGCGGGGCT GGCGTCCTTT
CTGGCGACGC GGCAGATCGT CCAGCGGTTG CGCGGTACGG TCGCGACGCT CGAGGCCGTC
GGGGCAGGCG ATCTGGCCCG CCGGGTGCCG CCCGACGGGT CCGGAGACGC GTTTTCGCGG
CTGGGCGAGG AGGTCAACCG CGCGCTGGCC CGGGCCGAGG CGCTGAACGG GGAACTGAAA
ATTGCGACCG ACGTCCTCGC CCACGATCTC AAGTCACCTT TGACGCGATT GCTGTCGGCG
CTGGATCGCG CTTCGGCCCG CGCCGAGGAT GCCGAGGCGC TTGCCGCCGT TGAGCAAGCC
GAGGCGGAGG CACGGCGGGT GCTTTCGATC ATCGACACGG CGCTTGGCAT CTCGAAGGCC
GAGGCGGGGT TCGGCCGCGA GAGTTTCACG CCTGTCGATC TCGGCGCAAT GCTCGAGACG
ATCGCCGAAA TCTACGCGCC GGTGGTGGAA GAGGAGGGGC GTCGCATGGA AGTGCAGGCA
CCGCCGGGAC TGGTGGTGCC GATCCATCGC CAGCTCATGG ATCAGGCCAT CGGCAACCTG
CTGGACAACA CGATCCGTTA TGGCGCCGGG GCGATAAGCC TTGCTGTGGA GCCGCGCGAC
GGCGCGATGG CGATTTCGGT TGCTGACGAA GGGCCCGGCA TTCCGCAACA CCAGCACGAA
GAAGCGCTGC GCAGGTTCGG AAGGCTCGAC GAGGCGCGTG GCGGTTGGGG CGCAGGGCTC
GGGCTCGCTC TGGTCGAAGC GGTCGCGCAC TTGCACGGAG GGCGGGTCGA ACTGGCCGAA
AACCGGCAAT CGCGGTCCGG CCAGCCGGGT TTGAAGGTGA CGCTGGTGCT CGGGCAACGC
GCTCCGGGCG GCGGCGATAG CGGTTGA
 
Protein sequence
MLRGMLRSTT VRFAALVFLL QVVAAAFMLG GLGAVMRQQS RAQALDTVET LRDDLMATTA 
QGGERQLVEA IRLRLANEVG RGVVVALVDP SGRLVEGNLA RMPDDGFAVH LNRVASVVNV
RRRNHAADEA ALIVAARLPG GQLLLAGTVV ESDRQFLALL ERASIATLAL SLLLAGLASF
LATRQIVQRL RGTVATLEAV GAGDLARRVP PDGSGDAFSR LGEEVNRALA RAEALNGELK
IATDVLAHDL KSPLTRLLSA LDRASARAED AEALAAVEQA EAEARRVLSI IDTALGISKA
EAGFGRESFT PVDLGAMLET IAEIYAPVVE EEGRRMEVQA PPGLVVPIHR QLMDQAIGNL
LDNTIRYGAG AISLAVEPRD GAMAISVADE GPGIPQHQHE EALRRFGRLD EARGGWGAGL
GLALVEAVAH LHGGRVELAE NRQSRSGQPG LKVTLVLGQR APGGGDSG