Gene Saro_1237 details

Gene Information

Locus tag: Saro_1237
Symbol:
ID: 3917868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1287922
End bp: 1289565
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 67%
IMG OID: 640443974
Product: putative signal transduction histidine kinase
Protein accession: YP_496516
Protein GI: 87199259
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
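
The coordinate and length fields in this record are consistent with each other. The minimal sketch below shows the arithmetic, assuming that Start bp and End bp are 1-based inclusive positions and that the last codon of the CDS is a stop codon. The function names are illustrative and do not come from the database.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 1287922
END_BP = 1289565


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1644 547
```

Both results match the Gene Length and Protein Length fields reported for Saro_1237.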


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.232329
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCGCGA AGGGATGGAC ACGCAGGACC GGTCGCGTGA TCGCGACGGG GCGTGCCGTA 
CTCGCGCTGA CCTTTCTCCT CGCGCTCTGG CTCGATCCGG TGGTCCCGGT CCGCGGGGTG
ACGCTGGGTT ACGCGCTGGT CAGCAGCTAC CTGCTGTGGA CGGCGCTGCT GATGATCATC
GCCTGGCGGA GCTGGTGGTA CGACTTCAGG CTCGCCCCGG TCGCCCAGGT CGTGGACATC
CTGATGTTCC TGTCGGCGGT CTACTTCACC GAAAGTCCCT TCACCGAATT CCAGAGCCCG
TTCCTGGCCT TCGCAGCCTT CATGCAGGTC AGCGCGATGG TGCGGTGGAA TTGGCGCGCG
ACGGCGCTGA CAGCGCTCGT CCTGCTCAGC ACCAACCTCG CGTTCGGCAT CACGTTGTAC
CAGATGGGGC TGGACATCGA CCTGTTCCGG TTCTCGCGGC GCACGATCTA CATGATCGTG
CTGTCCTCGA TCCTCATCTG GCTCAGCATG TCGCGCGGGA GCGGGCGAAC CGTGGCGTTT
GCCGAGCCTC CGGGCATTCC GGGCGAACGA CGCGACCTGC TGCTTGACAG CGCGCTCCGG
GCGGCGGCGG CAACCCTCGG TGCCGAGCAG GTCGCCCTGG CCTTCGCGGC GAGCGAAGAA
CCGTGGATCG AAATGCGGCG GCTCACGGCA AAGGGGATCG TGACCGAACG GCTTGCCCCC
GAGGGCCTTG CCGACGAACT CCTCGGTCGG CGCTCGACCG CAATCTTCAG CCGCTCGCGA
AAGCGCCAGC TCGCTCTGGC CGAAGACGGT GGGCTCGAGG CACTGGACGG GCCGGTCGAA
TCCCGCCTTG CCGACCTGCT CGGGGCAGAC GAAGGCATCA TCGCCTCAGT GCAGAGCGTG
ACCGGATCGG GCCATCTCGT CGCCTGGGAC CACAAGAACC TGAGCTTCGA CGACATCGCG
CTGGTGCGGG CCCTGGCGGA CGAGATCGGG CACGCGCTGG ACCGCGAGGA AATGGCGCGG
CTGGCCCGCA GCGCCGCCGA GACCGGGGTT CGCCAGGCCG TCGCCCGAGA CCTGCACGAC
AGCGTGGCAC AGTTCCTCGC AGGGACGTTG TTCCGCCTGG AAGCGCTGCG CCGCTGGATC
CGCGAGGGCA ACGATCCCGA GGGCGAGATA GATTCCATCA AGAATGCCCT GCGCCGCGAG
CAAGGACAGT TGCGCCTCCT GATCCAGCGC CTGCGTCGCG GGCAGGAAGG CGACCGGCGC
ACCGAGATCG GCGAGGAACT GCGCGATCTG CTGGCCGAGG CCAGCGGCCA TTGGCACATC
GAGACCGAGC TGGTCATGGA CCAGCGCCCC CTGCCGGTTT CCGTACAGCT CAGCCACGAG
ATCAGGCAGC TCGTGCGCGA GGCAGTGGCC AACGCGGCGC GCCATGGCAA ATGCGGAAAA
GTGAAGGTCG AGCTTTCGGA ACAGGCCGGC CACCTCAACC TTGCCATTAC CGATGACGGC
AAAGGGTTTC CCCAGATCCC CAATGCCCCC CGCCCCCGCT CGATCAGCGA ACGCGTCGAG
GCACTGGGCG GTCGCTTCGA ACTTCACAGC GACGGGGCCG GAACGCGCCT CGCCATTGCC
CTAAGATCCG GAGAACCTGC ATGA
 
Protein sequence
MSAKGWTRRT GRVIATGRAV LALTFLLALW LDPVVPVRGV TLGYALVSSY LLWTALLMII 
AWRSWWYDFR LAPVAQVVDI LMFLSAVYFT ESPFTEFQSP FLAFAAFMQV SAMVRWNWRA
TALTALVLLS TNLAFGITLY QMGLDIDLFR FSRRTIYMIV LSSILIWLSM SRGSGRTVAF
AEPPGIPGER RDLLLDSALR AAAATLGAEQ VALAFAASEE PWIEMRRLTA KGIVTERLAP
EGLADELLGR RSTAIFSRSR KRQLALAEDG GLEALDGPVE SRLADLLGAD EGIIASVQSV
TGSGHLVAWD HKNLSFDDIA LVRALADEIG HALDREEMAR LARSAAETGV RQAVARDLHD
SVAQFLAGTL FRLEALRRWI REGNDPEGEI DSIKNALRRE QGQLRLLIQR LRRGQEGDRR
TEIGEELRDL LAEASGHWHI ETELVMDQRP LPVSVQLSHE IRQLVREAVA NAARHGKCGK
VKVELSEQAG HLNLAITDDG KGFPQIPNAP RPRSISERVE ALGGRFELHS DGAGTRLAIA
LRSGEPA
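
The protein sequence can be cross-checked against the gene sequence by translating it under translation table 11. The following is a rough sketch, not the database's own procedure. It builds the codon table from the standard NCBI amino-acid string, whose codon assignments table 11 shares with the standard code. It treats ATG, GTG and TTG as initiators that encode methionine at the first position, which is an assumed subset, since table 11 permits further start codons. The names `translate` and `INITIATORS` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed by table 11.
INITIATORS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in INITIATORS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 24 bases of the gene sequence above.
print(translate("GTGAGCGCGA AGGGATGGAC ACGCAGGACC"))  # MSAKGWTRRT
print(translate("CTAAGATCCG GAGAACCTGC ATGA"))        # LRSGEPA
```

The two fragments reproduce the first ten and last seven residues of the protein sequence, and the GTG start codon is read as M.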