Gene Information |
Locus tag | Saro_2666 |
Symbol | |
ID | 3918440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2904622 |
End bp | 2905962 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640445443 |
Product | signal transduction histidine kinase |
Protein accession | YP_497936 |
Protein GI | 87200679 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3920] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
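
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends with a single stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # the stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the record above (Saro_2666).
gene_length, protein_length = cds_lengths(2904622, 2905962)
print(gene_length, protein_length)
```

With the record's Start bp and End bp this gives 1341 bp and 446 aa, matching the Gene Length and Protein Length fields.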
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.653198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTGGCCG AAGCGCATCT CTCTGCCATA ATACAATCAT CTGATGACGC GATCATAAGC AAGGACCTCT CGGGCACGAT CCTGAGCTGG AACCCCGCCG CCACGCGCAT CTTCGGATTC TCCGAAGCGG AGATGATCGG CCATTCCGTC CGCCGCCTCA TTCCGGCGGA GCGGCAGGCG GAAGAGGACG ACATCCTCGC GCGCATCGCC CGTGGCGAGC GGGTGAAGAG CTTCGACACG ATGCGGCAGC GAAAGGACGG GGTCCAGATC GCGGTCTCGA TCACCGTCTC GCCGGTCTAC GACAAGGCGG GCCGCATCGT CGGGGCCAGC AAGATTGCCC GCGACATCAC GTCGCGCGAG GAAGGCCAGC GAGCCCTGCG CGAGAGCGAG GCCCGCTTTC GCATGCTGGC CGACAACATC TCGCAGCTCA CTTGGGTGGC CGACCGCACG GGCGCCATCG GCTGGTATAA CAAACGCTGG TACGACTACA CCGGGGTGCC GCACGGTTCG ACCGATGGCT GGGGCTGGGA TCGCGTGCAC CATCCCGACC ATCTCGAACG GGTGCGCGAG CATTTTGCCG AGAGCATTGC TGCGGGACGC GAATGGGAGG ACACCTTCCC GCTTCTCGGC CGCGACGGGA CCTACCGCTG GTTCCTGTCG CGCGCGAAGC CGATCCGGGG CGAGGATGGC GGGATCGTCT ACTGGTTCGG CACCAATACC GACGTGACCG AGATGCTCGA GAAGGAAGAG CAGATCCGCG TCCTGCTGAT GGAAGTGAAC CACCGCTCGA AGAACCTGCT CTCGGTCGTC CAGGCGCTGG CCCGGCGGTC TGGCGGAGGC GATCCCGAGT TCCTGCGCCG TTTCGAGAAC CGTCTCGCCA GCCTTTCTGC CAACCAGGAC CTGCTGGTGC GGCGCGGTTG GTCGACGATC ATGATGGACG AGCTGGCCGA CGCCCAGCTC GCGATCCTCG GCCGCGACAG CCGCGAACAG GTCCTGACGC AAGGCCTGTC CCTGGCCCTG AGCCCCCGCA GCGCCGAGAT CATCGGCATG GCGCTGCACG AGCTGGCAAC CAACGCGCTC AAGTACGGGG CGCTCAGCGT GCCGACCGGC CGCGTTTCGC TGTCATGGGA GGAGACACCG GACGGGCATT TCCAGATCGA CTGGCGCGAA AGCGGCGGCC CAGCCGTGCG CGACCCGAAG CAGCACGGCT TCGGGACAAC GCTCATCCGC CATATTCCGG CGCGCAGCCT CCACGCAGAC GTCACGCTCG ACTACGCGCC CGCAGGCCTG CGCTGGCAAT TGCGCTGCAC CAGCGCGACG GCGCGGACCC TTTCGAGTTA G
|
Protein sequence | MLAEAHLSAI IQSSDDAIIS KDLSGTILSW NPAATRIFGF SEAEMIGHSV RRLIPAERQA EEDDILARIA RGERVKSFDT MRQRKDGVQI AVSITVSPVY DKAGRIVGAS KIARDITSRE EGQRALRESE ARFRMLADNI SQLTWVADRT GAIGWYNKRW YDYTGVPHGS TDGWGWDRVH HPDHLERVRE HFAESIAAGR EWEDTFPLLG RDGTYRWFLS RAKPIRGEDG GIVYWFGTNT DVTEMLEKEE QIRVLLMEVN HRSKNLLSVV QALARRSGGG DPEFLRRFEN RLASLSANQD LLVRRGWSTI MMDELADAQL AILGRDSREQ VLTQGLSLAL SPRSAEIIGM ALHELATNAL KYGALSVPTG RVSLSWEETP DGHFQIDWRE SGGPAVRDPK QHGFGTTLIR HIPARSLHAD VTLDYAPAGL RWQLRCTSAT ARTLSS
|
| |
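
The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted alternative start codon and is read as methionine when it initiates translation. The following is a minimal sketch of that rule, not the database's own pipeline; the helper `translate_cds` is a hypothetical name, and the codon table is built from the standard NCBI amino-acid string for table 11.

```python
from itertools import product

# NCBI translation table 11: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; each is read as M at the first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS, ignoring spaces and stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCTGGCCG AAGCGCATCT CTCTGCCATA"))  # MLAEAHLSAI
```

The output matches the first ten residues of the protein sequence in the record; applying the same function to the full gene sequence should, under the same assumptions, reproduce the full 446-residue protein.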