Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1928 |
Symbol | |
ID | 3917151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2040075 |
End bp | 2041508 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640444674 |
Product | two component, sigma54 specific, Fis family transcriptional regulator |
Protein accession | YP_497202 |
Protein GI | 87199945 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains |
TIGRFAM ID | [TIGR01818] nitrogen regulation protein NR(I) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAGC GTGTACTTCT CGTCGAGGAC GATGCTTCGA TTGCCCTGGT CATTACCGCG GCACTTGAAG CTGAAGGCTT CACTGTCGAT CGCTGCGATT CGATCGCTGG ACGGGACCGT CTGCTTTCAG CGCAGACCTA CGACCTCCTG CTGACCGATG TCATGCTGAC TGACGGTGAC GGCATCGAAA CGCTGGGCCC TGTGCGCGAG GCGCATCCCA CGCTTCCGAT CATCATCCTT TCAGCACAGA ACACCCTCGA TACAGCCGTC AGGGCGAGCG ACACAGGCGC ATTCGAATAC TTTCCCAAGC CCTTCGATCT GGAAGAACTG GTCCGCACCG TAACCCAGGC CATCGGCAAT GCCGGCGGGG TTGGTGCCGA ACTGCCACAG GATGTGCCGC AGGGCCTCCC GCTCGTTGGG CGAAGTTCCG CCATGCAGGC CGTGTATCGG ATGATCACGA GGGTTCTGCG CAATGATCTG ACAGTGCTGA TTCTCGGAGA GTCCGGCACC GGCAAGGAGC TCGTGGCAGA GGCAATTCAC CAGCTCGGCA ACCGCCGGTC GGGGCCTTTC GTCGCAGTGA ATACCGCCGC CATTCCCGCA GAACTGATCG AAAGTGAGCT GTTCGGGCAT GAAAAAGGCG CCTTCACCGG TGCCGTAGCG CGATCCATCG GCAAGTTCGA ACAGGCCAGC GGCGGGACCC TGTTCCTCGA CGAAATCGGC GACATGCCCA TGCAGGCCCA GACCCGTTTG CTGCGGGCTT TGCAATCAGG CCGGATTCGA CGGGTTGGCG GGCGTGAGGA AATCATCCTC GACTGCCGCA TCGTTGCCGC GACAAACCGC GATCTCCTGC CGATGATCGC GGCGGGGACA TTCCGCGAGG ACCTCTACTA CCGCCTCGCC GTCGTACCGA TTGAACTGCC CCCGCTGCGG GAACGGGCAG ATGATATTCC AGCGTTATCG CAGCATTTCC TCGCCAAGGC AGCCCTCGAA GGTCTGCCAC GACGCCAACT TACACAAGCG GGCGCGGACC TTCTGTCCCG CCAGCCCTGG CGAGGCAACG TCCGCGAACT GCGCAATTTC GTATACCGCC TTGCACTGCT GGCACGTGAC GAAGTGATCG ATGCCTCGAC CATCGAGCCA CTTCTGGCGC AAGAAGCCAC GGGGGCGGCG CGTTCATCCG AATCGGACGA AAGGCGACCA TCCGATCTTG CCTCTGCAGT GGCCGCGTGG CTGTCGGCGC AGAACCTCCA GCCGGGCGAG GTCTATGATG CAGCGCTTGC CGCATTTGAA CGACCTCTGT TCCTCCAGAT CCTTGCGCTG ACTGGCGGGA ACCAGCTTCG TGCCGCCCAA ATACTTGGTA TCAATAGAAA TACTCTGCGC AAACGGCTTT CCGACCTGAA TATCACACCC GACGAGTTCG CCAGTCGCGA TTAG
|
Protein sequence | MSKRVLLVED DASIALVITA ALEAEGFTVD RCDSIAGRDR LLSAQTYDLL LTDVMLTDGD GIETLGPVRE AHPTLPIIIL SAQNTLDTAV RASDTGAFEY FPKPFDLEEL VRTVTQAIGN AGGVGAELPQ DVPQGLPLVG RSSAMQAVYR MITRVLRNDL TVLILGESGT GKELVAEAIH QLGNRRSGPF VAVNTAAIPA ELIESELFGH EKGAFTGAVA RSIGKFEQAS GGTLFLDEIG DMPMQAQTRL LRALQSGRIR RVGGREEIIL DCRIVAATNR DLLPMIAAGT FREDLYYRLA VVPIELPPLR ERADDIPALS QHFLAKAALE GLPRRQLTQA GADLLSRQPW RGNVRELRNF VYRLALLARD EVIDASTIEP LLAQEATGAA RSSESDERRP SDLASAVAAW LSAQNLQPGE VYDAALAAFE RPLFLQILAL TGGNQLRAAQ ILGINRNTLR KRLSDLNITP DEFASRD
|
| |