Gene Saro_1928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1928 
Symbol 
ID3917151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2040075 
End bp2041508 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content61% 
IMG OID640444674 
Producttwo component, sigma54 specific, Fis family transcriptional regulator 
Protein accessionYP_497202 
Protein GI87199945 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR01818] nitrogen regulation protein NR(I) 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGC GTGTACTTCT CGTCGAGGAC GATGCTTCGA TTGCCCTGGT CATTACCGCG 
GCACTTGAAG CTGAAGGCTT CACTGTCGAT CGCTGCGATT CGATCGCTGG ACGGGACCGT
CTGCTTTCAG CGCAGACCTA CGACCTCCTG CTGACCGATG TCATGCTGAC TGACGGTGAC
GGCATCGAAA CGCTGGGCCC TGTGCGCGAG GCGCATCCCA CGCTTCCGAT CATCATCCTT
TCAGCACAGA ACACCCTCGA TACAGCCGTC AGGGCGAGCG ACACAGGCGC ATTCGAATAC
TTTCCCAAGC CCTTCGATCT GGAAGAACTG GTCCGCACCG TAACCCAGGC CATCGGCAAT
GCCGGCGGGG TTGGTGCCGA ACTGCCACAG GATGTGCCGC AGGGCCTCCC GCTCGTTGGG
CGAAGTTCCG CCATGCAGGC CGTGTATCGG ATGATCACGA GGGTTCTGCG CAATGATCTG
ACAGTGCTGA TTCTCGGAGA GTCCGGCACC GGCAAGGAGC TCGTGGCAGA GGCAATTCAC
CAGCTCGGCA ACCGCCGGTC GGGGCCTTTC GTCGCAGTGA ATACCGCCGC CATTCCCGCA
GAACTGATCG AAAGTGAGCT GTTCGGGCAT GAAAAAGGCG CCTTCACCGG TGCCGTAGCG
CGATCCATCG GCAAGTTCGA ACAGGCCAGC GGCGGGACCC TGTTCCTCGA CGAAATCGGC
GACATGCCCA TGCAGGCCCA GACCCGTTTG CTGCGGGCTT TGCAATCAGG CCGGATTCGA
CGGGTTGGCG GGCGTGAGGA AATCATCCTC GACTGCCGCA TCGTTGCCGC GACAAACCGC
GATCTCCTGC CGATGATCGC GGCGGGGACA TTCCGCGAGG ACCTCTACTA CCGCCTCGCC
GTCGTACCGA TTGAACTGCC CCCGCTGCGG GAACGGGCAG ATGATATTCC AGCGTTATCG
CAGCATTTCC TCGCCAAGGC AGCCCTCGAA GGTCTGCCAC GACGCCAACT TACACAAGCG
GGCGCGGACC TTCTGTCCCG CCAGCCCTGG CGAGGCAACG TCCGCGAACT GCGCAATTTC
GTATACCGCC TTGCACTGCT GGCACGTGAC GAAGTGATCG ATGCCTCGAC CATCGAGCCA
CTTCTGGCGC AAGAAGCCAC GGGGGCGGCG CGTTCATCCG AATCGGACGA AAGGCGACCA
TCCGATCTTG CCTCTGCAGT GGCCGCGTGG CTGTCGGCGC AGAACCTCCA GCCGGGCGAG
GTCTATGATG CAGCGCTTGC CGCATTTGAA CGACCTCTGT TCCTCCAGAT CCTTGCGCTG
ACTGGCGGGA ACCAGCTTCG TGCCGCCCAA ATACTTGGTA TCAATAGAAA TACTCTGCGC
AAACGGCTTT CCGACCTGAA TATCACACCC GACGAGTTCG CCAGTCGCGA TTAG
 
Protein sequence
MSKRVLLVED DASIALVITA ALEAEGFTVD RCDSIAGRDR LLSAQTYDLL LTDVMLTDGD 
GIETLGPVRE AHPTLPIIIL SAQNTLDTAV RASDTGAFEY FPKPFDLEEL VRTVTQAIGN
AGGVGAELPQ DVPQGLPLVG RSSAMQAVYR MITRVLRNDL TVLILGESGT GKELVAEAIH
QLGNRRSGPF VAVNTAAIPA ELIESELFGH EKGAFTGAVA RSIGKFEQAS GGTLFLDEIG
DMPMQAQTRL LRALQSGRIR RVGGREEIIL DCRIVAATNR DLLPMIAAGT FREDLYYRLA
VVPIELPPLR ERADDIPALS QHFLAKAALE GLPRRQLTQA GADLLSRQPW RGNVRELRNF
VYRLALLARD EVIDASTIEP LLAQEATGAA RSSESDERRP SDLASAVAAW LSAQNLQPGE
VYDAALAAFE RPLFLQILAL TGGNQLRAAQ ILGINRNTLR KRLSDLNITP DEFASRD