Gene Saro_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2047 
Symbol 
ID3917694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2185739 
End bp2187115 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content64% 
IMG OID640444799 
Producttwo component, sigma54 specific, Fis family transcriptional regulator 
Protein accessionYP_497320 
Protein GI87200063 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR02915] putative PEP-CTERM system response regulator 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAG CGAAGCCCGG ACAGCGACCG GCACTGCTGA TCGTCGAGGA CGATCCCGGT 
CTGCAGGCGC AGTTGAAATG GGCGTACGAG GATTTCGACG TCTTCATCGC GGGCGACAGG
GTAAGCGCGC TGACCCTGCT ACGTTCGGTG GAACCGGCGG TCGTGACCCT CGACCTCGGG
TTGCCGCCCG ATCCTGACGG AACCACCGAG GGCTTTGCCG TGCTCGACGA GATCATGGCC
CTGCGCCCCG ACACCAAGGT GATCGTCGCA AGCGGCCACG GTGCCCGCGA AAGCGCGCTC
AAGGCCATCG AGAAGGGGGC GTACGACTTC TACCAGAAGC CGGTGGACAT CGATGCGTTG
GGCCTGATCG TTCGCCGTGC GCTGCACCTT TCGCGGATCG AGTCCGAAAA TCGCCATCTC
GCGACCCGTG CAAGCACCGA CAACAGGGTG CTGGGGCGCA TGATCACCGC GGCACCCGAG
ATGATCAAGG TGGCCCGCAC AATCGAGCGC GTCGCCAATA CCAGCGTCTC GGTGATGCTG
CTGGGCGCGA GCGGCACCGG CAAGGAACTG TTGGCGCGCG GCCTGCATGA TGCGTCCGGA
CGCGCACGCG GATCGTTTGT CGCGATCAAC TGCGCGGCCA TTCCGGAGAA TCTGCTCGAA
AGCGAACTGT TCGGGCACGA GAAGGGAGCG TTTACCGGAG CGGTCAAGAC GACCGAGGGC
AAGATCGAAC AGGCCAGCGG CGGCACGCTG TTCCTCGACG AAGTGGGCGA CATTCCGCTC
CAGCTTCAGG TCAAGCTGCT GCGCTTCCTG CAGGAACGGA CGATCGAGCG CATCGGCGGG
CGAAAGTCGA TCGAGGTCGA TACACGCATC GTCTGCGCCA CGCACCAGAA CCTCGAGGCC
ATGATTGCCG ATGGGCGGTT CCGCGAGGAC CTTTACTATC GCCTCGCGGA AATCGTTGTG
CGCATTCCCA GCCTGGCGGA GCGCCCCGGC GATGCGACGC TTCTCGCCAA GACCTTTCTC
ATGCGCTTTG CCAAGGAGAT GAACCCGCAG GTCAAGGGCT TCGCGCCGGA TGCGCTGGCG
GCGATCGACA GCTGGAACTG GCCCGGAAAC GTCCGCGAGC TGGAGAACCG CGTCAAGCGT
GCGGTCATCA TGGCCGACGG CAGGCTGGTT ACCGCAACCG ATCTCGACCT GCCGGGAAAT
GCGGACGAGG AATCATCGCC GCTCAACCTG AAGACCGCGC GCGAAGCGAC TGACCGCAAG
GTCATCCGCC ACGCGCTCGC CCGCAGCGAA GGCAACATCT CCAGCACCGC GCGCCTGCTC
GGCATCAGCA GGCCGACGCT TTATGATCTG CTCAAGCAGT ACGACCTCCA GAACTAG
 
Protein sequence
MSEAKPGQRP ALLIVEDDPG LQAQLKWAYE DFDVFIAGDR VSALTLLRSV EPAVVTLDLG 
LPPDPDGTTE GFAVLDEIMA LRPDTKVIVA SGHGARESAL KAIEKGAYDF YQKPVDIDAL
GLIVRRALHL SRIESENRHL ATRASTDNRV LGRMITAAPE MIKVARTIER VANTSVSVML
LGASGTGKEL LARGLHDASG RARGSFVAIN CAAIPENLLE SELFGHEKGA FTGAVKTTEG
KIEQASGGTL FLDEVGDIPL QLQVKLLRFL QERTIERIGG RKSIEVDTRI VCATHQNLEA
MIADGRFRED LYYRLAEIVV RIPSLAERPG DATLLAKTFL MRFAKEMNPQ VKGFAPDALA
AIDSWNWPGN VRELENRVKR AVIMADGRLV TATDLDLPGN ADEESSPLNL KTAREATDRK
VIRHALARSE GNISSTARLL GISRPTLYDL LKQYDLQN