Gene Saro_1842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1842 
Symbol 
ID3918402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1941856 
End bp1943274 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content68% 
IMG OID640444584 
Producttwo component, sigma54 specific, Fis family transcriptional regulator 
Protein accessionYP_497116 
Protein GI87199859 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.479539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGAGT CCGAACAGCG CCTTCTCATG CTGATCGACG ATGAACCCGC GCAGTGCCGC 
CTGATTTCGG CGCTGGCATC GCGGGAGGGG TGGCGCACGA TCATCGCGCG CGATTCCGAA
AGCGCGATCG CCACGCTCGG CACCCGCCAG GGCATGCAGC TCGGCGCGAT CATCCTCGAC
CAGTGGGTGC CAGGCGACGA TGCCTGCACC CTTATCGCCG AACTCAAGGC GCGGCGCCCC
GCCCTGCCGA TCCTGATGCT GACGACAAGC AGTTCGCCGC TTCTCGCGGT CGAAGCGATG
CGCGCCGGGG CCACCGACTA TCTCATCAAG CCGGTCGCAC CCGAGCGCCT GCTCCAGGCG
TTGCGCAGCG CCACCTCGCG CGAGACCAGC GCGAGCGAGT TGCAACCGCT GACCGAGAAG
ATCGGCGCGA CGCTCGACTT CGATTCGATG ATCGGCGCCT CGCCCGCGTT CCGGGCCGCG
CTGGCTGTCG CGGCAAAGGC CGCTCGCACG CACGGCACCG TCCTGATCGA AGGCGAAAGC
GGCACCGGCA AGGAAATGCT CGTCCGCGCC ATGCACGCCG CTAGCCCACG CGCCAAGGCG
CCCTTGCGCA TCGTCAATGC CGGCGGCACT TCGGCCAGCC AACTCGAATC CGCCCTGTTC
GGGCATGAAA AAGGCGCCTT CCCCGGCGCA TTCGAACGCA ACATCGGCAT GTTCCAGCAT
GCAGACGGCG GCACGCTGGT GATCGACGAG GTCGACCGAC TTCCAGCGTC CGTCCAGGAA
CGGCTCGTGC GTTTCCTCAC GCGGGGGGAC ATCCAGCCCG TGGGCGCGCG CCACTCGTTC
CGGGTCGATG TACGCCTCAT CGTCTGCGCC AACGCGGGCT TGCGCGACCT CGTCCATATC
GGCGATTTCG ACGCTGACCT TCATGCCCTG CTGACGCAGA CGCAGGTCCA GCTTCCGTCC
TTGCGCGAAC GGCCCGCCGA CATCCCGGCT CTTGCCCGCC ATTTCCTGGC GCGCATCGGC
GAGCAGCCGG GGCTCAGGCC GCTCGGCATC ACCGACGGCG CCCTCGCCCT GCTTGCCGCC
TACGACTGGC CAGGCAATGT TCGGCAATTG CAGGCCACAC TGTTCCGCGC CGCCGTGTTC
TGCGACGGCG AGGCGCTGAC CGCGCAGGAT TTCCCGAGCC TTTCGAACAT GATCGGCGAA
GGAAGCCGCC GCGCGCCCAG CGTTACGGAC GGAGCCGGCA TCACTCTGTT CGCGGCCGAC
GGAAACCTGC GCCCGCTGGA AGACATCGAA GCGGACGTCA TCCGCCTGGC CATCGGCCAC
TATCGCGGCC GCATGACCGA GGTCGCCAGA AGGCTCGGCA TCGGGCGCTC GACGCTCTAC
CGCAAGCTGT CCGAGCTCGG GATCGACAAC GCGGCCTGA
 
Protein sequence
MQESEQRLLM LIDDEPAQCR LISALASREG WRTIIARDSE SAIATLGTRQ GMQLGAIILD 
QWVPGDDACT LIAELKARRP ALPILMLTTS SSPLLAVEAM RAGATDYLIK PVAPERLLQA
LRSATSRETS ASELQPLTEK IGATLDFDSM IGASPAFRAA LAVAAKAART HGTVLIEGES
GTGKEMLVRA MHAASPRAKA PLRIVNAGGT SASQLESALF GHEKGAFPGA FERNIGMFQH
ADGGTLVIDE VDRLPASVQE RLVRFLTRGD IQPVGARHSF RVDVRLIVCA NAGLRDLVHI
GDFDADLHAL LTQTQVQLPS LRERPADIPA LARHFLARIG EQPGLRPLGI TDGALALLAA
YDWPGNVRQL QATLFRAAVF CDGEALTAQD FPSLSNMIGE GSRRAPSVTD GAGITLFAAD
GNLRPLEDIE ADVIRLAIGH YRGRMTEVAR RLGIGRSTLY RKLSELGIDN AA