Gene Saro_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3089 
Symbol 
ID3916704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3309817 
End bp3311067 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content63% 
IMG OID640445872 
ProductLuxR family transcriptional regulator 
Protein accessionYP_498358 
Protein GI87201101 
COG category[K] Transcription 
COG ID[COG2771] DNA-binding HTH domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGAGC GGCACCTACG ATTCCCTGCG CTTCTGTCGT GTCGCAATGG CCAGGCTGGG 
CAGCACGCAT ACTGCTTGCC GGGCAATTTT CGGGAGACTA CTCACGATCT CATGTCCGAT
CTTCCCCTTC TGTCCGGAAG GCCCCTGCCC GACGACGTTG AGGACCTTGC TGACCTGCTT
TTCAGCGGCC TGGCCGAGCG GCCACTTTGG AGCAGCTTTC TGATCCGGAT CGCCCGGAGA
CTGAAAGCGG ATGCAGCCGC GTTTGTAATC TCTTCGCGCG GGCACAAGGC GACCGACAGC
GCTGTTCTCG TTCCCGATGG TCAGGAAGCC GAGCCTTTCG CCCGGTTGAT CGAGCTTGAG
ACTTTCGCCG ACGTGGACTT CGACCGTCCG CAGATTCTTG TCGGGCGCAA CGAGAATTGT
CCTGCGGGCG AGCATGTCGT GCTGCGGCTG CGGTTCGATG GTGACCGATC CGTCTGGATG
ATCTGTTCGA CCCAGGCTTC TGGTGCAGCC ATGCTGGTCG CGGATTGGCA GGAGGTCCTG
CTGGCGCTGC TCCCGCTGTT GCAGCGAGTT GTCCGGCTTT ATCTCGCGAT TGGCGAGAGC
GAGCGACAGC GAAGAATTGC CGAGTACGTG CTTGAGACCA GCGGCGTGGG AGTGATCCTG
GTTGACAGTG CGGGGTCTGT GGTCACGGTC AACGCGGCGG CCGAGGCGAT CATGGCCCAA
ACACACGTAT TGCATATTCA TGGCGGGCAG TTGCATGCCC AGCGCCAGAC CGACCAGCAG
CTACTGCTTC GACACATCCG TGAAAAGTCC GAGCAGCAAA GCGCGAACGA TGCCGTTCCA
GGTTGTTATG CCGCATTTGC GCTGCTGCGT GACGATCATC CTCTGCCTGT CACAGTGATG
GTCCGTCCAG GGCCGCCATT CGGTCCGGTA TCCGCGCCAC TGCGCCGGAC CGCCACGGTT
ATCCTGCGCG ACCCGGCGCG GCGGCTCGGT CTGGCCAGTC CAGATCTGGA GCAACTGTTC
GGCCTCAGCC CGGCCGAAGC CCGGCTGGCC CAGTTGCTCG CTGATGGCCT CAGCACCGAA
GAGGCCGCGC TGCAGTTGGG GGTCAGCCGC AACACCGTGC GTTCCCAGCT CCAGGCCGTG
TTCGCGAAAA CCGGAACCAA CCGGCAGGGT GATCTGGTGC GCCTGTTGCT GAGTTCCGCC
GCCACGCTGA CCCAGCGTAG CGGGGAGGTG CCCTCGACGA CCAGGAGGTG A
 
Protein sequence
MVERHLRFPA LLSCRNGQAG QHAYCLPGNF RETTHDLMSD LPLLSGRPLP DDVEDLADLL 
FSGLAERPLW SSFLIRIARR LKADAAAFVI SSRGHKATDS AVLVPDGQEA EPFARLIELE
TFADVDFDRP QILVGRNENC PAGEHVVLRL RFDGDRSVWM ICSTQASGAA MLVADWQEVL
LALLPLLQRV VRLYLAIGES ERQRRIAEYV LETSGVGVIL VDSAGSVVTV NAAAEAIMAQ
THVLHIHGGQ LHAQRQTDQQ LLLRHIREKS EQQSANDAVP GCYAAFALLR DDHPLPVTVM
VRPGPPFGPV SAPLRRTATV ILRDPARRLG LASPDLEQLF GLSPAEARLA QLLADGLSTE
EAALQLGVSR NTVRSQLQAV FAKTGTNRQG DLVRLLLSSA ATLTQRSGEV PSTTRR