Gene Saro_3839 details


Gene Information

Locus tag: Saro_3839
Symbol:
ID: 5077450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 6095
End bp: 7735
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 62%
IMG OID: 640480949
Product: sigma-54 dependent transcriptional regulator
Protein accession: YP_001165611
Protein GI: 146275450
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains
TIGRFAM ID:
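
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS includes its stop codon; the record does not state either convention, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 6095
end_bp = 7735
gene_length_bp = 1641
protein_length_aa = 546

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes the stop codon, so it has one more codon
# than the protein has residues.
codons, remainder = divmod(span_bp, 3)

print(span_bp == gene_length_bp)        # True
print(remainder == 0)                   # True
print(codons - 1 == protein_length_aa)  # True
```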


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.719613
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGACAC TTCCAGGCTA CGCCGACATT GCCAGCAAGC TGCAGTTCTC ACCGGAAATG 
GGCCGGATCT GGCACGATGG TGAACGCTGC GTACTGCTCA GCCAAGCGGC TCTGGCCAGC
TGGCGCAGCC GCCTAGTTGC CGAATTGGGG CACGAAGCGG CCAGCCGGTT CTTCTGGGGT
GTCGGTTTTG CCGAAGGCGC ACGGTGCGCG ATCGGCGCGA AAAAATTACG CCCGGATGGT
GATTATCTTG AAGCATTCGC GGTCGGACCG CAGGCGCACG CCTTGACCGG GTTTGGCTGG
ACCCAGATCG AAATCCTGGA AAATGATTCC TCGCGAGGGC ATTTCGAAGG CCGGTTCAGG
GTCCACGATT CGATCGAGGC GGCGATCCAT CTCAGCGCGA CCGGGTACAG CACTGACCCG
GTGTGCTGGA TGCAGACTGG CTTTGCCAGC GGCTTTGCCA CAACCTTCGC GGGACAACCG
ATCATCATGC GCGAAGTGGA ATGTGCGGGG CGAGGCGATG CCGCCTGCGT ACTTCACGCC
AAACCAAAAC CCGAATGGGA TCAGTTGGAC AACCTCGAAT TGGCCGCGAC GCCGCTGGTC
CTGCCCGAAA CGCGGCATGA TGGGGGCCAA ACGGTGATTG GGATCTCGGC AGGGTTCCTC
TCAGCCAAGA CCATGATCGA ACGCACCGCG TCGAGCAATG CGACGCTGCT GTTAATGGGC
GAGACCGGGG TCGGCAAGGA AGTGCTGGCC AAGCTCGCTC ATCGGCTGAG CATGCGCGAG
GCGGAGCCGT TCATCGCGCT CAACTGCGCG GCGATCCCCG AGGGGTTGAT CGAATCGGAG
CTGTTCGGCG TGGCCAAGGG GGCCTATACT GGCGCGGTTG CCGCCCGGCC CGGTCGGTTC
GAGCTGGCCA ATGGCGGCAC GCTGTTCCTG GACGAGATTT CGACACTGTC GCCACTTGCA
CAATCGAAGA TTCTGCGCGC TGTCCAGGAA GGTGAGTTCG AGCGGGTCGG CGATACGCGG
ACGATCAAGG TGGATGTCCG ATTGATCGCA GCATCCAATG TGGAACTTAA CGAGGCGGTG
CGGGAAGGCA CGTTTCGTGC CGACTTGTTC TACCGGATAT CGACTTTGCC GGTCCGGGTG
CCGCCGCTGC GCCAGCGCCG CGAGGACATT CCGGTGCTGC TGGAACACTT CCGTCTGCAT
TATGCCATGC GCCATGGCCG AACGGTATCG GGCTTCACCC CGCGGGCGAT CAACGCGCTG
CTGGTGTACG ATTTCCCCGG TAATGTGCGC GAACTCGAGC GAATGGTGGA ACGTGCCGTG
CTGCTGGCGG ACGACGGGCG AGCGATCGAC GTCAGGCACC TGTTCCTCGA AACCGACGGT
CTCGAATTGA AGCCGACGAT GGGCATGACC AACGATGGCA GGATCAGCGC CGTTGACAAT
GTCGGCGATC GCGCCGGCTT GGTGCGCCAG ATGCTGGATC TGATCGTCGA TGGGGGCGGA
AGCCTGCTGG AGATGGAAGG GCTGGTGATC CGCGAGGCCC TTGATGCGAG TGGGGGCAAC
GTTGCGCGCG CGGCGCGCAC CTTAGGCTTT ACCCGTCGGC AGCTCGCCTT GCGTCTTGAA
AAGTTGGAAA TCCAGGAATA A
 
Protein sequence
MKTLPGYADI ASKLQFSPEM GRIWHDGERC VLLSQAALAS WRSRLVAELG HEAASRFFWG 
VGFAEGARCA IGAKKLRPDG DYLEAFAVGP QAHALTGFGW TQIEILENDS SRGHFEGRFR
VHDSIEAAIH LSATGYSTDP VCWMQTGFAS GFATTFAGQP IIMREVECAG RGDAACVLHA
KPKPEWDQLD NLELAATPLV LPETRHDGGQ TVIGISAGFL SAKTMIERTA SSNATLLLMG
ETGVGKEVLA KLAHRLSMRE AEPFIALNCA AIPEGLIESE LFGVAKGAYT GAVAARPGRF
ELANGGTLFL DEISTLSPLA QSKILRAVQE GEFERVGDTR TIKVDVRLIA ASNVELNEAV
REGTFRADLF YRISTLPVRV PPLRQRREDI PVLLEHFRLH YAMRHGRTVS GFTPRAINAL
LVYDFPGNVR ELERMVERAV LLADDGRAID VRHLFLETDG LELKPTMGMT NDGRISAVDN
VGDRAGLVRQ MLDLIVDGGG SLLEMEGLVI REALDASGGN VARAARTLGF TRRQLALRLE
KLEIQE
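
The protein sequence above follows from the gene sequence under translation table 11. As a minimal sketch, the Python below translates the first 30 and last 21 bases of the gene sequence and computes a GC fraction. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
# First 30 and last 21 bases of the gene sequence above, as displayed.
head = "ATGAAGACAC TTCCAGGCTA CGCCGACATT"
tail = "AAGTTGGAAA TCCAGGAATA A"

# Compact codon table in the usual T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate(head))  # MKTLPGYADI
print(translate(tail))  # KLEIQE
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the listed protein sequence and the GC content field.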