Gene Saro_1867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1867 
Symbol 
ID3917088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1968031 
End bp1968969 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content66% 
IMG OID640444611 
ProductAraC family transcriptional regulator 
Protein accessionYP_497141 
Protein GI87199884 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.122856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGCGC TTCTCAAGTT TCATACGCAG GACGTCGCTC CGCAGGACCG GGCGCGCTAC 
TGGAACGAGA TTGCCGACCG GGTCTTCACG GGCACGTTCG TCAACGTTCC GGGCGAGGAT
TTCAGCGGCC GGATGCTGTC GTGGCGCGTC GGCGAACTCG ACATGATCCG TACGGATTCG
ACCCATTCCG GGGTCGGCCG CACCCCCATC GCGCAGGACG ACGAAAGGCT GATCCTGCAT
CTGCAATGCC GCGGCACCAG CCAGCACATG CAGAAGCAGG CCGAATGCGC GCTCGAGCCG
GGCGACTTCG TACTGGCGAG CCCGCACATT CCCTATTCGA TCAAGCTGAC CGGGCACGAG
ATGCTGGTCG TCGAGTTCCC GCGCGCGCCC CTGGCGGAGC GGTTTCCCGG CGTGGACGAT
GCCTTGTTGC AGCGCATGTG CGGTGCGTCG CCCGGCGGAC GCGTGTTCCA CGACTTCCTG
CTTTCGTTGT GGCAGCAGGG TGAACGGGCC GCCGAAGACC CCGAATGGGA AGTCGGCGTG
AACGCGGTGT TCTATGACCT TGCGGCGATG GCGATGCGCG GGGCGCAGCG CCCGAACGCC
GAGGTTGGCG AGGCCGACTT GCGACGCAAG GTGCTGGCAA TGGTCTCCTC CAGCCTGGAG
GACCCCGCGC TGCGCACGGC ATCGATCGCC GATGCCTGCA ACATCTCGGT TCGCACGGTG
CAGAACGTGT TCGCGGCAAT GGGTACGACG CCGACCGCGT ACATTCTCGA GCAGCGCCTT
CGCCGCGCGG CGGACCGGCT CGTTGGAAGG CCCGACGCCA GCATCACGGA GATCGCCTTC
GAACTGGGCT TCAACGACAG CGCCTACTTC ACGCGGTGTT TCCGCCAGCA GTTCGGCGCG
GCGCCGCGCG ACTGGCGATT GGGAAGGATG TCATCATGA
 
Protein sequence
MGALLKFHTQ DVAPQDRARY WNEIADRVFT GTFVNVPGED FSGRMLSWRV GELDMIRTDS 
THSGVGRTPI AQDDERLILH LQCRGTSQHM QKQAECALEP GDFVLASPHI PYSIKLTGHE
MLVVEFPRAP LAERFPGVDD ALLQRMCGAS PGGRVFHDFL LSLWQQGERA AEDPEWEVGV
NAVFYDLAAM AMRGAQRPNA EVGEADLRRK VLAMVSSSLE DPALRTASIA DACNISVRTV
QNVFAAMGTT PTAYILEQRL RRAADRLVGR PDASITEIAF ELGFNDSAYF TRCFRQQFGA
APRDWRLGRM SS