Gene Saro_1320 details


Gene Information

Locus tag: Saro_1320
Symbol:
ID: 3917769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1363722
End bp: 1364750
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 68%
IMG OID: 640444057
Product: LacI family transcription regulator
Protein accession: YP_496598
Protein GI: 87199341
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
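
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues, assuming the final codon of the CDS is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1363722, 1364750)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the sketch reproduces the listed 1029 bp and 342 aa.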


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.593334
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAGCG ACAGGAAGAT CATCCGCGTA ACGTCGTTCG ACGTGGCCGA AGCGGCGGGC 
GTCAGCCAGT CCACCGTCAG CCGCGCACTG GCAGGCGACA CGTCGATCAG CGAGCCGACG
CGCCAGCGCG TGATCGAAGC GGCACGCCGC CTGAACTACC AGGTGGACGA GAACGCGGCA
CGCCTGCGCC GGGGCCGCAC CGGCACACTC GCGGTCGTGA TGATCTGCCG CGAGGCGCAG
GACCGCAAGG ATATCAACCC GTTCTACTTC TCGCTCCTGG GCAGCACCTG CGCGGCTGCA
TCGGCGCGCG GATACGAAAC GCTGGTCTCG TTCCAGGATG CTCCCGAAAA CTTCTGGGGC
CACTTCCAGG AGCGGCGCAA GGCCGATGGC ATGATCGTGA TCGGCACGAC GACCAACACG
GCGGCGTGGG ACTATTTCCG CGACATGCCG GAAGGCACGC ACTGGACCTG CTGGGGCTCA
CCCGACAACG ACATGCCCTG GGTGCGCAGT GACAACCTTT CGGGCGCCAC GCTGGCGACG
CGCCACCTGC TGGTGCGCGG CTATCGCCAG ATCGTGTGCA TCGGCTCGGC CACCTCGCCC
CAGCGCCAGT TCCAGGAACG GTATGAAGGC TACGCCGAGG CCATGCGCTC GGCCGGGCTC
GAACCGCGCC TGCAACAGGT CGAGAGCGGC CTCGCACGCG AGGAACAGGG CCGCCGCGCG
GCGATAGCGC TGGCGGAAAG CGGAGAGCAG TTCGACGCCA TCTTCGCGGT CTGCGACGAG
ATGGCGCTGG GCGCGCTCAA GGAACTGACC GCGCGTGGCT ATGCCGTGCC GGACCAGGTC
GGGATCATCG GCTTCGACGG CATCCGCGCC GGCGCATGGT CGACCCCACC CCTCACCTCG
ATCGAACCCG ATTTCCAGAT GGCCGGCGGA TTGCTGGTCG AACAGCTGCT GGCAAAGATC
AACGGGACCG AAGGCTCGGG CCGGCGCGTG CCGGTAAGGC TGGTTATCCG GGGCTCAACA
CGGCCCTGA
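
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before computing anything such as GC content. The following is a minimal Python sketch of that step, shown on the first row of the sequence only; the helper names are illustrative, and how the database itself computed the 68% figure is not stated in this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-base display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above; paste all rows for the full gene.
first_row = "ATGCCGAGCG ACAGGAAGAT CATCCGCGTA ACGTCGTTCG ACGTGGCCGA AGCGGCGGGC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```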
 
Protein sequence
MPSDRKIIRV TSFDVAEAAG VSQSTVSRAL AGDTSISEPT RQRVIEAARR LNYQVDENAA 
RLRRGRTGTL AVVMICREAQ DRKDINPFYF SLLGSTCAAA SARGYETLVS FQDAPENFWG
HFQERRKADG MIVIGTTTNT AAWDYFRDMP EGTHWTCWGS PDNDMPWVRS DNLSGATLAT
RHLLVRGYRQ IVCIGSATSP QRQFQERYEG YAEAMRSAGL EPRLQQVESG LAREEQGRRA
AIALAESGEQ FDAIFAVCDE MALGALKELT ARGYAVPDQV GIIGFDGIRA GAWSTPPLTS
IEPDFQMAGG LLVEQLLAKI NGTEGSGRRV PVRLVIRGST RP
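
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal Python sketch of that translation, assuming the gene sequence is given in coding orientation starting at the first codon. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation (table 11 differs in the start codons it permits, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position
# varying fastest. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and newlines
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGCCGAGCG ACAGGAAGAT CATCCGCGTA"))
print(translate("CGGCCCTGA"))
```

The first call yields the first ten residues of the protein (MPSDRKIIRV) and the second yields its last two residues (RP) followed by the TGA stop codon, matching the protein sequence listed in this record.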