Gene Saro_1609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1609 
Symbol 
ID3918717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1676725 
End bp1677795 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content66% 
IMG OID640444349 
ProductLacI family transcription regulator 
Protein accessionYP_496883 
Protein GI87199626 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCGCA GGCGCCAGGC GGTAACGATC AAGCACGTGG CAGCGGACGC CGGCGTCTCG 
CTGCAGACGG TCAGCCGCGT TATCAACAAC GAACCCAACG TGCGTCCGGA AATGCGCGAA
AAGGTCCAGG CCTCGATCGA CCGGCTCGGC TACGTGCCGT CGATCGCCGC CCAGCGGATG
AGCGGATCGC GTTCCTACCT CATACTCGCG CTGAACGACC GCGAGCGCAC GATTGCCGAC
TGGCGGGCAC GTCAGGGCAC CGACTGGGTT GACCAGATGC TGCTCGGCGG GATGCTCGAA
TGCGCAGAGC ACGGTTATCG CCTGATCTTC GAACTGGTCG ATACGCACAA CGACCATGTC
GAACGCGAGC TGACCGCTGC CATCGCGGCG CTTCAGCCGG ACGGCGCGAT CCTCACGCCG
CCGCACTCGG ACAATCCCAA GATCCTCGCG GTCCTTGCCC GGCACAAGGT GCCCTTTGCA
AGGATCGGCG CGCAGACCAG CGAAAAGGGC CTGCCGCCGG GCATCCTCGT TTCGATGGAC
GACGAAGGCG GCGCCCGCAC CGCGACGCGG CACCTGCTAG ATCTCGGACA TCGCCGCATC
GGCTTCATAT CCGGACCTAC CGAATATCGC CTCGCGGGGA AGCGAGTCGA AGGCTGGCGC
GCCGAAATGG AGGCGGCAGG GCTCGGTGTC GATGGCCTGC TCGAGGCTGG CGACTTCACC
TACCAGTCTG GCGTGCGGGC TGCGCGCGCC CTGCTGACGA GGCCTGACCG GCCGAGCGCG
ATCATCGCCA GCAACGACCA GATGGCGCTT GCCACGGTCG AGATTGCGGA CGAACTGGGC
CTGTCTATCC CTGCCGACCT TTCGCTGGTC AGTTTCGACA ATACGCCGCT TGTGCGCTTC
ACCCGCCCGG CGCTGACTGC CGTGGATCAG CCCATTGCCG ATACGACCGC GCGGGCGGTA
AGGATGCTCA TCGCCTCACA CCGCAAGCCC GATGCCGACA TGGGCCCGGT GGTCATGCCG
ATGGGTTTCG AGATCCGCGG CTCTACCGCG CCTTTCGGCA AGGGCGGCTA G
 
Protein sequence
MGRRRQAVTI KHVAADAGVS LQTVSRVINN EPNVRPEMRE KVQASIDRLG YVPSIAAQRM 
SGSRSYLILA LNDRERTIAD WRARQGTDWV DQMLLGGMLE CAEHGYRLIF ELVDTHNDHV
ERELTAAIAA LQPDGAILTP PHSDNPKILA VLARHKVPFA RIGAQTSEKG LPPGILVSMD
DEGGARTATR HLLDLGHRRI GFISGPTEYR LAGKRVEGWR AEMEAAGLGV DGLLEAGDFT
YQSGVRAARA LLTRPDRPSA IIASNDQMAL ATVEIADELG LSIPADLSLV SFDNTPLVRF
TRPALTAVDQ PIADTTARAV RMLIASHRKP DADMGPVVMP MGFEIRGSTA PFGKGG