Gene Saro_0807 details


Gene Information

Locus tag: Saro_0807
Symbol:
ID: 3915861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 858057
End bp: 859577
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 68%
IMG OID: 640443538
Product: XRE family transcriptional regulator
Protein accession: YP_496086
Protein GI: 87198829
COG category: [R] General function prediction only
COG ID: [COG3800] Predicted transcriptional regulator
TIGRFAM ID:
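
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 858057
END_BP = 859577
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a whole number of codons")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```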


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCGGGC TCAACTGCGG TTTGGCCTTC CACCTGCGCC AGAGCGGTTG CCAAGTCCAA 
TGTTATTCTG CAAACTTGCG AAGGCACACT TTGCAAATTG GAATACGGGA AATGGCCAGA
CGCCGCCTCT TTGCGGGAGA ACAGCTCAAG GCCTTGCGCA GCGCGCGCAA GCTGCGCCAG
GGCGAAATGG CCGCGCTGCT GGGCATCAGC GCCTCCTACC TCTCGCAGAT CGAGAACGAC
GAACGCCCGC TGACGCCGGC GCTGACGGAC CGCCTGCAAT CGAGCTTCCC CGTCGAATGG
CAGGACTTCG CCTCGGACCG GGTCGAGCCG GTCCTGGCCG CGCTGCGCGA TGCCACCGCC
GACCCGCTCA TCGGCCAGGC CCTGCCGGGC GAGCAGGTGG AGCGCGTGGC AGAACAATAC
CCCGCTTTCG CCCAGGCCTT CGCCCGCCTG TGGGACCAGC ACCGCCGCTC GGTCCAGCGG
CTCGAGATCA TCGATGAGGC ACTGGGCTCC GACAACATCT CCGGCGGTCG ACTGCCGTGG
GAGGAAGTGC GCGACTGGTT TCACCACGCC AACAACTACG TCGACGCCAT CGACCGCGCT
GCGGAACGGC TAGCCATCCG CCTTTCCGGC ACCGGCATGT CTCCCACGAT GGGACAGATG
GCGGTCTGGC TCGAAAGCCG GGGCATCGCG GTCGAGCAGG TCAGCGGCGG AGCCATGCGG
CGTTTCGACC CCGAGGCTCG CCGCCTCACC CTCGATCCAA ACCAGCCGGT CGAGTCCGGC
CGGTTCCAGA TGGCCTACCA GCTCGCCGCC GAAGCCCTGA GCGAGGAGAT CGCGGCCATC
GTGAACGAGG CGACGCTCCA ATCCGCCGCC GCGCGCCAGC TCCTCACCGT CGGCCTCGGC
AACTATGCCG CGGGCGCGCT GATCATGCCC TATGAGTGGT TCCGCACCCG CGCGCGCGAA
CTGCGCCACG ACATCGACCA GTTGCGCCAG CTCTTCGGCG CCAGTTTCGA ACAGGTCTGC
CACCGCCTGT CCACGCTGCA ACGCCCCCAG GCGCGCGGCA TTCCGATGTT CTTTTGCCGT
GTCGACATGG CCGGGAACAT CACCAAGCGC CATTCCGCCA CGCGCCTGCA ATTCGCCCGC
TTCGGCGGCG CCTGCCCGCT ATGGGTGGTG CACGAAGCCG TGGCGATCCC CGACCGCATC
CACGTCCAGG CCGCGGAGAT GCCCGACGGC GTGCGCTACG TCTCGATCGC CAAGGGTCTG
GTGAAGCCTT CGGGCAGCTA CTATCGCCCG CCGCGCCGCT ACGCCGTGGC GCTCGGCTGC
GAGGCGGCGC TGGCGGACGA GTTCATCTAC GCCGACGGCA TCAATCTGGC GCGGCCCGAG
GCGGTTACCC GCATCGGCAT TTCCTGCCGC ATCTGCCCGC GCGACCGCTG CGACCAGCGC
GCCTTCCCGC CCAGCGACCG GGCGATCCTC GTCGACCCCC ACGCCCGCGA CCTCGTCCCT
TACGGAATCA CCGACATCTA G
 
Protein sequence
MFGLNCGLAF HLRQSGCQVQ CYSANLRRHT LQIGIREMAR RRLFAGEQLK ALRSARKLRQ 
GEMAALLGIS ASYLSQIEND ERPLTPALTD RLQSSFPVEW QDFASDRVEP VLAALRDATA
DPLIGQALPG EQVERVAEQY PAFAQAFARL WDQHRRSVQR LEIIDEALGS DNISGGRLPW
EEVRDWFHHA NNYVDAIDRA AERLAIRLSG TGMSPTMGQM AVWLESRGIA VEQVSGGAMR
RFDPEARRLT LDPNQPVESG RFQMAYQLAA EALSEEIAAI VNEATLQSAA ARQLLTVGLG
NYAAGALIMP YEWFRTRARE LRHDIDQLRQ LFGASFEQVC HRLSTLQRPQ ARGIPMFFCR
VDMAGNITKR HSATRLQFAR FGGACPLWVV HEAVAIPDRI HVQAAEMPDG VRYVSIAKGL
VKPSGSYYRP PRRYAVALGC EAALADEFIY ADGINLARPE AVTRIGISCR ICPRDRCDQR
AFPPSDRAIL VDPHARDLVP YGITDI
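
The gene and protein sequences above are laid out in blocks of ten characters, so the spacing has to be removed before they can be compared. The sketch below is one way to do that in Python and to translate the DNA codon by codon. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs in which start codons it permits), and all function names are hypothetical. Only the first 30 and last 21 bases of the gene are used as input here; the full sequence can be pasted in the same way.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence shown above.
first_bases = clean_sequence("ATGTTCGGGC TCAACTGCGG TTTGGCCTTC")
last_bases = clean_sequence("TACGGAATCA CCGACATCTA G")

print(translate(first_bases))  # MFGLNCGLAF
print(translate(last_bases))   # YGITDI
```

The two translated fragments match the first ten and last six residues of the protein sequence listed above.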