Gene Saro_1672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1672 
Symbol 
ID3918781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1752855 
End bp1754036 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content64% 
IMG OID640444413 
ProductRieske (2Fe-2S) protein 
Protein accessionYP_496946 
Protein GI87199689 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0251995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGATC TTTCCGGCGC GTTCCGGCAC ATTTCATCGG ACGAGGTCGA TCCGGACGCG 
GACTGGAGCC TGCCTGGCTG GCTCTACACC GATCCGGAAT ACTTCGCGGT GGAGATGGAG
CGCGTGATCC GCCCGTCGTG GCAGATCGTC TGCCACGAAA GCGACATTGC CGCGTCAGGC
GCGTACCGCA CGCTGGATTA TCTGGGTGAA AGCGTGATCG CGATCCGGGG TGAAGACGGC
GCGATCCGTG CTTTCGCCAA CGTCTGCCGC CACCGTGCGA TGCGGCTGGT CGAAGGGCCT
GCGGGCTGCG CCAAGAAGCT CGTCTGCCCG TATCATGCCT GGACGTTCGA ACCGGACGGT
CGACTTTCGG GCGTGCCGAT GAAGTCCGAT TATCCCGCGC TAAAACTCGA AGAGAACGGC
CTCGCGCCGG TCGCGGTCGA GATCTGGCGT GGCTTCGTGT TCGTGCGTCT GGTCGACGGC
GGATTCCCCA GCGTGGCCGA GATGATGGCG CCGTTCGAGG AAGAGGTTGC GCCCTATCGC
TTCGAGGACA TGCGCCGCAT TGGCGACGTG CGTTTGCGGA CGCGCGACGT GAACTGGAAG
AACGTTGGCG ACAATTATTC CGACAACCTC CACATCCCCG TCGCGCACGA TGGCCTGACG
CGCATCTTCG GCAAGTCCTA CGAGATTTCC GACCACGGTT GGGCCGATCG CATGAAGGGC
GATCTGGTCG ACAAGCCTTC GGCCAACTTC TGGGAGCGGT TCTACCAGGC GCACCTGCCG
GAGGTGCCGC ACCTGCCGGC GCAGTCGCAG CGGCGCTGGC TGTACTACAA GCTCTGGCCG
AACATCGCGT TCGACATCTA TGCCGACCAG ATCGACTTCA TGCAGTGGCT GCCACTCACG
CCGACGACCT CGGTCCTGCG CGAGATGTGC TTCGCGCTGC CCGATGAAAG GCGCGAGATG
AAGCTGGTCC GCTATGCCAA CTGGCGGATC AATCGCGTGG TCAACAAGGA GGACACCTGG
CTGATCGAGC GCATCCAGCA GGGCATGGCC TCGCAAAGTT ATGGCGCGGG ACCGATCGGC
AAGAGCGAGG TCTGCCTGCG CAGCTTCGCG CGCAAGATTC GCGCAATCAC CCCCGAGGCC
CGCCTGCACA AGGCGCCGGC GCCGGGGTGG AGCAGGAAAT AG
 
Protein sequence
MGDLSGAFRH ISSDEVDPDA DWSLPGWLYT DPEYFAVEME RVIRPSWQIV CHESDIAASG 
AYRTLDYLGE SVIAIRGEDG AIRAFANVCR HRAMRLVEGP AGCAKKLVCP YHAWTFEPDG
RLSGVPMKSD YPALKLEENG LAPVAVEIWR GFVFVRLVDG GFPSVAEMMA PFEEEVAPYR
FEDMRRIGDV RLRTRDVNWK NVGDNYSDNL HIPVAHDGLT RIFGKSYEIS DHGWADRMKG
DLVDKPSANF WERFYQAHLP EVPHLPAQSQ RRWLYYKLWP NIAFDIYADQ IDFMQWLPLT
PTTSVLREMC FALPDERREM KLVRYANWRI NRVVNKEDTW LIERIQQGMA SQSYGAGPIG
KSEVCLRSFA RKIRAITPEA RLHKAPAPGW SRK