Gene Saro_3539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3539 
Symbol 
ID5077688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp156111 
End bp157457 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content65% 
IMG OID640481263 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001165925 
Protein GI146275765 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCACCT ACATCCAGGT CCCCGAAACC GATGCCCCCT ACCCGACCGA TGCCCGCGCG 
CAGTTTCCCG CCGCGCGCGG CGATGCGATC ACCGGCGACC GCTACTGGTC GAAGGAATTC
GCCGCGAAGG AATGGGAACA CATGTGGAAG CGCGTGTGGC ACGTCGGCGG GCGCACGGCG
CAGCTTGAAG AGCCGGGCGA TTTCATCACC CACAACTTCA TGCGCCAGTC GGTGGTCATG
GTCCGCCAGA AGGACGGCGG CATCCGCGCG TTCCACAACG TCTGCCGCCA TCGCGGCAAT
CGCCTCGTCA CCGTGGAAGA AGGCGGTGTC GGCGAACACT TCACCTGCCC CTACCACGGC
TGGAAGTGGA ACATAAACGG CGCGCTCGAC CATGTGCAGG ACGAGGAGGA TTTCCCCCAG
GGCAGCCCTT GCGGCAAGCT GCGGATGAAG GAAGTCCCGT GCGAGACCTG GGGCGGCTTC
GTTTTCTACA GCTTCGATCC CAACGCGGTG CCGCTGATGG AATATCTCGA TCCCATCCCG
TCGCTGCTCG GCAACCGCGA TCTCGCCAAC TGGAAGCGCG TGGTGTGGCG GACGCTGCGG
GTGAACACCA ACTGGAAGTT CGCGTCCGAC AACTTCAACG AGGCCTACCA CATCCCCGCC
GTGCATCCGC AGTTCGAGGG GATGATCGAC GATCACTACT CGACCACCGT GTTCGAGATG
TACCCCACCG GGCACAACCG CATGATCGAG AAGCTGCAGC CATCGAGCCG CTATCCCGAT
GCCCAGCAGA TGAAGCCGCT GTGGGCGCAG GTGCTCAAGG AATGGGACCT CGATCCCGCC
GAGTTCGAAG GACGCGCGCA GGAAGGCCGT CTGGCCCTGC AGCAGGCGCG GCGCAAGCTG
GGGCCGGCAC GCGGATTCAC GCATTTCGCG GCACTGACCG ACGACGAGCT GACCGACCAG
TTCCACCACA CCTGCTTCCC CAACCTGACG CTGACCGGCA CGCCTGAAGG GCTGCACGTG
TTCCGCACCG AGCCGGACAT GGAAGACCCC AACTGGTCGA CCTTCGACTA CTGGTACCTT
GCGCCGGAAG TCGCGGGCGG AGCGGATGTG CCGACGCTAT ATGGCCTGCG CCCGTGGAAG
GAAGCCGAGC ACCAGACCGG CGACTTTACC GCCTACAGCG CCGAGATTCC GCAGGGCGAC
TTCCTGATCC AGGACCTCGA CGTGGCGGTG ACGCAGCAGC AGGGGCTGCA CTCGCTCGGC
CATGACGATG CCTACCTCGC CGGCCAGGAA ACGCGCGTGC GCAGGTTCCA CGAAGTGATC
AACGACTACA TCGAGGGGCG GCGCTGA
 
Protein sequence
MATYIQVPET DAPYPTDARA QFPAARGDAI TGDRYWSKEF AAKEWEHMWK RVWHVGGRTA 
QLEEPGDFIT HNFMRQSVVM VRQKDGGIRA FHNVCRHRGN RLVTVEEGGV GEHFTCPYHG
WKWNINGALD HVQDEEDFPQ GSPCGKLRMK EVPCETWGGF VFYSFDPNAV PLMEYLDPIP
SLLGNRDLAN WKRVVWRTLR VNTNWKFASD NFNEAYHIPA VHPQFEGMID DHYSTTVFEM
YPTGHNRMIE KLQPSSRYPD AQQMKPLWAQ VLKEWDLDPA EFEGRAQEGR LALQQARRKL
GPARGFTHFA ALTDDELTDQ FHHTCFPNLT LTGTPEGLHV FRTEPDMEDP NWSTFDYWYL
APEVAGGADV PTLYGLRPWK EAEHQTGDFT AYSAEIPQGD FLIQDLDVAV TQQQGLHSLG
HDDAYLAGQE TRVRRFHEVI NDYIEGRR