Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3868 |
Symbol | |
ID | 5077479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | + |
Start bp | 34452 |
End bp | 35714 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640480977 |
Product | Rieske (2Fe-2S) domain-containing protein |
Protein accession | YP_001165639 |
Protein GI | 146275478 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.555441 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCATTC CCCAGCAAAC AGACGCCTTG GATCAAGCCC CGTCCACTGC AGGATGCCGG GTTCCGTACC GCGTTTTCAC CGATCGCCAG TACTACGATC GCGAACAGGA GAACATTTTC AAGGGTAACT GCTGGTCCTT CGTCGGACTT GAGGCTGAAG TTCCGCAGGC GGGTGATTTC AAATCGACCT TCGTGGGCGA AACACCGGTG ATCCTCACGC GCGACAGCGA CGGCTCGTTG CATGTCGTGG TCAATCGCTG CGCCCATCGC GGCGCCCTGG TCTGCCGCGA AATGCGCGGC AACCGCGCAA GCCTGGAATG CGTCTACCAC CAATGGGCCT ACGATCTGAA GGGGAACCTG ATCGGGGTAC CCTTCCGCCG CGGCCTCAAG GGACAGGGCG GGATGCCGGG CGATTTCGAC ATGTCCCAGC ACAATCTTCG CCAGCTGCGC ACCGAATCCG TTGGCGGTCT GGTGTTTGCC TCATTCTCCG AAACTGTCGA ATCCTGCCGC GATTTCCTCG GCCCGATCGT GGTCGAGCAG ATCGAGCGGA TCATGTGCAA GCCGATCACC GTGCTGGGCG ACCAGCGCCA GCGGATCCGC GGGAACTGGA AGCTCTACGC CGAGAACACC CGCGATCCCT ACCACGCGAG CCTGCTGCAC CTGTTTCACA ACACGTTCGG TCTCTATCGC TCGACCCAGA CAGGTAAGGC ATTGATGGAT GCCAACAAAC GCCACTCGCT GCTCTATTCG ATCGCGGCCA GCAACGACGA TGCCGCCGAC AAGCAGGCCT ATGGCGATTC CCGCACCTTC GACACCGAAT TCAAGCTGCA GGACATGTCG CTGCTGAAAG GTCGGCAGGA ATTCGCGGAC AATGTTACGC TGGTGATCCT CGCGGTCTAC CCCAACCTGG TCCTCCAGCA GATCCAGAAC ACTCTCGCGG TGCGCCAGAC GGTGACCTAC GGGCCGGATG AATTCGAGCT CGTCTGGACC CATTTCGGCT ACCAGGACGA CGATGCGGAA ATGCAGGCCA TCCGGCTCAA GCAAGCCAAC CTGATCGGCC CCGCCGGGCT CATTTCGATG GAAGACGGCG AAGCGGTGGA AATCGTCCAG AACGCCATCG TCGGCGAGGC AAGCGCGACG TCCTACATCG CGATGGGTGG CGGCCGGGCC GAGGATGCCG ACCATCTCGT CACCGAGGGC GCGATCATCG GCTTCTGGGA CAATTATCGC GAAATGGTCG GCTTTGAGGT GGAGCCGACA TGA
|
Protein sequence | MSIPQQTDAL DQAPSTAGCR VPYRVFTDRQ YYDREQENIF KGNCWSFVGL EAEVPQAGDF KSTFVGETPV ILTRDSDGSL HVVVNRCAHR GALVCREMRG NRASLECVYH QWAYDLKGNL IGVPFRRGLK GQGGMPGDFD MSQHNLRQLR TESVGGLVFA SFSETVESCR DFLGPIVVEQ IERIMCKPIT VLGDQRQRIR GNWKLYAENT RDPYHASLLH LFHNTFGLYR STQTGKALMD ANKRHSLLYS IAASNDDAAD KQAYGDSRTF DTEFKLQDMS LLKGRQEFAD NVTLVILAVY PNLVLQQIQN TLAVRQTVTY GPDEFELVWT HFGYQDDDAE MQAIRLKQAN LIGPAGLISM EDGEAVEIVQ NAIVGEASAT SYIAMGGGRA EDADHLVTEG AIIGFWDNYR EMVGFEVEPT
|
| |