Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1604 |
Symbol | |
ID | 3918712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1671027 |
End bp | 1672568 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444344 |
Product | tryptophan halogenase |
Protein accession | YP_496878 |
Protein GI | 87199621 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGAGG GTGAGGTCAG GCGGATCGTG ATCGTCGGCG GCGGGACCGC CGGCTGGCTG TCGGCCTGCT TCCTTGCCGC GGAATTGCGT TCGCGCAGGG TCGACGTCTC GATAACCCTG ATCGAGGCAC CCGACATTCC CACGATCGGC GTGGGCGAGG GGACCTGGCC GACGATGCGC GGTACCTTGC GCGCCATCGG CATCGCAGAG GACAATTTCC TTGCCGCCTG CGATGCCTCG TTCAAACAGG GATCGGTCTT CTCCGGTTGG GTCACCGGAG AGCCGGGGGA CCGGTACCAG CATCCGTTCA CGCCACCACC CGCCGGGGCG CTCGACGACA TCGTCGCGGC CTGGGCGGGC AGGGGCGACC AGCCTTTCGC AAGCGCAATG ACCGCGCAGG CGGCGGTGAT CGACCGGGAC CTTGCGCCGC GCCAGCCGAA CATGCCGTTC TATGCCGGCG CCCTGAACTA CGCCTACCAC CTCGACGCCG GAAAGTTCGC CGCACTTCTT TCCTCCCGTG CGACCGGACA CCTTGGCGTC ACCCACGTCC GCGATCGTGT CCGCAACACG GTCGCGGACG ACAGGGGGCA TCTGCTGGCG GTGGAACTGG CCTCTGGCGA CCGGGTCGAG GGCGATCTCT TCATCGATTG TTCGGGCCAT GCCGCGCTGC TGATTGGCGG CGCCTGCGGC GTCCCCTTCA TTGACCGCAG CGACGTCCTG CTCAATGATC GCGCGCTCGC GGCGCAAGTG TCCGTGCAGC CGGGCACGCC GGTAGCTTCT GCCACCGTCG CAACCGCACA CCGCGCCGGC TGGCTGTGGG ACATCGGGCT TCCCGGCAGG CGCGGCATCG GCTGCGTCTA TTCATCCCGC TTCCTGTCCG ACGACGATGC GCTGGCAGTG CTGCGCGCCT ATGTCTCGGC AAACGTCCCG GGCGCAGATC CTGCGGCGGT AGAGCCTCGG CGCATCGCCT TTCCGACCGG GCACAGGGAG CGTTTCTGGC ATGGCAACTG CATCGCCATT GGCCTTTCGG CGGGCTTTCT CGAACCGCTC GAGGCCTCGG CCATCGTGAT GATCGAGCTT TCACTCCGAG CGCTGGCCGA GAATTTCCCC CGCCAGCGCC AGGCCATGCC GTTCCTCGCC GGCCGCTTCA ACGACCTGTT TCGCTATCGC TGGGACCGGG TGATCGAGTT CCTCAAGCTT CACTACCTGC TCTCGAAGCG GCAGGAGCCC TACTGGCAGG CACAGCGCGA TCCGGCCGCT GTGCCGCAGA GGTTGGCCGA ACTTGTCGCG CTCTGGCGTC ACCAGCCGCC GAGCGCATGG GATTTCCCGC AGGTGGACGA GATATTCTCG GCCGAGAGCC ATGCCTACAT TCTCTATGGC ATGGGCTTTC CACCGCCTGC CTCCACCGCG CGCGATCCGC TTGCCGCCCG CGCGCTAGAT GACGTGGCGC AGCGCACCCG CGCCTTGTGT GCCGCGCTGC CGACGAACCG CAGCTACCTG TCTTCCCTCA TGCCCGCGCA AGGAGCCGCC GCCCTCCCAT GA
|
Protein sequence | MDEGEVRRIV IVGGGTAGWL SACFLAAELR SRRVDVSITL IEAPDIPTIG VGEGTWPTMR GTLRAIGIAE DNFLAACDAS FKQGSVFSGW VTGEPGDRYQ HPFTPPPAGA LDDIVAAWAG RGDQPFASAM TAQAAVIDRD LAPRQPNMPF YAGALNYAYH LDAGKFAALL SSRATGHLGV THVRDRVRNT VADDRGHLLA VELASGDRVE GDLFIDCSGH AALLIGGACG VPFIDRSDVL LNDRALAAQV SVQPGTPVAS ATVATAHRAG WLWDIGLPGR RGIGCVYSSR FLSDDDALAV LRAYVSANVP GADPAAVEPR RIAFPTGHRE RFWHGNCIAI GLSAGFLEPL EASAIVMIEL SLRALAENFP RQRQAMPFLA GRFNDLFRYR WDRVIEFLKL HYLLSKRQEP YWQAQRDPAA VPQRLAELVA LWRHQPPSAW DFPQVDEIFS AESHAYILYG MGFPPPASTA RDPLAARALD DVAQRTRALC AALPTNRSYL SSLMPAQGAA ALP
|
| |