Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1889 |
Symbol | |
ID | 3917110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1998001 |
End bp | 1999515 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444633 |
Product | tryptophan halogenase |
Protein accession | YP_497163 |
Protein GI | 87199906 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.140819 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCCTT CGAACAAGCG CAGGATAGTC ATCGCCGGTG GCGGGACGGC CGGATGGATG ACGGCCGCCG CGCTTGCCCG CTTTGCCATG CCGCACTGGC AGGTCACGCT CGTCGAATCC GAAGAGATCG GGACGATCGG CGTCGGCGAA GCGACCATTC CGATGGTGCG GCTATTCAAC CGCTCGCTGG GGATCGACGA ACGCCAGTTC CTGCGCGAGA CCCACGGTAC GTGGAAGCTG GGCATCGCCT TCGAGGGCTG GGGCGGCTTC GACGAGCGTT ACATCCATGG CTTCGGACTG ACAGGCCGGA GCCTTGGCGT CCTGCCCTTT CACCATTACT GGTTGCGGGG TCGCGCCATG GGAGCCGCAG GACCGCTGGG CGATTACGTG CTTAATGCCG TCGCCTGCGA GGAAGGCCGT TTCGCCCACA TCGACCGTCC AGAGGGCAGC GCATTGCCGC CGATGCCCTA TGCCTATCAT TTCGACGCGT CGCTCTATGC CGCGTTCCTG CGGCGCTATG CCGAAGAGCG TGGCGTGCTG CGGATCGAAG GTCGTATCCA GTCGGCCGCG CGCGATTGTT CCACCGGTGA CATTGCCTCG CTGAAGCTTG CCGATGGCCA CGAGGTGGAG GGCGATCTGT TCATCGATTG CTCGGGCTTT CGCAGCCTTC TCCTCGGACA GGAAATGGGC GTCGAGTACG TCGACTGGAG CCACTGGCTG CGCTGCGACC GCGCGGTTGC CGTTCCGTGC GAGCGCGCCG GGCCGCTGGT TCACTATACG CGGTCCATTG CCCAGCCCGC AGGCTGGTGC TGGCGCATCC CGCTGCAGCA CCGGACCGGC AACGGCCACG TGTTCTGTTC CGACGCGATG GGCGAGGACG AGGCGACCGC GCGCCTTCTG GGAAGGCTCG ACGGGCGCCA ACTGGCCGAG CCGCGCACGA TCCGGTTCAA GGCAGGGCGG CGCGAGACGT TCTGGAAGAA CAACGTGGTT GCGGTCGGCC TGTCCTCCGG CTTCATCGAA CCTCTGGAAT CGACTTCCAT TCACCTGATC CAGACGGCAA TCAGCCGGAT CATCGATTTT TTGCCGAACG GGCCGGTACA GGCAGCGGAC CGCGATGCGT ACAACCGGCT TTCGGTCTTC GAGATCGAAC GCATCCGCGA CTTCGTGATC CTTCACTACG TCGCAAATGG CCGTCACGGA GAGCCGTTCT GGGATAGCCT CAGAGCGATG GAAATTCCCG AAACGCTGGC TAAGCGCATC GCCATGTTCC GCGCATCGGG CCGCATCGTG CGGGAACACG AAGAGCTTTT CGACGTGCCC GGTTGGGTGC AGGTAATGGT GGGGCAGGGG ATCATGCCGG AGCGGTGGCA CCCTCTTGCC GACCAATTGG ACCGGACCAA CCTTTCCGCG TTCCTCGAAA CCGTCAGCGA AGCCTACAGA AAGGACGTTG CGCGAATGCC CCTTCACGCC GACTACCTCG AACATGTCTG CGGAAAGGTG GCTGCCCATG CGTAG
|
Protein sequence | MPPSNKRRIV IAGGGTAGWM TAAALARFAM PHWQVTLVES EEIGTIGVGE ATIPMVRLFN RSLGIDERQF LRETHGTWKL GIAFEGWGGF DERYIHGFGL TGRSLGVLPF HHYWLRGRAM GAAGPLGDYV LNAVACEEGR FAHIDRPEGS ALPPMPYAYH FDASLYAAFL RRYAEERGVL RIEGRIQSAA RDCSTGDIAS LKLADGHEVE GDLFIDCSGF RSLLLGQEMG VEYVDWSHWL RCDRAVAVPC ERAGPLVHYT RSIAQPAGWC WRIPLQHRTG NGHVFCSDAM GEDEATARLL GRLDGRQLAE PRTIRFKAGR RETFWKNNVV AVGLSSGFIE PLESTSIHLI QTAISRIIDF LPNGPVQAAD RDAYNRLSVF EIERIRDFVI LHYVANGRHG EPFWDSLRAM EIPETLAKRI AMFRASGRIV REHEELFDVP GWVQVMVGQG IMPERWHPLA DQLDRTNLSA FLETVSEAYR KDVARMPLHA DYLEHVCGKV AAHA
|
| |