Gene Information |
Locus tag | Saro_1607 |
Symbol | |
ID | 3918715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1674284 |
End bp | 1675801 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444347 |
Product | tryptophan halogenase |
Protein accession | YP_496881 |
Protein GI | 87199624 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
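The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes (the record does not state this) that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the Saro_1607 record above.
length_bp = gene_length(1674284, 1675801)
print(length_bp)                  # 1518
print(protein_length(length_bp))  # 505
```

Both results agree with the Gene Length (1518 bp) and Protein Length (505 aa) fields in the record.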
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGTC CGAACCCCGC AGTCCCCGCC CTGCCGCGCC GCATCGTCAT CGCCGGGGGC GGAACTGCCG GCTGGATGAC CGCCGCAGCG CTTGCCCGGA CGCTCGGCAA GGTGGCGCAG GTCACGCTTG TCGAAAGCGA GCAGATCGGC ACCATCGGCG TGGGCGAAAG CACGATCCCG CCGCTGGTGG CCTACAATCG CATTCTCGGC ATCTCCGAGG CCGAATTCAT GCGCGCCACG CAGGCCACCT TCAAGCTCGG CATCAATTTC GAGAACTGGC GCGTGCCGGG CGAGGCCTAC TTCCATTCCT TCGGCGGCAC CGGCAAGGAT CACTGGTCGG CCGGCTTCCA GCACTTCTGG ATGCACGGCC TCGCACGCGG CCACGACGAG GCCTACGGCG AGTATTGCCT GGAGCTCAAG GCCGCGCAGG CCGGCCGTTT CGCGCACCTT CCGGAAGACC GGATGAACTA CGCCTACCAG CTCGATTCCG GCCTCTACGC CCGCTTCCTG CGACAGATGG CCGAAGCCGA TGGTACGGTC CGGATCGAGG GCAAGATCGG CGGGGTTCCG CTCGATCCCG AAACCGGAGA CATCGCCGCG CTTGTTCTCG AGGATGGTCA GCGCATCGAA GGTGACCTTT TCATTGACTG CACCGGCTTT CGCGCGCTGC TGATCGAGGG CGCTCTCCAC GTCGGCTATG ACGACTGGAC CCACTGGCTG CCCTGCGACG GCGCAATTGC CATCCAGACG TCAAGCGTTC GCGCTCCCGT TCCCTATACC CGCGCAATCG CGCACGGCAG CGGATGGCAA TGGCGCATTC CGTTGCAACA CCGCCAGGGC AACGGCATCG TCTATTGCAG CGCCTACATG GACCACGAGG CAGCGCTGGA AATGCTGCTG TCCACGGTGG AAGGCGAAAA GCTCGTCCGG CCCAATCCGA TCCGCTTCCG CACGGGGGTG CGGCGCAAGC AATGGCATCG CAACTGCGTT GCCGTTGGTC TTTCGGGCGG CTTCATGGAA CCGCTGGAAT CGACGTCGAT CCACCTCATC CAGCGTGCGA TCCTGCGCAT CGTGCGGATG CTGCCGGCAG GAACGATCAG CGCACGTGAC GTTGCCGAGT TCAATGACCA GCAGATGCTC GACATGGAGC AGATCCGCGA TTTCCTGATT CTACACTACA AGGCGACCGA CCGCCGCGAC ACGCCGTTCT GGCGCCACTG CGCGAGCATG GAGGTTCCGG AAAGCCTCGC TCACCGGATC GAGCTGTTCC GCGAGACCGG CCGCGTCTTC CGTCGCAACG AGGAGCTGTT TGCCGAAAAC TCGTGGGTCC AGGTGATGAT GGGGCAGGGA ATCGTGCCGC AGAGCTATCA CCCGGTTGCG GCCAAGCTGC GCGACGAGGA ACTGGCGCAT CTGCTCCAGA CCCTGCGCGA GCAGGTCGAT CGCACCGTTG CCGCGCTGCC CGCGCACGGT GATTATATCG CGCGCTACTG TGGCGCCGAA ATGCCGGCGG CTGCATGA
|
Protein sequence | MKRPNPAVPA LPRRIVIAGG GTAGWMTAAA LARTLGKVAQ VTLVESEQIG TIGVGESTIP PLVAYNRILG ISEAEFMRAT QATFKLGINF ENWRVPGEAY FHSFGGTGKD HWSAGFQHFW MHGLARGHDE AYGEYCLELK AAQAGRFAHL PEDRMNYAYQ LDSGLYARFL RQMAEADGTV RIEGKIGGVP LDPETGDIAA LVLEDGQRIE GDLFIDCTGF RALLIEGALH VGYDDWTHWL PCDGAIAIQT SSVRAPVPYT RAIAHGSGWQ WRIPLQHRQG NGIVYCSAYM DHEAALEMLL STVEGEKLVR PNPIRFRTGV RRKQWHRNCV AVGLSGGFME PLESTSIHLI QRAILRIVRM LPAGTISARD VAEFNDQQML DMEQIRDFLI LHYKATDRRD TPFWRHCASM EVPESLAHRI ELFRETGRVF RRNEELFAEN SWVQVMMGQG IVPQSYHPVA AKLRDEELAH LLQTLREQVD RTVAALPAHG DYIARYCGAE MPAAA
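The sequences above are printed in space-separated blocks of 10 characters, so the spaces must be removed before any computation. The following is a minimal sketch of how the GC content field and the gene/protein length relationship could be checked. The helper names are hypothetical, and the consistency check assumes that the CDS ends in a single stop codon (TAA, TAG or TGA under translation table 11) that is not represented in the protein sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean(seq: str) -> str:
    """Strip the whitespace that splits the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def cds_matches_protein(dna: str, protein: str) -> bool:
    """Check that the CDS is a whole number of codons, ends in a stop codon,
    and has exactly one more codon than the protein has residues."""
    dna, protein = clean(dna), clean(protein)
    return (
        len(dna) % 3 == 0
        and dna[-3:] in STOP_CODONS
        and len(dna) // 3 - 1 == len(protein)
    )


# Toy input: the first three codons of the gene above plus its stop codon,
# paired with the first three residues of the protein. Not the full record.
print(cds_matches_protein("ATGAAGCGT TGA", "MKR"))  # True
```

To check the full record, paste the complete Gene sequence and Protein sequence fields as the two string arguments; `gc_content` applied to the complete gene sequence can then be compared with the GC content field.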