Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_2003 |
Symbol | |
ID | 8225575 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 2444588 |
End bp | 2445835 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644929840 |
Product | tryptophan halogenase |
Protein accession | YP_003086391 |
Protein GI | 255035770 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.923769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACTG TCGATGTGCT GGTTATCGGA GCCGGCCCTG CCGGAACTGT CGCTGCTTCT TACCTGAAAA AGCAGGGCTA CGACGTCACG ATCCTGGAAA AAGAAAAATT CCCGCGTTTC CAGATCGGTG AAAGCCTTTT GCCGTGTTGT ATGGAACATT TAACGGAAGC CGGGCTGCGG GAGGCCATCG AGCCATTGAA TTTTCAGAAA AAAACAGGTG CGGCATTCAT GCGTGGCGAG AAGCGGTGCG AGTTCTTTTT TGAAGATCAG TTTACCAAAG GCTGGACATG GACCTGGCAG GTGAAGCGCG CCGACTTTGA CAGCAAACTG GCGGAAGCAA CGCGTGAAAA AGGCGTGGAT GTGAATTTTG AATGCGAAGT GACCGCGGTG GAATGTGGCC CGGAGAAACA GATTGTGGAT TACAAGGATG CCGAAGGCAA TGCGCATCGC ATTGAAGCGA AATTTATCAT CGATTCGAGC GGTTATGGCC GCGTTCTGCC CCGGCTTTTC AACCTGAGCA AAGCTTCCGC ATTCACGCCG CGCGGGGCGG TTTTCTCGCA TTTGGAAGAC AAAAACCGTT CGGAAGAGGC CAGCAACAAC ATTTTTGTGC ATTCGTTCGA CAATAACAAA TCGTGGATCT GGGCAATCCC ATTCTCCGAC GGCTCCACGT CCGTGGGTAT TGTGGGCGAC AGGGAAAAGA TCGAAGCGCT GGCCGAAAAC AATGGCGAAC AATACAAGGA GTTCATCCGC AATTTCGCCG ACCTGAACGG CCGGTTCAAA GAGTCTGATT TCAAATTCGA GCCGCGGGCG ATCCTCGGGT ATTCAATCGG CGTAACGCAA ATGTCGGGCG AAGGTTTCGT GCTTTCGGGC AACAGCACCG AGTTTCTCGA CCCTATTTTC TCATCGGGCG TGACATTCGC CACGGCCTCG GGCCTGCTTT CCGCCAAAAT GACCCACAAG CATTTACAAG GTGAAGCGGT GAATTGGAAA AAGGATTACG AAGAAGTAAT CCAGCAGGGT ATTAATGTGT TTAGAAGCTA CGTTACCGGC TGGTATTCAG GCGACTTTCA GACCATAGTT TTTGCCAAAC ACATCGACGA GGACATTAAG CGGCAAATCT GCTCGGTATT AGCTGGCTAC GTTTGGGATC AGAGCAACCC GTTCGTCAAA AAGCACGACA CCATCCTGCC CACACTCGCG AAGGTGATCA AAATGAAGGA AAAGGCGGAG GAAACCGACC TGAGCTAG
|
Protein sequence | MQTVDVLVIG AGPAGTVAAS YLKKQGYDVT ILEKEKFPRF QIGESLLPCC MEHLTEAGLR EAIEPLNFQK KTGAAFMRGE KRCEFFFEDQ FTKGWTWTWQ VKRADFDSKL AEATREKGVD VNFECEVTAV ECGPEKQIVD YKDAEGNAHR IEAKFIIDSS GYGRVLPRLF NLSKASAFTP RGAVFSHLED KNRSEEASNN IFVHSFDNNK SWIWAIPFSD GSTSVGIVGD REKIEALAEN NGEQYKEFIR NFADLNGRFK ESDFKFEPRA ILGYSIGVTQ MSGEGFVLSG NSTEFLDPIF SSGVTFATAS GLLSAKMTHK HLQGEAVNWK KDYEEVIQQG INVFRSYVTG WYSGDFQTIV FAKHIDEDIK RQICSVLAGY VWDQSNPFVK KHDTILPTLA KVIKMKEKAE ETDLS
|
| |