Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rmet_4387 |
Symbol | |
ID | 4041245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cupriavidus metallidurans CH34 |
Kingdom | Bacteria |
Replicon accession | NC_007974 |
Strand | + |
Start bp | 984931 |
End bp | 986160 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637979808 |
Product | tryptophan halogenase |
Protein accession | YP_586521 |
Protein GI | 94313312 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000478087 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.336171 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAG AGACAGTGGA CGTACTGATC GTCGGCGCGG GCCCGGCGGG TTCCGTGGCG GCCGGATTGC TGCGCAAGCG CGGCATCGGC GTGCTGGTGA TCGAAAAGGA AACGTTCCCG CGTTTCTCGA TCGGCGAGAG CCTGCTGCCC CAGAGCATGG CCTACATCGA GGAGGCCGGT ATGCTGCAGG CTGTGGTGGA GGCGGGCTTC CAGTACAAGA ACGGGGCGGC CTTCGCGCGG GGCGACCGCT ATACCGATTT CGATTTTCGC GAGAAATTCT CGCCGGGATG GGGCACCACC TACCAGGTGC AGCGCGCCCA TTTCGACGAT GTGCTGATTC GCGAGGCGCA GAAGCAGGGC GCCGAGGTGC GCTTTCGCCA TGTTGTGGAA GCGGTGGACG TCAGCGGTGC CGCGCCGGTT GTCACCGTGC GCGATCCGGA TGGCAACGTG TATCAGGTGG AATCACGCTT CCTGCTGGAC GCCTCTGGCT TTGGCCGGGT TTTGCCGCGC CTGCTCGACC TGGAGTCGCC GTCGAATTTT CCGGTGCGCG CCGCCATCTT CACGCACGTG GCCGACAGGA TTCCGGGGGG CAGCTTCGAT CGCAACAAGA TCCGCGTCAG CGTGCATCCG GAACACGTCG ATGTCTGGTA CTGGACCATC CCGTTCTCCA ACGGGCGTTG CTCGCAGGGC GTGGTGGCCG AGAAGTCGTT CCTCGACCGG TACGAAGGCG ATGAAATGAC TCGCCTCAGG CAACTCGTGG CCGAGGAGCC GGGCCTCGCG AAGCTGCTGA AGGATGCCGA GTGGGACACG CCGGCCCGCC AGATCGTCGG CTATTCGGCC AACGTCCGGT CGCTCTGGGG CAATGGCTAC GCACTGCTTG GCAATGCCGG CGAATTTCTC GACCCCGTGT TTTCGTCGGG CGTGACGATC GCATTCAAGT CGGCCAGCCT CGCCACCGCC TGCGTGGCGC GCGCGTTGGC CGGCGAATCC GTGGACTGGG AAACCGAGTA CGCCAGGCCG CTGAAAGCAG GCGTTGACTG CTTCCGTGCC TTTGTCGAAG CATGGTACGA AGGCAGCTTC CAGAAGCTTA TCTTCCATCC CAACGCGCCC ACCGACATTC GCGACATGAT CTCGTCGATC CTGGCAGGCT ATGCCTGGGA TCTGAACAAT CCATTCGTGA CGGAGTCGCG CCGTCGGCTG AGCGTGCTGG AGCAATTTTG CGACGATTGA
|
Protein sequence | MKKETVDVLI VGAGPAGSVA AGLLRKRGIG VLVIEKETFP RFSIGESLLP QSMAYIEEAG MLQAVVEAGF QYKNGAAFAR GDRYTDFDFR EKFSPGWGTT YQVQRAHFDD VLIREAQKQG AEVRFRHVVE AVDVSGAAPV VTVRDPDGNV YQVESRFLLD ASGFGRVLPR LLDLESPSNF PVRAAIFTHV ADRIPGGSFD RNKIRVSVHP EHVDVWYWTI PFSNGRCSQG VVAEKSFLDR YEGDEMTRLR QLVAEEPGLA KLLKDAEWDT PARQIVGYSA NVRSLWGNGY ALLGNAGEFL DPVFSSGVTI AFKSASLATA CVARALAGES VDWETEYARP LKAGVDCFRA FVEAWYEGSF QKLIFHPNAP TDIRDMISSI LAGYAWDLNN PFVTESRRRL SVLEQFCDD
|
| |