Gene Rmet_4387 details


Gene Information

Locus tag: Rmet_4387
Symbol:
ID: 4041245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cupriavidus metallidurans CH34
Kingdom: Bacteria
Replicon accession: NC_007974
Strand:
Start bp: 984931
End bp: 986160
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 63%
IMG OID: 637979808
Product: tryptophan halogenase
Protein accession: YP_586521
Protein GI: 94313312
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
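
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive (an assumption; the record does not state its coordinate convention) and that the protein length excludes the terminal stop codon. Variable names are illustrative only.

```python
# Values copied from the record above; the variable names are illustrative.
start_bp = 984931
end_bp = 986160
gene_length_bp = 1230
protein_length_aa = 409

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final stop codon is not translated.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1230 bp gives 410 codons, one of which is the stop codon, leaving 409 amino acids.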


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000478087
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.336171
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAG AGACAGTGGA CGTACTGATC GTCGGCGCGG GCCCGGCGGG TTCCGTGGCG 
GCCGGATTGC TGCGCAAGCG CGGCATCGGC GTGCTGGTGA TCGAAAAGGA AACGTTCCCG
CGTTTCTCGA TCGGCGAGAG CCTGCTGCCC CAGAGCATGG CCTACATCGA GGAGGCCGGT
ATGCTGCAGG CTGTGGTGGA GGCGGGCTTC CAGTACAAGA ACGGGGCGGC CTTCGCGCGG
GGCGACCGCT ATACCGATTT CGATTTTCGC GAGAAATTCT CGCCGGGATG GGGCACCACC
TACCAGGTGC AGCGCGCCCA TTTCGACGAT GTGCTGATTC GCGAGGCGCA GAAGCAGGGC
GCCGAGGTGC GCTTTCGCCA TGTTGTGGAA GCGGTGGACG TCAGCGGTGC CGCGCCGGTT
GTCACCGTGC GCGATCCGGA TGGCAACGTG TATCAGGTGG AATCACGCTT CCTGCTGGAC
GCCTCTGGCT TTGGCCGGGT TTTGCCGCGC CTGCTCGACC TGGAGTCGCC GTCGAATTTT
CCGGTGCGCG CCGCCATCTT CACGCACGTG GCCGACAGGA TTCCGGGGGG CAGCTTCGAT
CGCAACAAGA TCCGCGTCAG CGTGCATCCG GAACACGTCG ATGTCTGGTA CTGGACCATC
CCGTTCTCCA ACGGGCGTTG CTCGCAGGGC GTGGTGGCCG AGAAGTCGTT CCTCGACCGG
TACGAAGGCG ATGAAATGAC TCGCCTCAGG CAACTCGTGG CCGAGGAGCC GGGCCTCGCG
AAGCTGCTGA AGGATGCCGA GTGGGACACG CCGGCCCGCC AGATCGTCGG CTATTCGGCC
AACGTCCGGT CGCTCTGGGG CAATGGCTAC GCACTGCTTG GCAATGCCGG CGAATTTCTC
GACCCCGTGT TTTCGTCGGG CGTGACGATC GCATTCAAGT CGGCCAGCCT CGCCACCGCC
TGCGTGGCGC GCGCGTTGGC CGGCGAATCC GTGGACTGGG AAACCGAGTA CGCCAGGCCG
CTGAAAGCAG GCGTTGACTG CTTCCGTGCC TTTGTCGAAG CATGGTACGA AGGCAGCTTC
CAGAAGCTTA TCTTCCATCC CAACGCGCCC ACCGACATTC GCGACATGAT CTCGTCGATC
CTGGCAGGCT ATGCCTGGGA TCTGAACAAT CCATTCGTGA CGGAGTCGCG CCGTCGGCTG
AGCGTGCTGG AGCAATTTTG CGACGATTGA
 
Protein sequence
MKKETVDVLI VGAGPAGSVA AGLLRKRGIG VLVIEKETFP RFSIGESLLP QSMAYIEEAG 
MLQAVVEAGF QYKNGAAFAR GDRYTDFDFR EKFSPGWGTT YQVQRAHFDD VLIREAQKQG
AEVRFRHVVE AVDVSGAAPV VTVRDPDGNV YQVESRFLLD ASGFGRVLPR LLDLESPSNF
PVRAAIFTHV ADRIPGGSFD RNKIRVSVHP EHVDVWYWTI PFSNGRCSQG VVAEKSFLDR
YEGDEMTRLR QLVAEEPGLA KLLKDAEWDT PARQIVGYSA NVRSLWGNGY ALLGNAGEFL
DPVFSSGVTI AFKSASLATA CVARALAGES VDWETEYARP LKAGVDCFRA FVEAWYEGSF
QKLIFHPNAP TDIRDMISSI LAGYAWDLNN PFVTESRRRL SVLEQFCDD
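
The gene and protein sequences above can be related by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which start codons it permits). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(raw: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(raw.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
head = clean("ATGAAGAAAG AGACAGTGGA CGTACTGATC")
print(translate(head))  # MKKETVDVLI
```

Passing the full gene sequence through `clean` and then `translate` or `gc_content` would allow a comparison against the listed protein sequence and GC content.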