Gene Dfer_2003 details


Gene Information

Locus tag: Dfer_2003
Symbol:
ID: 8225575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 2444588
End bp: 2445835
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 52%
IMG OID: 644929840
Product: tryptophan halogenase
Protein accession: YP_003086391
Protein GI: 255035770
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
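
The coordinate and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the final stop codon is not counted in the protein length. Both are assumptions about how the record was generated, not something the record states. A minimal sketch of that check, with illustrative variable names:

```python
# Values copied from the Gene Information fields.
start_bp = 2444588
end_bp = 2445835

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1248 415
```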


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.923769
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAACTG TCGATGTGCT GGTTATCGGA GCCGGCCCTG CCGGAACTGT CGCTGCTTCT 
TACCTGAAAA AGCAGGGCTA CGACGTCACG ATCCTGGAAA AAGAAAAATT CCCGCGTTTC
CAGATCGGTG AAAGCCTTTT GCCGTGTTGT ATGGAACATT TAACGGAAGC CGGGCTGCGG
GAGGCCATCG AGCCATTGAA TTTTCAGAAA AAAACAGGTG CGGCATTCAT GCGTGGCGAG
AAGCGGTGCG AGTTCTTTTT TGAAGATCAG TTTACCAAAG GCTGGACATG GACCTGGCAG
GTGAAGCGCG CCGACTTTGA CAGCAAACTG GCGGAAGCAA CGCGTGAAAA AGGCGTGGAT
GTGAATTTTG AATGCGAAGT GACCGCGGTG GAATGTGGCC CGGAGAAACA GATTGTGGAT
TACAAGGATG CCGAAGGCAA TGCGCATCGC ATTGAAGCGA AATTTATCAT CGATTCGAGC
GGTTATGGCC GCGTTCTGCC CCGGCTTTTC AACCTGAGCA AAGCTTCCGC ATTCACGCCG
CGCGGGGCGG TTTTCTCGCA TTTGGAAGAC AAAAACCGTT CGGAAGAGGC CAGCAACAAC
ATTTTTGTGC ATTCGTTCGA CAATAACAAA TCGTGGATCT GGGCAATCCC ATTCTCCGAC
GGCTCCACGT CCGTGGGTAT TGTGGGCGAC AGGGAAAAGA TCGAAGCGCT GGCCGAAAAC
AATGGCGAAC AATACAAGGA GTTCATCCGC AATTTCGCCG ACCTGAACGG CCGGTTCAAA
GAGTCTGATT TCAAATTCGA GCCGCGGGCG ATCCTCGGGT ATTCAATCGG CGTAACGCAA
ATGTCGGGCG AAGGTTTCGT GCTTTCGGGC AACAGCACCG AGTTTCTCGA CCCTATTTTC
TCATCGGGCG TGACATTCGC CACGGCCTCG GGCCTGCTTT CCGCCAAAAT GACCCACAAG
CATTTACAAG GTGAAGCGGT GAATTGGAAA AAGGATTACG AAGAAGTAAT CCAGCAGGGT
ATTAATGTGT TTAGAAGCTA CGTTACCGGC TGGTATTCAG GCGACTTTCA GACCATAGTT
TTTGCCAAAC ACATCGACGA GGACATTAAG CGGCAAATCT GCTCGGTATT AGCTGGCTAC
GTTTGGGATC AGAGCAACCC GTTCGTCAAA AAGCACGACA CCATCCTGCC CACACTCGCG
AAGGTGATCA AAATGAAGGA AAAGGCGGAG GAAACCGACC TGAGCTAG
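
A GC percentage like the one reported in the record can be recomputed from the gene sequence. The sketch below shows one way to do it; `gc_content` is a hypothetical helper, and stripping whitespace is an assumption made to cope with the 10-base grouping used in this listing. The sample input is only the first row of the sequence, so its value is not expected to match the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, used here only as sample input.
first_row = "ATGCAAACTG TCGATGTGCT GGTTATCGGA GCCGGCCCTG CCGGAACTGT CGCTGCTTCT"
print(round(gc_content(first_row), 1))
```

Passing the full gene sequence, with all rows concatenated, gives the value to compare against the reported GC content.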
 
Protein sequence
MQTVDVLVIG AGPAGTVAAS YLKKQGYDVT ILEKEKFPRF QIGESLLPCC MEHLTEAGLR 
EAIEPLNFQK KTGAAFMRGE KRCEFFFEDQ FTKGWTWTWQ VKRADFDSKL AEATREKGVD
VNFECEVTAV ECGPEKQIVD YKDAEGNAHR IEAKFIIDSS GYGRVLPRLF NLSKASAFTP
RGAVFSHLED KNRSEEASNN IFVHSFDNNK SWIWAIPFSD GSTSVGIVGD REKIEALAEN
NGEQYKEFIR NFADLNGRFK ESDFKFEPRA ILGYSIGVTQ MSGEGFVLSG NSTEFLDPIF
SSGVTFATAS GLLSAKMTHK HLQGEAVNWK KDYEEVIQQG INVFRSYVTG WYSGDFQTIV
FAKHIDEDIK RQICSVLAGY VWDQSNPFVK KHDTILPTLA KVIKMKEKAE ETDLS
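
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration and not the database's own pipeline. It assumes the sequence is given in coding orientation and uses the standard codon assignments, which NCBI translation table 11 shares. Alternative start codons, where table 11 differs from the standard code, are not handled. `translate` and `CODON_TABLE` are hypothetical names.

```python
from itertools import product

# Codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence.
print(translate("ATGCAAACTG TCGATGTGCT GGTTATCGGA"))  # MQTVDVLVIG
print(translate("GAAACCGACC TGAGCTAG"))               # ETDLS
```

The two fragments translate to the first ten and last five residues of the protein sequence listed in this record.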