Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_2107 |
Symbol | |
ID | 8225679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 2572459 |
End bp | 2573583 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644929944 |
Product | 4-hydroxyphenylpyruvate dioxygenase |
Protein accession | YP_003086495 |
Protein GI | 255035874 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins |
TIGRFAM ID | [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0706257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.400386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCCCC AAACCGCCCC CGCCCCCCGC CACACGACCG ACTTCCTGCC TATTAACGGG ACGGATTATG TGGAACTTTA TGTCGGGAAT GCGCTGCAAT CGGCGCATTA TTTCCGGAAT GCATTCGGGT TTCAGCCGCT GGCGATGGCC GGGCTTGAAA CCGGGCTCAC CGACCGCGAA TCGTACGTGG TGGTGCAGGA CAAGATCCGG CTCGTTTTCA CCTCTCCGCT GCACGGCGGC ACGGCCATTG GCGCGCATAT CGACCGCCAT GGCGATGGTG TAAAGGCCAT TGCATTGTGG GTGGACGATG CCGAAAGTGC GTTTCGCGAG ACCGTCAGCC GCGGGGCCGA GCCCTTTTTT GAACCGGTTA CCGAGCAGGA CGCACACGGC CGGACGGTGC GGGCCGGAAT ATGCACTTAT GGTGACACGG TGCATGTTTT TGTGGAGCGG ACGCATTATG ATGGCGTTTT CCTTCCCGGA TTTGTCGCCT GGCAGCCGGA GCACTCGCCC GGGCCGGTCG GGTTGCGGTA TATCGATCAC ATGGTAGGGA ATGTGGGCTG GAACGAAATG AACCGCTGGG TGAAGTTTTA CGAGGAAGTG ATGGGTTTTA AAATGATGGT TTCTTTCGAT GACAAGGACA TTTCGACGGA ATATTCGGCG CTCATGAGCA AAGTGATGAG CAATGGCAAC GGGCGGATCA AGTTTCCGAT CAACGAACCG GCGGAGGGTA AGAAGAAATC GCAGGTGGAA GAATACCTCG ATTTTTACGG CGGCCCGGGC GTGCAGCATA TTGCCGTGGC TACCGACCAC ATTGTCGACA CCGTGCGTGC GCTGCGCGAC CGCGGCGTGG AGTTTCTGCG CGTCCCTGCG GCCTATTACG ACGACCTCCT CAGCCGCGTC GGGCATATCG ATGAAGACAT GGAAAGCCTC CGCGAACTCG GCATTCTGGT CGACCGCGAC GACGAGGGTT ACCTCTTGCA GATATTTACC CGGCCGGTCA TGCCCCGCCC TACTTTGTTT TTTGAGATCA TTCAAAGAAA AGGTGCGCAA TCGTTTGGGA AAGGGAATTT CAAAGCATTG TTCGAGGCGA TCGAGCGCGA GCAGATGGCG CGCGGCACAC TTTAA
|
Protein sequence | MLPQTAPAPR HTTDFLPING TDYVELYVGN ALQSAHYFRN AFGFQPLAMA GLETGLTDRE SYVVVQDKIR LVFTSPLHGG TAIGAHIDRH GDGVKAIALW VDDAESAFRE TVSRGAEPFF EPVTEQDAHG RTVRAGICTY GDTVHVFVER THYDGVFLPG FVAWQPEHSP GPVGLRYIDH MVGNVGWNEM NRWVKFYEEV MGFKMMVSFD DKDISTEYSA LMSKVMSNGN GRIKFPINEP AEGKKKSQVE EYLDFYGGPG VQHIAVATDH IVDTVRALRD RGVEFLRVPA AYYDDLLSRV GHIDEDMESL RELGILVDRD DEGYLLQIFT RPVMPRPTLF FEIIQRKGAQ SFGKGNFKAL FEAIEREQMA RGTL
|
| |