Gene Dfer_2107 details


Gene Information

Locus tag: Dfer_2107
Symbol:
ID: 8225679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 2572459
End bp: 2573583
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 58%
IMG OID: 644929944
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_003086495
Protein GI: 255035874
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
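
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2572459
end_bp = 2573583
gene_length_bp = 1125
protein_length_aa = 374

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1125 0 374
```

Under these assumptions the span matches the listed gene length, and 375 codons minus the stop codon gives the listed 374 aa.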


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0706257
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.400386
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCCCC AAACCGCCCC CGCCCCCCGC CACACGACCG ACTTCCTGCC TATTAACGGG 
ACGGATTATG TGGAACTTTA TGTCGGGAAT GCGCTGCAAT CGGCGCATTA TTTCCGGAAT
GCATTCGGGT TTCAGCCGCT GGCGATGGCC GGGCTTGAAA CCGGGCTCAC CGACCGCGAA
TCGTACGTGG TGGTGCAGGA CAAGATCCGG CTCGTTTTCA CCTCTCCGCT GCACGGCGGC
ACGGCCATTG GCGCGCATAT CGACCGCCAT GGCGATGGTG TAAAGGCCAT TGCATTGTGG
GTGGACGATG CCGAAAGTGC GTTTCGCGAG ACCGTCAGCC GCGGGGCCGA GCCCTTTTTT
GAACCGGTTA CCGAGCAGGA CGCACACGGC CGGACGGTGC GGGCCGGAAT ATGCACTTAT
GGTGACACGG TGCATGTTTT TGTGGAGCGG ACGCATTATG ATGGCGTTTT CCTTCCCGGA
TTTGTCGCCT GGCAGCCGGA GCACTCGCCC GGGCCGGTCG GGTTGCGGTA TATCGATCAC
ATGGTAGGGA ATGTGGGCTG GAACGAAATG AACCGCTGGG TGAAGTTTTA CGAGGAAGTG
ATGGGTTTTA AAATGATGGT TTCTTTCGAT GACAAGGACA TTTCGACGGA ATATTCGGCG
CTCATGAGCA AAGTGATGAG CAATGGCAAC GGGCGGATCA AGTTTCCGAT CAACGAACCG
GCGGAGGGTA AGAAGAAATC GCAGGTGGAA GAATACCTCG ATTTTTACGG CGGCCCGGGC
GTGCAGCATA TTGCCGTGGC TACCGACCAC ATTGTCGACA CCGTGCGTGC GCTGCGCGAC
CGCGGCGTGG AGTTTCTGCG CGTCCCTGCG GCCTATTACG ACGACCTCCT CAGCCGCGTC
GGGCATATCG ATGAAGACAT GGAAAGCCTC CGCGAACTCG GCATTCTGGT CGACCGCGAC
GACGAGGGTT ACCTCTTGCA GATATTTACC CGGCCGGTCA TGCCCCGCCC TACTTTGTTT
TTTGAGATCA TTCAAAGAAA AGGTGCGCAA TCGTTTGGGA AAGGGAATTT CAAAGCATTG
TTCGAGGCGA TCGAGCGCGA GCAGATGGCG CGCGGCACAC TTTAA
 
Protein sequence
MLPQTAPAPR HTTDFLPING TDYVELYVGN ALQSAHYFRN AFGFQPLAMA GLETGLTDRE 
SYVVVQDKIR LVFTSPLHGG TAIGAHIDRH GDGVKAIALW VDDAESAFRE TVSRGAEPFF
EPVTEQDAHG RTVRAGICTY GDTVHVFVER THYDGVFLPG FVAWQPEHSP GPVGLRYIDH
MVGNVGWNEM NRWVKFYEEV MGFKMMVSFD DKDISTEYSA LMSKVMSNGN GRIKFPINEP
AEGKKKSQVE EYLDFYGGPG VQHIAVATDH IVDTVRALRD RGVEFLRVPA AYYDDLLSRV
GHIDEDMESL RELGILVDRD DEGYLLQIFT RPVMPRPTLF FEIIQRKGAQ SFGKGNFKAL
FEAIEREQMA RGTL
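
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is not the database's own pipeline. It assumes the sequence is pasted in the 10-letter blocks shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering; the amino acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        # Codons containing anything other than A, C, G, T are reported as X.
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGCTCCCCC AAACCGCCCC CGCCCCCCGC"))  # MLPQTAPAPR
print(translate("CGCGGCACAC TTTAA"))                  # RGTL
```

The two printed fragments match the start and end of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives the value that the GC content field reports as a rounded percentage.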