Gene Dfer_3349 details

Gene Information

Locus tag: Dfer_3349
Symbol:
ID: 8226927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 4099512
End bp: 4100750
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 56%
IMG OID: 644931180
Product: dihydroorotase
Protein accession: YP_003087725
Protein GI: 255037104
COG category: [R] General function prediction only
COG ID: [COG3964] Predicted amidohydrolase
TIGRFAM ID:
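The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the protein length excludes a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4099512
END_BP = 4100750
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 412


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count of an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # matches Gene Length
print(expected_protein_length(GENE_LENGTH_BP))  # matches Protein Length
```

Under these assumptions the span works out to 1239 bp and the protein to 412 aa, in agreement with the listed values.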


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.000206965
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCAT TATTATTCCA AATCACGGGA TGCTGGCTCG CATTCGCGGC GTTGGTCGCT 
GTAAATGCCC GTGCCCAATC CTACGACCTG CTCATCAAAG GCGCGCACGT GATCGATCCG
GCCAGCTCGG TCAATGCCAG GATGGACGTT GCCGTGAGTC AGGGGAAAAT CGCCCGGGTG
GCGGCTGATA TCCGCGGCTC GGCGGCGCAG GTGATCAATG CCGACGGCCT GTACCTCACG
CCCGGATTTA TCGATCCGCA CACGCATGTG TTCGTAGGCG CGCAGACGGG ACAATTTGCC
AATGGTGTAA ACAGTGTTTC GCCCGACGAT TTTACATTCC GCTCGGGTGT AACCACCGTT
GTGGACGCGG GAACTTCCGG CTGGCGGAGC TTTCCGTTGT TTAAAAAACA GGTGATTGAT
CAGTCCAAAA CCCGCATTCT GGCATTTCTG AACATCTCCG GCGGCGGTAT GACGGGCGCG
GCGCATGAGC AGAACTTGCA GGACATGAAT GTGGATTCGG CTGTTGCCAC GATCCGGCAA
TATGCGGCTG TGATCGCAGG CGTGAAGATC GGGCATTATA ACGGGAAGGA ATGGACGCCG
TTCGATAATG CATTGAATGC TGCCAAACAA TCCGGCAAGC CGCTTTTTGT CGAATGCCAT
TTACCGGAAT ATTCCCTGGA AGATCAGCTG GCCAGAATGC GGCCCGGCGA TATGATCACC
CACACGTTCG AGAACATCAA AGAACGAATG CCGATTGTAG GCGACGACGG CCGCGTGCGC
CCCTTCGTGC TCGAAGCCCG GAAGCGCGGC GTGCTGTTCG ACCTCGGTCA TGGCGGTGCT
GGTTTCTGGT TCGATCAGGC GGTGCCCGCC GTGAAGCAGG GTTTTTGGCC CAATTCGTTT
GGGACGGACC TGCATCGTTT CAGCATGAAC TCTGCCATGA AGGATATGTC AAACGTCATG
TCGAAGTTCA TGGCGATGGG CCTTTCGCTT GAAGAGGTGG TGCGGATCGC GACGTGGAAT
GCGGCCCAGG CAATCGGTCA TCCCGAGCTG GGTGCACTGC GCGAGGGAAA TCCGGCCGAT
ATCGCATTGT TCCGGTTGCG GGAAGGTACA TTCGGTTTCA TGGATTCTGT CGGAAACAGC
ATCAGCGGCA GCCGGAAGCT GGAAGTGGAA ATGACCGTCC GCGAGGGAAA GGTAGTGTGG
GATTTGAATG GGCTCGCAGC GAAGAAAAAG GGATTTTAA
 
Protein sequence
MKALLFQITG CWLAFAALVA VNARAQSYDL LIKGAHVIDP ASSVNARMDV AVSQGKIARV 
AADIRGSAAQ VINADGLYLT PGFIDPHTHV FVGAQTGQFA NGVNSVSPDD FTFRSGVTTV
VDAGTSGWRS FPLFKKQVID QSKTRILAFL NISGGGMTGA AHEQNLQDMN VDSAVATIRQ
YAAVIAGVKI GHYNGKEWTP FDNALNAAKQ SGKPLFVECH LPEYSLEDQL ARMRPGDMIT
HTFENIKERM PIVGDDGRVR PFVLEARKRG VLFDLGHGGA GFWFDQAVPA VKQGFWPNSF
GTDLHRFSMN SAMKDMSNVM SKFMAMGLSL EEVVRIATWN AAQAIGHPEL GALREGNPAD
IALFRLREGT FGFMDSVGNS ISGSRKLEVE MTVREGKVVW DLNGLAAKKK GF
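The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares for internal codons; table 11 additionally permits alternative start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAAAGCAT TATTATTCCA AATCACGGGA"))  # MKALLFQITG
```

The output corresponds to the first ten residues of the protein sequence. Passing the full gene sequence to `translate` in the same way should reproduce the listed protein, provided the stated assumptions hold.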