Gene Dfer_2441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_2441 
Symbol 
ID8226013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp3005397 
End bp3006650 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content53% 
IMG OID644930273 
Productdihydroorotase 
Protein accessionYP_003086824 
Protein GI255036203 
COG category[R] General function prediction only 
COG ID[COG3964] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.214517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACACA GGTCCACCCT TATCATTTTC CTGGCCTGTC TGCTCGGCGT GGCCTGCCAC 
GCGCAGACGT ACAGCATCCT GATCAAAGGC GGCACGGTGA TCGACCCGAA GAATAACATC
AATCAGGTGA TGGATGTGGG TATTCTCGAC GGGAAGATCA AAAAAGTGGC GAAGGATATC
GATCCGAAAG AGGCCCGCCA GGTGGTGGAT GCCAAAGGGA TGTATGTCAC ACCCGGATTG
ATCGATATTC ATGGACATGT GTTTTTTGGT ACCGAGCCAA ACCATTATCT AAGTAATGGT
TTGGTGGCGT TACCACCCGA CGGTTTTACC TTCCGCGTCG GCGTGACGAC GATCGTCGAC
GCGGGCGGTG CCGGCTGGGA TTCGTTTTCG GAGTTTAAAA ACAATGTGAT TTTTAATTCC
AAAACCCGCG TGCTCTCCTT CCTGAACATC GTGGGCCAGG GAATGCGGGG CGGGGCCTGG
GAGCAGGACA CCGCCGATAT GGACCCGAAC CTGGCCGCCG GCGTGGCGAT CAAAAACCGC
AACGATGTGG TCGGTTTTAA AGTCGCGCAT TTCATGGGCA AAGACTGGAA ACCGGTGGAT
AATGCCGTGA AAGCCGGTAA GCTGGCCAAC ATGCCGGTTA TGATCGATTT CGGCGGCAAC
ACGCCTCCCC TTCCGCTCGA AGAGCTGTTT ATGAAACACC TTCGCCCGGG CGACATTTAT
ACACATGCAT ACACCTTGCT CGAAGGCAAT GTAAGAGAAA CCATCGTGGA TGAAGCCACG
CAAAAAGTGA AACCTTTCGC CATTGAGGCC CGCAAAAAAG GCATCGTGTT CGACGTAGGC
TATGGCGGCG CAAGCTTTAA CTACTCCCAG GCGATCCCGG CCATGAAAGC CGGTTTTCAC
CCCACCACCA TCAGCACCGA CCTCCACACC GGCAGCATGA ACGGCTCGAT GAAGGATATG
CTCAGCATTA TGTCGAAGTT TTACAATATG GGCATGGACC TGCCGGCGGT GATCAGGGCC
AGCACCTGGG AACCCGCCAA AGTGATCCAT CGCGAGCAAC TCGGCCACAT TTCGGAAGGA
GCGATCGCAG ACGTAGCCGT TTTCTCAATG CGCAAAGGCA ATTTCGGTTT TTACGATAAA
ACCGGCTACA AAGTGGAAGG CAAAGAAAAA CTCGAATGCG AGCTAACCGT CATGGGCGGA
AAAATTGTAT ACGACCTGAA CGGAATTACT CAGCCGATAT ATTTGACGAA GTAG
 
Protein sequence
MQHRSTLIIF LACLLGVACH AQTYSILIKG GTVIDPKNNI NQVMDVGILD GKIKKVAKDI 
DPKEARQVVD AKGMYVTPGL IDIHGHVFFG TEPNHYLSNG LVALPPDGFT FRVGVTTIVD
AGGAGWDSFS EFKNNVIFNS KTRVLSFLNI VGQGMRGGAW EQDTADMDPN LAAGVAIKNR
NDVVGFKVAH FMGKDWKPVD NAVKAGKLAN MPVMIDFGGN TPPLPLEELF MKHLRPGDIY
THAYTLLEGN VRETIVDEAT QKVKPFAIEA RKKGIVFDVG YGGASFNYSQ AIPAMKAGFH
PTTISTDLHT GSMNGSMKDM LSIMSKFYNM GMDLPAVIRA STWEPAKVIH REQLGHISEG
AIADVAVFSM RKGNFGFYDK TGYKVEGKEK LECELTVMGG KIVYDLNGIT QPIYLTK