Gene Dfer_4337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_4337 
Symbol 
ID8227940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp5261972 
End bp5263162 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content54% 
IMG OID644932184 
ProductDeoxyribodipyrimidine photo-lyase 
Protein accessionYP_003088704 
Protein GI255038083 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR02765] cryptochrome, DASH family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.660905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGAC GCATTATCTA TTGGTTCAGG AATGATTTGA GGTTAAAAGA TAACCAGGCG 
CTTTCTGCCG CCGTTGGCTC CGCCGACGAG ATCATACCGG TGTATGTTTT TGATCCCCGT
CAATTTGAAA AAACAAAACT AGGCTTCCGC CGCACGGGCG CATTACGCGC ACGCTTCCTC
ATCGAATCCG TGGCGGAACT CCGCGAAAAT ATCCGGCAAA AGGGCGGCGA TCTGATCATC
CGCACAGGCG CGCCCGAAGC CATCGTAGCC CAGCTCGCCG AAGATTACAA TGCCGACTAC
GTGTACACAA GCAAGGAAAT CGCGCCGCAG GAAACGCGCA TCGAATCGTC ATTGAGCAAA
AACCTCAAAA CAGCCAATGT GGACATTAAG CTGTTCTGGA TGGACACGAT GATCAATGCA
ACCGACCTGC CGTTTCCGGT ATCGAAGCTC CCCTCGGGCT TTGCGGAATT CGAGCGGCTG
CTGAGCAACG ATCTTAAAAT CAAAGACCAG TTTCCCACGC CGGCAAGCAT TACTTTACCC
GCCGACGTAG AAGCAGGAGC CATTCCTGGC CTTCCCGAGC TGGGTATCGA CCCGAACGAG
ATCCCGGCGG GAACAACAGG CCCTTTGGCC GGAGGTGAAG CGCGTGCATT GGCTGTTCTC
AAAGAATATG TGGAGGAATA TGTTAAAAAA GACATCGCCT ACCCTTCCGC CGAGCCGCTT
ACCGACACGC GCCTGTCCGA CTGGCTTTCT CTGGGATGCG TTTCGGCATC GTACATCTAT
CGCAGCGTAA AAACCGCGCA ATCGCACGCC GTAGTGGAAG ATCCGATCAT TACCAACCTG
CTAAGAAGGG AATTCCTGCA TTGGACATTG CTCCGTTTCG GCCCGCGGAT GTTCAAACCC
AGCGGTGTAA AACATCATTT CAACCGCCGT TGGAAAAATG ATAATGCAGC GTTTGAAAAA
TGGATCAATG GGCAAACCGG CGACCAGTCA ATAAATGACA TCATCCGCAG GCTAACCGCA
ACCGGCTTCA TTACCGCCGC TGAACGCGAG TCGGCCGCGC GGTACCTGGT GGACGACCTG
GATATCAACT GGACCTGGGG TGCTATGTAT TTCGAAAGCC TGCTGATGGA CTACGAAGCA
TCTGTGAACT GGGGCCGCTG GAACCATATC GCAGGAGTGG GTGAAGACTA A
 
Protein sequence
MARRIIYWFR NDLRLKDNQA LSAAVGSADE IIPVYVFDPR QFEKTKLGFR RTGALRARFL 
IESVAELREN IRQKGGDLII RTGAPEAIVA QLAEDYNADY VYTSKEIAPQ ETRIESSLSK
NLKTANVDIK LFWMDTMINA TDLPFPVSKL PSGFAEFERL LSNDLKIKDQ FPTPASITLP
ADVEAGAIPG LPELGIDPNE IPAGTTGPLA GGEARALAVL KEYVEEYVKK DIAYPSAEPL
TDTRLSDWLS LGCVSASYIY RSVKTAQSHA VVEDPIITNL LRREFLHWTL LRFGPRMFKP
SGVKHHFNRR WKNDNAAFEK WINGQTGDQS INDIIRRLTA TGFITAAERE SAARYLVDDL
DINWTWGAMY FESLLMDYEA SVNWGRWNHI AGVGED