Gene Dfer_2023 details

Gene Information

Locus tag: Dfer_2023
Symbol:
ID: 8225595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 2465594
End bp: 2468779
Gene Length: 3186 bp
Protein Length: 1061 aa
Translation table: 11
GC content: 54%
IMG OID: 644929860
Product: peptidase M12B ADAM/reprolysin
Protein accession: YP_003086411
Protein GI: 255035790
COG category:
COG ID:
TIGRFAM ID:
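
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2465594
end_bp = 2468779

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp)     # 3186
print(protein_length_aa)  # 1061
```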


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0538769
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGCAC ATCTCCCATT CAAATACCTG ACAATCATCC TGTTGCTCCT GACGGTCGGA 
ATTGCGCGCG CTCAGCAAGC GGCGCCGTCC TGCGGTACCA ACGACTCACT GATGATCGCT
TATCTGGAAA AAATTTCCAA AAGAACGGAC CTCACCCAGG CGCGGATGCA CGCCGGTGAG
ATGCTGGAAT ACCGCATTGC CGTGGATGTC GAATACAAAA CTGCGGCGCA ATACAACCAT
GACCAGGAGA AGATTAAAGA GGCTGTTTAC AAAATGTTCC GGGAGGCTTC TGCCATTTTC
GAGCGCGAAA TGAATATCAA GCTGACCGTT TCGTTCATCC ACATCTGGCA GGAACCGGAG
CCCTACACCT TGGAATTTGA CTACGACTAT TTCACGAAAG TGCAGAATTA CTGGCTCGAA
CATCGCAAGG AAGAACGGGA CGCCGTCGTG GGGATGTCGA TCCGTTCCGG CTGGTTTTAC
GGAGGTTACC GGATGGCCAC ATCCAACCTG CCGTCCCCCA ATACCACGCT TTTGCCCGAC
CTGCTCGCCC ACGAGCTGGG CCATACGCTC GGCTCTCCGC ACACACACAG CTGTTCCTGG
CCCGGCGGGG CGATCGACCA TTGCGAGGAG CTGGAAGGTG CCAACGAAGC CTGTCCGTCG
GGTAGCCGGG AAGTGGTTAA CGGCACGATC ATGTCGTATT GCCGATCCAA GCTTACATTC
CACCCTTTTT GCCGCAATCT CATAAGAGAG TATGCCGAAG GGAAAGTCAA TCCGTCATTT
TCATTAAAAC CATTTACTGA AAAGCCTGCG GCGCCCGGCC CACTGACCGT ACTCACCAAG
CCCGGCACCG CGAACGATTT CGCACCGTAT TTTGAATGGT TTGCCTCCGA GCGCGCCGCT
CAATTCCGGT TCCAGATCGC CACCGACGAA GCATTCACCC AAATCGTGGA GGATACGGTG
GTAAACCAGG CATTTCACCG CTCACCGGGA CAAAGCGACG GCAATTACTT CGCCCGCTTC
CGCGCCGAGA ACAGCCACGG CACCACCGAA TGGTCGGCGC CGGCAGCATT CAGCATCAGC
GGCTGGAAAA GCGCTTCACT GCCTCCGCAA TTTATCAGCC TTTCCCGCAC GAGCGCGGGG
ACCATTTCCG GCTCGTTCCG AAACCTGGAA GGTATTTCGT CGTACCAGCT GGAAGTACAA
CTCCCGTTTT CCTCCGAGGT GTATGAAATC ACCCGGCAAC CGGATAACCA GAACATCCAG
CACTTTTCCA TTCAGCTGCC GTTAAAAAAA GACCAGTACT ACGCATTGAA ATTCCGGGTG
AACAGATTTG GCTCCTGGAC CGAATGGTCG AACTGGAAAG AGTTGTACGC CACGGATTTC
ACGACAAACA TCCTGCCCGA TTCTACCCAG CCGCTGTCTA CCCGGCCGTT GCTCGCGCTA
CGCCAGTGGG TGCCCGACCG CGGCCCCGAA GCCTATACCG GCGTATTCCA GGTAGCAACG
GATGCTTCAT TTGAAAACAT CGTTTTTGAG CAGTCATTTT CGAATAATGA AGCCAATCAT
TCACTTTCTG ACAAATCGGT GTTTGTACCG TCATTGGAAG AAAATACAAC CTATTACGTT
CGGTCGCGCA TGCAGCTGGC CAATCTCGCA CCGTCGGGCT GGGAGGTTTC ACGCATGAAT
ACGGGCTCGC ACGACAATCG GTTCGCGTTC CTGGGAACTC CGGCCGAAGT GGTTCAAAGT
ACCGGCTACG CCACCGCGGA CGTGCTATTT AACCGGTTTA TGAAAGCGGG CGAGCACCTG
TATGTGTTCA ACTTCCAGGG CGGTTACCAC CGTACCAGCG ACCTGCAAAC ATGGAAAACC
TATTTGCCAT CGACCACCAA AGGGCAAAGC CCCATGTACG TAGGCGCATT TGGGGCCGCG
CCCGACGGGC AGACACTCAC GATCGACTTC ATGAAGAACA TTGCCGTACA AATGACCGAC
GACCAATATG AGAAAAGCTT TTCGGCCACG CCATTATATA TGGGCTATCT CCAACCCATG
GTGTATACCA AAAACGAAGG CTATTTTTTC AAAAGCTATG AGGAAGGTGT CATCCAGGTC
CAGAATGGCG TGTGGACGAA GCATCAGAAC CAGCCTCATG TGTTCCGGCC TGTGACCCTG
GCCAGCGATA ACCGCGACAG GGTGTGGAGC ATGGGCGATG GCGGCTATAT GGCTTATTAT
GAAAATGGCC AATGGACCTC TCAGCCGCAG TTTCCTTTCT GGGACGGCGT GAGCGGGATG
GTATTTGATG AGAAGGACAA TTGTTACGTG TACGGCTCCT TCGGCGTGTC CGTGCTCAAC
ACGGCAACGG GTTCATGGGA CGCGATCGAG GCGCTACGGC CATTTCCTAT CCGGAAAATC
GTTTTTGACG GCAGCGGAAA CATGTGGCTT GCTTCCTACC GGAGAACCCG GTGGGACGCT
CAGAAGATGG ACAACTTTGC GCTTGTGAAA TACGCGGACG GCAAGGCCAG CGCCTACTCC
GACGGCCTTA ACCTCCTCCA CGAACCCTTC GATATTGAGT ATTACAAAGA CAAATTGCTC
ATTATGACCA CCGGCGGCGA GATCCAATCC TTCGACGACC GGCAGATCCT CTCTTTCAAT
CCCAAGAGAA CCTATTGCCC CGGCGAACCG CTGGACCTGA AACTGGCGAC CAACTCCACT
TTTGCTGCCG ATAACAAAAC AAGTGTAGAA CTGCAACAAG TCACGACCGG CAAGACAGTC
GCGTGGGAAA TAACCAGCAA TGCATCGCAA AAGCTCGTAG CCATGCTGCC CGACACGCTC
ACCGAGGGCC GGTATTCGCT GGCAATCCGC ACGACTGCGC CCGAGATTAC CACATACCGC
AGCGACGAGT TTGAAGTATT ACCTGCGAAC ACCTGCGGCG AGGCCAAAGG TGTGGTGCTG
CTGCAAAACC GTCCCAACCC GTTTGGAGCG TCGGGAACAA TTTCCTTTTA CCTACCGCAG
TCGGAAGAGG TGCGGCTGGA ATTATTTAAC CTGATGGGTC AGCGAATGGA AGAGTTGAAA
AACGGGGTGT TGCCGCAGGG CTGGCACACG GTGGATGTAA ACGGGACATC GCTGGCGGCG
GGCTTGTACG TTTACCGCCT GAAAGCCGGC AAAATCACCC GGTCGCTCAA AATGATCCGT
AAATAA
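
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be removed before any computation such as GC content. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a short demonstration, so the result is not expected to match the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, used only as a demonstration input.
first_line = "ATGCCAGCAC ATCTCCCATT CAAATACCTG ACAATCATCC TGTTGCTCCT GACGGTCGGA"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` would give the whole-gene value for comparison with the reported GC content.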
 
Protein sequence
MPAHLPFKYL TIILLLLTVG IARAQQAAPS CGTNDSLMIA YLEKISKRTD LTQARMHAGE 
MLEYRIAVDV EYKTAAQYNH DQEKIKEAVY KMFREASAIF EREMNIKLTV SFIHIWQEPE
PYTLEFDYDY FTKVQNYWLE HRKEERDAVV GMSIRSGWFY GGYRMATSNL PSPNTTLLPD
LLAHELGHTL GSPHTHSCSW PGGAIDHCEE LEGANEACPS GSREVVNGTI MSYCRSKLTF
HPFCRNLIRE YAEGKVNPSF SLKPFTEKPA APGPLTVLTK PGTANDFAPY FEWFASERAA
QFRFQIATDE AFTQIVEDTV VNQAFHRSPG QSDGNYFARF RAENSHGTTE WSAPAAFSIS
GWKSASLPPQ FISLSRTSAG TISGSFRNLE GISSYQLEVQ LPFSSEVYEI TRQPDNQNIQ
HFSIQLPLKK DQYYALKFRV NRFGSWTEWS NWKELYATDF TTNILPDSTQ PLSTRPLLAL
RQWVPDRGPE AYTGVFQVAT DASFENIVFE QSFSNNEANH SLSDKSVFVP SLEENTTYYV
RSRMQLANLA PSGWEVSRMN TGSHDNRFAF LGTPAEVVQS TGYATADVLF NRFMKAGEHL
YVFNFQGGYH RTSDLQTWKT YLPSTTKGQS PMYVGAFGAA PDGQTLTIDF MKNIAVQMTD
DQYEKSFSAT PLYMGYLQPM VYTKNEGYFF KSYEEGVIQV QNGVWTKHQN QPHVFRPVTL
ASDNRDRVWS MGDGGYMAYY ENGQWTSQPQ FPFWDGVSGM VFDEKDNCYV YGSFGVSVLN
TATGSWDAIE ALRPFPIRKI VFDGSGNMWL ASYRRTRWDA QKMDNFALVK YADGKASAYS
DGLNLLHEPF DIEYYKDKLL IMTTGGEIQS FDDRQILSFN PKRTYCPGEP LDLKLATNST
FAADNKTSVE LQQVTTGKTV AWEITSNASQ KLVAMLPDTL TEGRYSLAIR TTAPEITTYR
SDEFEVLPAN TCGEAKGVVL LQNRPNPFGA SGTISFYLPQ SEEVRLELFN LMGQRMEELK
NGVLPQGWHT VDVNGTSLAA GLYVYRLKAG KITRSLKMIR K
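
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the tool that generated this record. It builds the codon table from the compact NCBI-style amino acid string, whose elongation assignments are the same for translation table 11 and the standard code. The function name `translate` is hypothetical, and alternative start codons, which table 11 permits, are not handled because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
print(translate("ATGCCAGCAC ATCTCCCATT CAAATACCTG ACAATCATCC TGTTGCTCCT GACGGTCGGA"))
# MPAHLPFKYLTIILLLLTVG

# Last twelve bases of the gene sequence, ending in the TAA stop codon.
print(translate("ATCCGTAAATAA"))  # IRK
```

Both outputs match the corresponding ends of the protein sequence listed above.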