Gene Dfer_3921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_3921 
Symbol 
ID8227516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp4767714 
End bp4769417 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content51% 
IMG OID644931762 
Productcarboxyl-terminal protease 
Protein accessionYP_003088290 
Protein GI255037669 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCAG AAAGACCTCA GCCCCCGATC CAGAACTCGA AATCCGTAGT TCGCTTGCCT 
ATTATCATCG CGATCACGCT GGCGGCGGGA GTGCTGCTGG GGAGCACGTT TTTCAGCGGA
GGTAAAAAAC TGTCCGATGT TGCCAAAGGG TACAGCAAAT TCAGGGAAGT ATTAATGCTC
GTCGAAAACA ACTATGTCGA TTCGGTCAAT ACCGAGGAAC TGGTGGATTT CTCGATCTCC
AAAATGCTCG AAAAGCTTGA TCCGCATACG GCCTATTTCA ATTCCGAGGA AGCTACCGCG
GCACGCTCGC AGCTCGAATC GGGATTTGAC GGCATCGGGG TCGAATTCAA TATTTACAAC
GACACGGTTT ACGTGGTAAC GCCATTGAGC GGAGGTCCGT CGGAGGCTGC CGGTATCCAG
AGTGGCGATC GCATTATTTC AGTGAATAAA GAAAACCTGT CGGGTCCGGG CGTTAGCAAT
GCGCAGGTTT ACAAGCTCTT GCGTGGCAAA CGCGGAACAA AAGTGGACCT GGCCATTGAA
AGGGTGGGCC TGAACGACAA AATGAATTTC TCGGTAGTTC GCGACCGTAT TCCCACTTAT
TCGGTGGATG CGGCCTATAT GGTGGATCAG GAAATCGGTT ATATCAAGGT GAGCCGTTTT
TCCGAAACCA CTTACGACGA GTTTAAATCG GCATTGAAAA CATTGAAAGC GGATGGTTTG
AAAAACCTCA TTCTCGACCT CCGCGGCAAT CCGGGTGGTT ATATGGAACG CGCCACAAGC
ATGGCCGACG AGTTTATTTC CGGCGATAAG CTGCTGGTTT ACACTGAAGG AAAAGACAGC
CGGTTCGATC GCAAAACGCG TTCGCACGTG GCAGGCATGT TCGAGCAGGG CCCGCTGATC
GTGCTCGTGG ACGAAGGCAG CGCCTCAGCT TCCGAAATCC TCGCGGGTGC ATTGCAGGAT
CACGACCGCG CGCTGGTGGT GGGAAGAAGG TCTTATGGAA AAGGTTTGGT ACAAATGCCG
ATCAAACTAT CGGACGGCTC GGAGCTGCGC CTTACCATCT CGCGCTACTT CACACCGAGC
GGCCGCAGCA TCCAGAAACC TTACGAGCTC GGCAAGGGCG AAGATTACAG CCAGGACCTC
ACGCACCGGT ACGAAAGCGG AGAGCTGTTT AACGTAGACA GCATTAAATT CGATAAAAGC
AAGGTATACA AAACCGATGG CGGCCGTATC GTGTACGGCG GCGGAGGCAT TACACCGGAT
ATTTTCGTGC CGAAAGACAC ATTGCTCAAC AGCAAATATC TTTTTGAATT GTATTCCAAA
AACATCATCC GCGAATATGC ATTGCGGTAT GCCAATGAAA ACCAGAGAAA ACTGGAAAAA
CTGCCGTTTA AAGAGTTCCT GAAAACGTTC GAAGTGAGCG ACGCCATGGT GGTCGAGCTG
GTGAAAGACG CGTCCAAAGC GGGAATTAAA CCGAACGAGA AGGAACTGAA CCTTTCAAGA
CCGCTCATTA CCTCGCAAAC GAAGGCGATC ATCGGTCGTT ACGTGTGGGG CAGAAAGCAG
AAAAGCGGGC TGAATAACGA AGTGTTCCAG GTGCTGAACC CGACCGACAA TGTGTATCAG
CACGCGGTAC AGCTTTTCAG CCAGGCGGCG CAGTTGGAAA AAGGCGAATT CAGCAGTCTT
AATATTCCCA AAAACAAAAA GTAA
 
Protein sequence
MNSERPQPPI QNSKSVVRLP IIIAITLAAG VLLGSTFFSG GKKLSDVAKG YSKFREVLML 
VENNYVDSVN TEELVDFSIS KMLEKLDPHT AYFNSEEATA ARSQLESGFD GIGVEFNIYN
DTVYVVTPLS GGPSEAAGIQ SGDRIISVNK ENLSGPGVSN AQVYKLLRGK RGTKVDLAIE
RVGLNDKMNF SVVRDRIPTY SVDAAYMVDQ EIGYIKVSRF SETTYDEFKS ALKTLKADGL
KNLILDLRGN PGGYMERATS MADEFISGDK LLVYTEGKDS RFDRKTRSHV AGMFEQGPLI
VLVDEGSASA SEILAGALQD HDRALVVGRR SYGKGLVQMP IKLSDGSELR LTISRYFTPS
GRSIQKPYEL GKGEDYSQDL THRYESGELF NVDSIKFDKS KVYKTDGGRI VYGGGGITPD
IFVPKDTLLN SKYLFELYSK NIIREYALRY ANENQRKLEK LPFKEFLKTF EVSDAMVVEL
VKDASKAGIK PNEKELNLSR PLITSQTKAI IGRYVWGRKQ KSGLNNEVFQ VLNPTDNVYQ
HAVQLFSQAA QLEKGEFSSL NIPKNKK