Gene Ppro_1854 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpro_1854 
Symbol 
ID4574854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelobacter propionicus DSM 2379 
KingdomBacteria 
Replicon accessionNC_008609 
Strand
Start bp1993572 
End bp1996652 
Gene Length3081 bp 
Protein Length1026 aa 
Translation table11 
GC content64% 
IMG OID639755903 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_901524 
Protein GI118580274 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAG AACACCTGGA ACGGGAAACC CTGGGTTGGC TGGCGGATAC CGGCTATAGC 
TACCGTTATG GACTGGATAT CGCGCCGGAT GGACCGCAGC CGGAACGCGG CAGCTACAGC
CAGGTGCTGC TGGTTGGCCG CCTGCGCGAG GCGATCAACC GGCTGAATCC CCTTGTTCCC
CAAGTGGCCC GCGAAGACGC CCTGCAACAG GTGCTCAACC TGGACACACC GGTGCTCCTG
GCCGCCAACC GTGCCTTCCA CCAACTGCTG GTCAACGGCG TGCCGGTCCA GTACCAGCAG
GAGGGCGAGA CGCGGGGCGA CTTCGTGCGC CTGATGGACT TTGGCGACGC GGGCGCCAAC
GAGTGGCTGG CGGTCAACCA GTTCTCCATC AAGGGGCCGA AATTCACCCG CCGCCCGGAC
ATCGTCCTGT TCGTCAACGG CCTGCCGCTG GTGCTGCTGG AGCTGAAGAA CCCGGCCGAC
GAGAAGGCGG ACATCTGGAA GGCCTACGAC CAGATCCAGA CCTACAAGGA GCAGATCCCG
GACGTCTTCC AGTACAACGA GATCCTGGTG ATCTCGGACG GCACCGAGGC GCGCCTGGGG
TCTCTCTCCG CAAACAGGGA ACGCTTCATG GCCTGGCGCA CCATCGACGG AGTCACGCTC
GACCCGCTGG GGCAGTTCAA CGAACTGGAG ACCCTGGTGC GCGGCGTACT GGCCCCGGAC
TACCTGCTGG ACTTCCTGCG CTTCTTCGTG CTGTTCGAGG ATGACGGCAC CCTGGTCAAG
AAGATCGCCG GCTACCACCA GTTCCATGCG GTGCGCTCGG CCATCGACCA GGTGGTAGCG
GCATCCCGGC CCGGCGGCAG CCGCAAGGGG GGCGTGGTCT GGCATACCCA GGGCTCGGGC
AAAAGCATCA CCATGACCTG CTTCGCGGCG CGGGTCATGC GCGAAGCGGC CATGGAGAAC
CCCACCATCG TTGTCATCAC CGACCGCAAC GACCTGGACG GGCAACTGTT CGGCGTGTTC
TCCCTTTCCC AGGAGCTGCT GCGGGAACAG CCGGTGCAGG GAGAAACCCG CCAGGACCTG
CGCGACAAGC TGGCCAACCG CCCCTCGGGC GGCATCGTCT TTGCCACCAT CCAGAAGTTT
ATGCCCGGTG AGGATGAGGA CAGCTTTCCG ATCCTGTCCG ACCGTCACAA CATCGTCGTC
ATCGCCGACG AGGCCCACCG CACCCAGTAC GGCTTTGAAG CCAAGTTCAA GGGGGATGCC
AAGGGGTATC AGGTGGGCTA TGCCCAGCAT CTGCGGGATG CCCTGCCTAA TGCCACCTTT
GTGGCCTTCA CCGGCACCCC GGTATCATCG GAGGACCGGG ACACCCGGGC CGTGTTCGGC
GATTACATCA GCATCTACGA CATGCAGCAG GCCCGCGACG ACGGCGCCAC CGTGGCGATC
TACTTCGAAT CGCGCCTGGC CAAGCTGGGG CTGAAGACAG ACGTGCTGCC GGAGATCGAC
GCCGAGGTGG ATGAACTGGC CGAGGATGAG GAGGACGACC AGCAGGCCCG GCTCAAGAGC
CGGTGGGCGG CCTTGGAAAA GGTGGTCGGC GCCGAGCCGC GCATCAAACA GGTGGCGGCC
GACCTGGTGG CCCACTTCGA GGAACGTAAC CAGGCCCAGA GCGGCAAAGC CATGGTGGTG
GCCATGAGCC GCGACATCTG CGTCCACCTC TACAACGCCA TCATCGCCTT GCGGCCGGAG
TGGCACGACG AAGACCCGGA AAAGGGTGCC ATCAAGATCG TCATGACCGG CTCCGCCAGC
GACAAGCCGC TGCTGAGGCC GCACATATAT GCCAAGCAGA CCAAGAAGCG CCTGGAAAAG
CGCTTCAAAG ACCCGGACGA CCCCCTGAAG CTGGTCATCG TGCGGGACAT GTGGCTGACC
GGCTTCGATG CCCCCTGCGC GCACACCATG TACGTGGACA AACCGATGAA GGGGCACAAC
CTGATGCAGG CCATCGCCCG GGTCAACCGC GTCTTCCGCG ACAAGCAGGG CGGGCTGGTG
GTGGACTACA TCGGCATCGC CAACGACCTG AAGCAGGCAC TCAAGGAATA CACCGCCAGC
AAGGGGCGCG GCCGACCGAC CGTCGATGCC CGCGAAGCCT ACGCGGTGCT GGAGGAGAAG
CTGGACATCC TGCGGGCCAT GCTGCATGAG TTCGACTACA GCGGCTTCCT CACCGGCGGC
CACGGTCTGC TGGCCAGGAC CGCCAACCAT GTGCTGGGGC TCCAGGACGG CAAGAAGCGC
TTCGGCGATA CGGCGCTGGC CATGTCCAAG GCCTTCACCC TCTGTTGCAC CCTGGATGAG
GCCAGGGCCG TGCGGGAAGA GGTGGCCTTC TTCCAGGCGG TCAAGGTGCT GCTGACCAAA
CGGGAGATCA GCACTCAAAG GCGCACGGAC GAAGAACGCG AGTTGGCCAT CCGCCAGATC
ATCGGCTCGG CGCTGGTATC GGAAGATGTG GTGGACATTT TCCAGGCAGT CGGGCTGGAC
AAGCCCAACA TCGGCATCCT GGATGACGAG TTTCTGAACG ACGTCCGCAA CCTGCCGGAA
CGGAACCTGG CGGTGGAGCT GCTGGAGCGG TTGCTGGAAG GGGAGATCAG GACCCGCTTT
AGCACCAACA TCGTCCAGCA GTCCAGGTTC TCGGAGCTGC TGGCCAAGGT CATCGCCCGC
TACCAGAACC GGGCCATCGA AACCGCCCAG GTCATGGAAG AGTTGATCGC CATGGCCAAG
AAGTTTCGGG AATCCATCAA CCGGGGGGAA GAGCTGGGGC TTAATGCCGA TGAGTTGGCC
TTCTACGACG CCCTGGCCAA CAACGAGGAA GCGGTGCGTG AGATGGGTGA CGAGATACTG
AAGAAGATCG CCCATGAACT GGCGGAGAAC CTGCGGAAGA ACATCAGCGT AGACTGGTCG
GTGCGGGAAA GCGTGCGGGC CAAGCTGCGG CTGATGGTGA AGCGCATCCT GCGAAAGTAC
AAGTATCCGC CGGACCGGCA GGAAGAGGCG GTACAGCTGG TGCTGGACCA GGCGGAGACG
CTGAGCGCGG AGTGGGGGTA G
 
Protein sequence
MTEEHLERET LGWLADTGYS YRYGLDIAPD GPQPERGSYS QVLLVGRLRE AINRLNPLVP 
QVAREDALQQ VLNLDTPVLL AANRAFHQLL VNGVPVQYQQ EGETRGDFVR LMDFGDAGAN
EWLAVNQFSI KGPKFTRRPD IVLFVNGLPL VLLELKNPAD EKADIWKAYD QIQTYKEQIP
DVFQYNEILV ISDGTEARLG SLSANRERFM AWRTIDGVTL DPLGQFNELE TLVRGVLAPD
YLLDFLRFFV LFEDDGTLVK KIAGYHQFHA VRSAIDQVVA ASRPGGSRKG GVVWHTQGSG
KSITMTCFAA RVMREAAMEN PTIVVITDRN DLDGQLFGVF SLSQELLREQ PVQGETRQDL
RDKLANRPSG GIVFATIQKF MPGEDEDSFP ILSDRHNIVV IADEAHRTQY GFEAKFKGDA
KGYQVGYAQH LRDALPNATF VAFTGTPVSS EDRDTRAVFG DYISIYDMQQ ARDDGATVAI
YFESRLAKLG LKTDVLPEID AEVDELAEDE EDDQQARLKS RWAALEKVVG AEPRIKQVAA
DLVAHFEERN QAQSGKAMVV AMSRDICVHL YNAIIALRPE WHDEDPEKGA IKIVMTGSAS
DKPLLRPHIY AKQTKKRLEK RFKDPDDPLK LVIVRDMWLT GFDAPCAHTM YVDKPMKGHN
LMQAIARVNR VFRDKQGGLV VDYIGIANDL KQALKEYTAS KGRGRPTVDA REAYAVLEEK
LDILRAMLHE FDYSGFLTGG HGLLARTANH VLGLQDGKKR FGDTALAMSK AFTLCCTLDE
ARAVREEVAF FQAVKVLLTK REISTQRRTD EERELAIRQI IGSALVSEDV VDIFQAVGLD
KPNIGILDDE FLNDVRNLPE RNLAVELLER LLEGEIRTRF STNIVQQSRF SELLAKVIAR
YQNRAIETAQ VMEELIAMAK KFRESINRGE ELGLNADELA FYDALANNEE AVREMGDEIL
KKIAHELAEN LRKNISVDWS VRESVRAKLR LMVKRILRKY KYPPDRQEEA VQLVLDQAET
LSAEWG