Gene Cagg_1807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1807 
Symbol 
ID7267719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2213965 
End bp2216913 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content63% 
IMG OID643566646 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_002463141 
Protein GI219848708 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.71292 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTGG CATCCGAACG TCACACCGTC CAAAACCCGC TCATCCACTA TGCGGTGGAG 
GCTGGCTGGG AATACCTTTC GCCCGACGAT GCACTGCGCC TGCGAGGCGG CGAGGAGAAG
CCCTTCCTGC ACGCCGTGCT GGTCGAGTCC ATCCAGCGCC TCAACCCCGG CGTGGTGACC
GAGGCCGCTC AGGCCGAAGA GATCGTCCGC AGCCTGCTCA CCTTGCGCGC CGACATCGAG
GGCAACCGTG AGGCCTGGGA ATACCTCAAG GGGCTGAAGA CCGTCTTCGT CCCGGCCGAG
CGCCGCGAAC GCAATCTTGC CCTGCTCGAC CCGGAACGCC CGGAGGCCAA TCGCTTCCAC
GTCACCGACG AGTTCACCTT CCAGAGCGGC GCCCGCCGCA TTCGCGCCGA TGTGGTCTTC
CTGGTCAACG GCATCCCGGT CATCGTTATC GAGACCAAAG CCGCCACCCG CCTGGAGGGC
ATCGCCGAGG CCTTCGATCA GATACGCCGC TACCATCAGG AGGCGCCCGA CCTGATGGCG
CAGGCGCAAC TGTTTGCCCT CACCCATCTG GTGCAATTTT TCTACGGCGC CACCTGGTCG
CTCTCGCGTA AGGCGCTCTT CAATTGGCGG GAAGAGGTAA GGGCAAACGG CGGTTTGCCC
CCGCCCGATT TCGAGACCCT GGTCAAATCC TTCATCGCCC CGCGGCGCGT GCTGCGCGTG
CTCACCGATT ACATCCTCTT TGCCCGCAAG GACGGCGAAC TGCAAAAGAT CGTCTTGCGC
CCGCATCAGA TGCGCGCCAC CGAGCGCGTC CTGGCGCGTT CCTATCAGGC CGTAACGGCG
CCCAAGGCGC CCCGCCGCGG TCTGATCTGG CACACGCAAG GCTCCGGCAA GACCTACACC
ATGCTCACCA TCGCCCGCCG ATTGATTGAA GACGGACGTT TCGACAATCC CACCGTCCTG
CTCATCGTCG ACCGCAACGA ACTGGAGAGT CAACTGTTCC AGAACCTGGA AGCGGTCGGC
TTCGGACGGG TGCGCCTGGC GCGCTCCAAA CGTCACCTGC GCGAGCTGCT CCGAGCCGAT
ACGCGCGGCC TGATCGTCTC GATGATTCAC AAATTCGACG ATATGCCAGC CAATCTCTGC
CCGCGGCGCA ACGTCTTCGT GCTGGTGGAT GAGGCGCATC GCTCCACCGG CGGGGATCTC
GGCAACTACC TGATGGGCGC GCTGCCCAAC GCCGTCTTCA TCGGTTTTAC CGGCACGCCC
ATTGACCGCA CCGCCCACGG CAAGGGCACC TTCAAAGTCT TCGGTGCCGA CGACCCGCAG
GGCTACCTGG ACAAATACTC CATCCGCGAG TCCATCGAGG ACGGCGCCAC CGTCCCGCTG
CACTATCAAC TGGCGCCGAA CGACCTGATA GCCGACCGCG AGGCGATGGA GCGCGAGTTC
TGGGCGGTTG CCGAACTGGA GGGCGTGGCC GAGGTTGAAG AACTCAACCG CGTCCTCGAC
CGCGCCGTGA CATTGACCAA CAAGCTCAAG AACCGCGAGC GGGTGGACAA AATCGCCGCT
TTCGTGGCCG ATCACTTTCA GAAATACGTT CAGCCGATGG GTTACAAGGC TTTCCTGGTC
GCTGCCGACC GCGAGGCCTG CGCGTTGTAC AAAGAAGCCC TGGATCGCTA CCTGCCCGCG
GAGTGGAGCG CGGTGGTCAT CAGCGCCGGC CACAACGATC CGCCGCACCT CAAACGCTAC
CACCTGAGCG AGGAAGAGGA AACGCGCCTG CGCCGCGCCT TCCGCAAGCC GGGCGAGAAC
CCGCAGATGT TCATCGTCAC CGAGAAACTG CTCACCGGCT ACGACGCCCC CATCCTCTAC
TGCATGTACC TCGATAAGCC AATGCGCGAC CACGTGCTCT TGCAGGCGAT TGCCCGCGTC
AACCGCCCCT ACGAAAGCGA CGACGGTCAG CGCAAGACCA GCGGGCTGAT TCTCGACTTT
GTCGGCGTCT TCGAGAACCT GGAGCGGGCG CTGGCTTTCG ACTCACAGGA TGTGAGCGGG
GTGGTAGAAG GCATCGAGGT TCTCCAGCAG CGCTTTGCCG CGTTGATGGA ACAGAGCCGC
CAGGAGCATC TTACCGTCGG CCAAAATCTC TCTTCGCCCT ATGCGGCGAA GGACGACAAA
CTCGCCGAAT CCATCCTGCT GCGCTTCCGC GACAAAGACA CCCGCGAGGC CTTCTACCGC
TTCTTCCGCG AACTGGAAGA ACTCTACGAA ATCCTCTCGC CCGATCCCTT CCTGCGCCCC
TATCTGGAAG ACTACCAGCG GCTGGTGGAG ATGTATCGCT TGCTGCGTGC GGCCTATGAG
CCGCACGTGC CGGTGGATAA ATCCTTCCTG CGCAAGACGG CGGAGATCGT CCAGCAACAT
AGCCGCACCG ATGCCATCCA TGAGCCGCAG GCCACCTACG AAATTGGCCC GGTCGCGCTC
CTGGCGCTCT TGACGGAGGA AAAACCTGAA ACGGTCAAAG TCTTCAACCT GCTCAAGGAA
CTGCACCACC TGGTGGAAGA GCAAGGCCAC GCCGCTCCTT ACCTGCTCTC CATCGGCGAA
CGCGCCGAAG AAGTTCGTCG CCGCTTCGAG GAACGGCAGA TCGAGTCGCA GCAAGCGCTA
CAAGAACTGG ATGAGCTGGT TAAACAACTG AACCAGGCCC ATGCCGAACG GTCATCCAGC
CCGCTCTCGC CGCAGGCGTT CGCCGTAGAA TGGTGGTTGC GCACCCACCA GATCGCGCCA
GAGCGCGCTA TCCAGGTGGC GCAGCGCATG GAACGCGCCT TTGCGGATTT TCCCCACTGG
ATCAGTAGTC CCAGGCAGGA GAGCGAACTG CGCAAAGTGC TCTATAAAGC CATGCTCGAT
GCCGGAGTCA GCGATGTTGT CGCCTGGGCC GACGCCATCC TCAACCTGTT GCGGAGGGCT
GTCCAATGA
 
Protein sequence
MTLASERHTV QNPLIHYAVE AGWEYLSPDD ALRLRGGEEK PFLHAVLVES IQRLNPGVVT 
EAAQAEEIVR SLLTLRADIE GNREAWEYLK GLKTVFVPAE RRERNLALLD PERPEANRFH
VTDEFTFQSG ARRIRADVVF LVNGIPVIVI ETKAATRLEG IAEAFDQIRR YHQEAPDLMA
QAQLFALTHL VQFFYGATWS LSRKALFNWR EEVRANGGLP PPDFETLVKS FIAPRRVLRV
LTDYILFARK DGELQKIVLR PHQMRATERV LARSYQAVTA PKAPRRGLIW HTQGSGKTYT
MLTIARRLIE DGRFDNPTVL LIVDRNELES QLFQNLEAVG FGRVRLARSK RHLRELLRAD
TRGLIVSMIH KFDDMPANLC PRRNVFVLVD EAHRSTGGDL GNYLMGALPN AVFIGFTGTP
IDRTAHGKGT FKVFGADDPQ GYLDKYSIRE SIEDGATVPL HYQLAPNDLI ADREAMEREF
WAVAELEGVA EVEELNRVLD RAVTLTNKLK NRERVDKIAA FVADHFQKYV QPMGYKAFLV
AADREACALY KEALDRYLPA EWSAVVISAG HNDPPHLKRY HLSEEEETRL RRAFRKPGEN
PQMFIVTEKL LTGYDAPILY CMYLDKPMRD HVLLQAIARV NRPYESDDGQ RKTSGLILDF
VGVFENLERA LAFDSQDVSG VVEGIEVLQQ RFAALMEQSR QEHLTVGQNL SSPYAAKDDK
LAESILLRFR DKDTREAFYR FFRELEELYE ILSPDPFLRP YLEDYQRLVE MYRLLRAAYE
PHVPVDKSFL RKTAEIVQQH SRTDAIHEPQ ATYEIGPVAL LALLTEEKPE TVKVFNLLKE
LHHLVEEQGH AAPYLLSIGE RAEEVRRRFE ERQIESQQAL QELDELVKQL NQAHAERSSS
PLSPQAFAVE WWLRTHQIAP ERAIQVAQRM ERAFADFPHW ISSPRQESEL RKVLYKAMLD
AGVSDVVAWA DAILNLLRRA VQ