Gene Information |
Locus tag | Cagg_1807 |
Symbol | |
ID | 7267719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2213965 |
End bp | 2216913 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643566646 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_002463141 |
Protein GI | 219848708 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
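The coordinates, gene length, and protein length in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the gene is listed as unspliced, and the gene sequence in this record ends in TGA). The function names are hypothetical helpers for illustration and are not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 2213965
END_BP = 2216913
GENE_LENGTH_BP = 2949
PROTEIN_LENGTH_AA = 982


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons are expected to hold if the assumptions above are right.
    print("gene length consistent:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length consistent:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2216913 - 2213965 + 1 gives 2949 bp, and 2949 / 3 - 1 gives 982 aa, matching the reported values.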
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.71292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCTGG CATCCGAACG TCACACCGTC CAAAACCCGC TCATCCACTA TGCGGTGGAG GCTGGCTGGG AATACCTTTC GCCCGACGAT GCACTGCGCC TGCGAGGCGG CGAGGAGAAG CCCTTCCTGC ACGCCGTGCT GGTCGAGTCC ATCCAGCGCC TCAACCCCGG CGTGGTGACC GAGGCCGCTC AGGCCGAAGA GATCGTCCGC AGCCTGCTCA CCTTGCGCGC CGACATCGAG GGCAACCGTG AGGCCTGGGA ATACCTCAAG GGGCTGAAGA CCGTCTTCGT CCCGGCCGAG CGCCGCGAAC GCAATCTTGC CCTGCTCGAC CCGGAACGCC CGGAGGCCAA TCGCTTCCAC GTCACCGACG AGTTCACCTT CCAGAGCGGC GCCCGCCGCA TTCGCGCCGA TGTGGTCTTC CTGGTCAACG GCATCCCGGT CATCGTTATC GAGACCAAAG CCGCCACCCG CCTGGAGGGC ATCGCCGAGG CCTTCGATCA GATACGCCGC TACCATCAGG AGGCGCCCGA CCTGATGGCG CAGGCGCAAC TGTTTGCCCT CACCCATCTG GTGCAATTTT TCTACGGCGC CACCTGGTCG CTCTCGCGTA AGGCGCTCTT CAATTGGCGG GAAGAGGTAA GGGCAAACGG CGGTTTGCCC CCGCCCGATT TCGAGACCCT GGTCAAATCC TTCATCGCCC CGCGGCGCGT GCTGCGCGTG CTCACCGATT ACATCCTCTT TGCCCGCAAG GACGGCGAAC TGCAAAAGAT CGTCTTGCGC CCGCATCAGA TGCGCGCCAC CGAGCGCGTC CTGGCGCGTT CCTATCAGGC CGTAACGGCG CCCAAGGCGC CCCGCCGCGG TCTGATCTGG CACACGCAAG GCTCCGGCAA GACCTACACC ATGCTCACCA TCGCCCGCCG ATTGATTGAA GACGGACGTT TCGACAATCC CACCGTCCTG CTCATCGTCG ACCGCAACGA ACTGGAGAGT CAACTGTTCC AGAACCTGGA AGCGGTCGGC TTCGGACGGG TGCGCCTGGC GCGCTCCAAA CGTCACCTGC GCGAGCTGCT CCGAGCCGAT ACGCGCGGCC TGATCGTCTC GATGATTCAC AAATTCGACG ATATGCCAGC CAATCTCTGC CCGCGGCGCA ACGTCTTCGT GCTGGTGGAT GAGGCGCATC GCTCCACCGG CGGGGATCTC GGCAACTACC TGATGGGCGC GCTGCCCAAC GCCGTCTTCA TCGGTTTTAC CGGCACGCCC ATTGACCGCA CCGCCCACGG CAAGGGCACC TTCAAAGTCT TCGGTGCCGA CGACCCGCAG GGCTACCTGG ACAAATACTC CATCCGCGAG TCCATCGAGG ACGGCGCCAC CGTCCCGCTG CACTATCAAC TGGCGCCGAA CGACCTGATA GCCGACCGCG AGGCGATGGA GCGCGAGTTC TGGGCGGTTG CCGAACTGGA GGGCGTGGCC GAGGTTGAAG AACTCAACCG CGTCCTCGAC CGCGCCGTGA CATTGACCAA CAAGCTCAAG AACCGCGAGC GGGTGGACAA AATCGCCGCT TTCGTGGCCG ATCACTTTCA GAAATACGTT CAGCCGATGG GTTACAAGGC TTTCCTGGTC GCTGCCGACC GCGAGGCCTG CGCGTTGTAC AAAGAAGCCC TGGATCGCTA CCTGCCCGCG GAGTGGAGCG CGGTGGTCAT CAGCGCCGGC CACAACGATC CGCCGCACCT CAAACGCTAC CACCTGAGCG AGGAAGAGGA AACGCGCCTG CGCCGCGCCT TCCGCAAGCC GGGCGAGAAC 
CCGCAGATGT TCATCGTCAC CGAGAAACTG CTCACCGGCT ACGACGCCCC CATCCTCTAC TGCATGTACC TCGATAAGCC AATGCGCGAC CACGTGCTCT TGCAGGCGAT TGCCCGCGTC AACCGCCCCT ACGAAAGCGA CGACGGTCAG CGCAAGACCA GCGGGCTGAT TCTCGACTTT GTCGGCGTCT TCGAGAACCT GGAGCGGGCG CTGGCTTTCG ACTCACAGGA TGTGAGCGGG GTGGTAGAAG GCATCGAGGT TCTCCAGCAG CGCTTTGCCG CGTTGATGGA ACAGAGCCGC CAGGAGCATC TTACCGTCGG CCAAAATCTC TCTTCGCCCT ATGCGGCGAA GGACGACAAA CTCGCCGAAT CCATCCTGCT GCGCTTCCGC GACAAAGACA CCCGCGAGGC CTTCTACCGC TTCTTCCGCG AACTGGAAGA ACTCTACGAA ATCCTCTCGC CCGATCCCTT CCTGCGCCCC TATCTGGAAG ACTACCAGCG GCTGGTGGAG ATGTATCGCT TGCTGCGTGC GGCCTATGAG CCGCACGTGC CGGTGGATAA ATCCTTCCTG CGCAAGACGG CGGAGATCGT CCAGCAACAT AGCCGCACCG ATGCCATCCA TGAGCCGCAG GCCACCTACG AAATTGGCCC GGTCGCGCTC CTGGCGCTCT TGACGGAGGA AAAACCTGAA ACGGTCAAAG TCTTCAACCT GCTCAAGGAA CTGCACCACC TGGTGGAAGA GCAAGGCCAC GCCGCTCCTT ACCTGCTCTC CATCGGCGAA CGCGCCGAAG AAGTTCGTCG CCGCTTCGAG GAACGGCAGA TCGAGTCGCA GCAAGCGCTA CAAGAACTGG ATGAGCTGGT TAAACAACTG AACCAGGCCC ATGCCGAACG GTCATCCAGC CCGCTCTCGC CGCAGGCGTT CGCCGTAGAA TGGTGGTTGC GCACCCACCA GATCGCGCCA GAGCGCGCTA TCCAGGTGGC GCAGCGCATG GAACGCGCCT TTGCGGATTT TCCCCACTGG ATCAGTAGTC CCAGGCAGGA GAGCGAACTG CGCAAAGTGC TCTATAAAGC CATGCTCGAT GCCGGAGTCA GCGATGTTGT CGCCTGGGCC GACGCCATCC TCAACCTGTT GCGGAGGGCT GTCCAATGA
|
Protein sequence | MTLASERHTV QNPLIHYAVE AGWEYLSPDD ALRLRGGEEK PFLHAVLVES IQRLNPGVVT EAAQAEEIVR SLLTLRADIE GNREAWEYLK GLKTVFVPAE RRERNLALLD PERPEANRFH VTDEFTFQSG ARRIRADVVF LVNGIPVIVI ETKAATRLEG IAEAFDQIRR YHQEAPDLMA QAQLFALTHL VQFFYGATWS LSRKALFNWR EEVRANGGLP PPDFETLVKS FIAPRRVLRV LTDYILFARK DGELQKIVLR PHQMRATERV LARSYQAVTA PKAPRRGLIW HTQGSGKTYT MLTIARRLIE DGRFDNPTVL LIVDRNELES QLFQNLEAVG FGRVRLARSK RHLRELLRAD TRGLIVSMIH KFDDMPANLC PRRNVFVLVD EAHRSTGGDL GNYLMGALPN AVFIGFTGTP IDRTAHGKGT FKVFGADDPQ GYLDKYSIRE SIEDGATVPL HYQLAPNDLI ADREAMEREF WAVAELEGVA EVEELNRVLD RAVTLTNKLK NRERVDKIAA FVADHFQKYV QPMGYKAFLV AADREACALY KEALDRYLPA EWSAVVISAG HNDPPHLKRY HLSEEEETRL RRAFRKPGEN PQMFIVTEKL LTGYDAPILY CMYLDKPMRD HVLLQAIARV NRPYESDDGQ RKTSGLILDF VGVFENLERA LAFDSQDVSG VVEGIEVLQQ RFAALMEQSR QEHLTVGQNL SSPYAAKDDK LAESILLRFR DKDTREAFYR FFRELEELYE ILSPDPFLRP YLEDYQRLVE MYRLLRAAYE PHVPVDKSFL RKTAEIVQQH SRTDAIHEPQ ATYEIGPVAL LALLTEEKPE TVKVFNLLKE LHHLVEEQGH AAPYLLSIGE RAEEVRRRFE ERQIESQQAL QELDELVKQL NQAHAERSSS PLSPQAFAVE WWLRTHQIAP ERAIQVAQRM ERAFADFPHW ISSPRQESEL RKVLYKAMLD AGVSDVVAWA DAILNLLRRA VQ