Gene Information |
Locus tag | Acid345_3755 |
Symbol | |
ID | 4069330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4435635 |
End bp | 4438724 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637985777 |
Product | HsdR family type I site-specific deoxyribonuclease |
Protein accession | YP_592829 |
Protein GI | 94970781 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
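The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 4435635
END_BP = 4438724
GENE_LENGTH_BP = 3090
PROTEIN_LENGTH_AA = 1029


def cds_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS.

    Assumes the length is a multiple of 3 and that the final codon
    is a stop codon, which contributes no residue.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    span = cds_span(START_BP, END_BP)
    print("span matches Gene Length:", span == GENE_LENGTH_BP)
    print("protein length matches:",
          expected_protein_length(span) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4438724 - 4435635 + 1 gives 3090 bp, and 3090 / 3 - 1 gives 1029 aa, which agrees with the Gene Length and Protein Length fields.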
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0174443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGACTACTG TCTCCGCGTT CACCGAATCT GTGGTCGAAG AGGCGGCGCT TGCTTGGTTG GAAGGGCTGG GCTACACGGT ACTTTCCGGC CCAAGCATTT CTGCTGGAGA AATTGCCGCC GAGCGGATCG AGGCTGATTA CTCCGACGTT ATTCTTGAGC GGCGCGTTAG ACAGACACTG CAACGCCTCA ATCCTAATCT CCCACCAGAA GCGTTAGACG ACGCGTTCCG CAGGCTCACT CACGACAATG AAACGTCTTT GATCGCTAAT AACCATGCCT TTCATCAGAT TTTGGTGAAT GGCGTTACCG TTGAGTACAA CCGAGCCGAT GGTTCCATCG GCGGAGCCTT GGTGCGCGTT ATTGACTTTG AGGCTGCCGA AAACAACGAC TGGCTCGCGG TGAACCAGTT CACGATCGTG GAGGGACAGT ACACGCGGCG ACCTGACGTG CTGATATTTA TCAATGGTCT CCCGATCGCC TTGCTGGAAC TCAAGAACGC TGCCGACGAA AATGCCACCA TTTGGACCGC GTTCAAGCAG ATTCAGACGT ACAAATCCCA AATTCCATCG TTGTTCGCCT ACAACGAACT GCTCGTGATC TCCGATGGTA TGGAGGCACG AGTCGGATCG CTCACTGCTG ATACAGAGCG GTTCATGCCT TGGCGCACAA TCGGAGGCGA AGAGCTTGCC CCGACATCTC TGCCGCAGCT TCAGGTGTTA ATCGAGGGGC TTTTCGAAAG GCGCCGTTTA CTTGACTTCA TACGTCACTT TGTCGTCTTC GAAGACGAAG GACGCGGGGT GTTGATCAAG AAGATTGCTG GATATCATCA GTTCCACGCC GTCAATAGGG CGTTAAGAGA GACGCTGTCC GCATCGGCGG AAGGCGGCGA TCGCAGAGTG GGAGTGGTGT GGCACACACA GGGAAGCGGC AAGAGTCTCA CGATGGCTTT TTATGCCGGG CGCCTTGTAC TTGAGCCGAC TCTTCAAAAT CCCACCATCG TCGTGATAAC AGATCGTAAC GACTTGGACG ATCAACTTTA CGGGACATTC TCTCGCTGCC ATGAGTTGTT GAGACAGGCT CCGCTACAAG CCCTGGATCG AGAGCACTTG CGGTCACTGG TTCAAGTGTC GTCCGGAGGC GTGATCTTCA CAACAATCCA GAAATTCATG CCGGCGGAGG GCGACAAATA CCCGCAGCTT ACAGATCGTC GCAACGTGAT CGTCGTCGCT GACGAGGCAC ATCGTAGTCA ATACGACTTT ATAGATGGGT TTGCGCGGCA CATGAGGGAT GCCTTACCCC ACGCCTCATT TATTGGCTTC ACAGGTACAC CCATCGAAAA GACCGATGTA AATACGCGCT CCGTCTTCGG CGATTACATC AGCATCTACG ACATCCAGCG AGCCGTCGAA GATGGGGCGA CTGTGCCGAT CTACTACGAG AGCCGTCTGG CGAAGCTCGA ACTCCCTGCT TCAGAAAAGC CCAAGGTCGA TACGGCTTTC GAGGAAGCCA CAGAAGGCGA AGAGGTCGAT CGCAAAGAAA AGCTCAGGAC GAAATGGGCC GCTGTGGAAG CCGTTGTGGG AACCGAGAAG CGGCTCAATA CGGTTGCTAA AGACCTCGTA AAGCACTTCG AAGAGCGGCT AGAGGTCATG GACGGCAAAG CAATGATTGT CTGTATGAGC CGCCGAATCT GCGTTGACCT GTACAACGAG ATTATGAAAC TTCGTCCAAC GTGGCACAGC GAGAACGATG ATGCCGGAGT GCTCAAGATC GTGATGACGG GGTCAGCAAC GGACGGGCCC 
GAGTGGCAAC AGCACATCAG GGACAAGCGA CGGCGTGAAG CGTTAGGTAA GAGATTTAAG AATGCTAAAG ACCCATTCAA GATCGTGATC GTAAGGGATA TGTGGCTTAC AGGATTCGAC GTTCCATCGT TACACACGAT GTACGCAGAC AAGCCGATGC GAGCGCATGG GCTGATGCAG GCGATTGCCC GAGTCAACCG CGTGTTCAAA GACAAGCCCG GCGGACTGGT TGTGGACTAC CTCGGCATCG CGCACGAACT CAAGCTTGCG CTGGCCACGT ACACAGAAAG CGGTGGAAAG GGCCAGACGG CGCTGCCGCA GGAAGAGGCT GTCGCAAGGA TGATCGAGAA ATATGAAATC TGTCGGGGAT TATTCCACGG GTTTGACTGG TCAATGTGGC TCACCGGGAC CTCCGCGCAG AAGCTCGCGA TTCTTCCTAA CGCCCAGGAA CACGTCCTGA AACAGGAGGA CGGAAAGAAC AGACTGTTGA AGGCGGTGTC AGATTTGTCA AAAGCATTCG CATTGGCTGT GCCCCACGAT GCAGCTTTGA AGATTCGTGA CGATGTAGGC TTTTTCCAAG CAATCCGAAG CGTCTTGGCC AAAACCGCTT CGCAACAGTA TCGTCCCGAG CAAGAGCTCG ATAACGCAAT TCGCCAAATC ATATCCAGAG CGATGTCCTC TGGCGAAGTG GTGGATATTT TCGCAGCTGC GGGCCTTAAG AAGCCCGATA TATCGATTCT TTCCGAAGAG TTCCTGAGAG AGGTCAGCAA CATCCCACAG AAGAATCTCG CAGTAGAGCT GCTACGCAAG CTGCTCAATG ACGAGATCAA AAGTCGATCG CGGCGCAACG TCGTGAAGGG CAAGCTCTTC TCCGCAATGT TGGAGAGCTC GATTCATAAG TATCAGAACC GGGCTATTCA GGCCGCTCAA GTCATTGAGG AGTTAATCGG ACTCGCGAAG GAGATGCGAG AAGAGTCGGC AAAAGGCGAG GAACTGGGCC TAAGCGAAGA CGAGGTGGCT TTCTACGATG CACTCGAAGT CAATGACAGC GCCGTCAAAG TGCTTGGTGA ACCAACTTTG AAGACTATTG CGCGCGAGCT TGTGGACACC GTTCGCCGAA ACACGACGAT CGATTGGGCC GTGCGCGAGA ACATCCGTGC GAACCTGCGT GTGCTGGTGA AACGCATCCT GAGGAAGTAT GGATACCCGC CAGACAAGCA GGAGAAGGCG ACCCTGACTG TGCTGGAGCA GGCGGAAGCG TTGTCAGCGA ATTGGGCGGT GACTACGTAA
Protein sequence | MTTVSAFTES VVEEAALAWL EGLGYTVLSG PSISAGEIAA ERIEADYSDV ILERRVRQTL QRLNPNLPPE ALDDAFRRLT HDNETSLIAN NHAFHQILVN GVTVEYNRAD GSIGGALVRV IDFEAAENND WLAVNQFTIV EGQYTRRPDV LIFINGLPIA LLELKNAADE NATIWTAFKQ IQTYKSQIPS LFAYNELLVI SDGMEARVGS LTADTERFMP WRTIGGEELA PTSLPQLQVL IEGLFERRRL LDFIRHFVVF EDEGRGVLIK KIAGYHQFHA VNRALRETLS ASAEGGDRRV GVVWHTQGSG KSLTMAFYAG RLVLEPTLQN PTIVVITDRN DLDDQLYGTF SRCHELLRQA PLQALDREHL RSLVQVSSGG VIFTTIQKFM PAEGDKYPQL TDRRNVIVVA DEAHRSQYDF IDGFARHMRD ALPHASFIGF TGTPIEKTDV NTRSVFGDYI SIYDIQRAVE DGATVPIYYE SRLAKLELPA SEKPKVDTAF EEATEGEEVD RKEKLRTKWA AVEAVVGTEK RLNTVAKDLV KHFEERLEVM DGKAMIVCMS RRICVDLYNE IMKLRPTWHS ENDDAGVLKI VMTGSATDGP EWQQHIRDKR RREALGKRFK NAKDPFKIVI VRDMWLTGFD VPSLHTMYAD KPMRAHGLMQ AIARVNRVFK DKPGGLVVDY LGIAHELKLA LATYTESGGK GQTALPQEEA VARMIEKYEI CRGLFHGFDW SMWLTGTSAQ KLAILPNAQE HVLKQEDGKN RLLKAVSDLS KAFALAVPHD AALKIRDDVG FFQAIRSVLA KTASQQYRPE QELDNAIRQI ISRAMSSGEV VDIFAAAGLK KPDISILSEE FLREVSNIPQ KNLAVELLRK LLNDEIKSRS RRNVVKGKLF SAMLESSIHK YQNRAIQAAQ VIEELIGLAK EMREESAKGE ELGLSEDEVA FYDALEVNDS AVKVLGEPTL KTIARELVDT VRRNTTIDWA VRENIRANLR VLVKRILRKY GYPPDKQEKA TLTVLEQAEA LSANWAVTT
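The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG is one of the permitted alternative start codons and the initiator codon is read as methionine. A minimal sketch of that check follows, using only the first and last 10-base blocks of the gene sequence as shown in the record; the helper names are hypothetical, and the start-codon set is the one listed for NCBI translation table 11.

```python
# Start and stop codons for NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}

# First and last 10-base blocks copied from the Gene sequence field.
GENE_HEAD = "TTGACTACTG"
GENE_TAIL = "GACTACGTAA"


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise case."""
    return "".join(seq.split()).upper()


def has_valid_start(head: str) -> bool:
    """True if the first codon is a table 11 start codon."""
    return clean(head)[:3] in TABLE11_STARTS


def has_valid_stop(tail: str) -> bool:
    """True if the last three bases form a stop codon.

    Assumes the full CDS length is a multiple of 3, so the last
    three bases are in frame.
    """
    return clean(tail)[-3:] in TABLE11_STOPS


if __name__ == "__main__":
    print("start codon:", clean(GENE_HEAD)[:3], has_valid_start(GENE_HEAD))
    print("stop codon:", clean(GENE_TAIL)[-3:], has_valid_stop(GENE_TAIL))
```

The sequence is given as it appears in the record, which is already the coding orientation even though the gene lies on the minus strand of the replicon.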