Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3753 |
Symbol | |
ID | 4069328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4431985 |
End bp | 4433040 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637985775 |
Product | 5-methylcytosine-specific restriction enzyme subunit McrC |
Protein accession | YP_592827 |
Protein GI | 94970779 |
COG category | [V] Defense mechanisms |
COG ID | [COG4268] McrBC 5-methylcytosine restriction system component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.537146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATCC CTGTCGCAAA CGTCTACTAC CTCCTTTGTT ATGCTTGGGA CAAACTGGAA GAGAGAGATC TTGTCGATAT TCATCCGACC GAAGAAACGG ATTTGGTGAA CCTGTTCGCC CGCGTTCTGA CCAACGGTAT CGACCATTTG CTGAAGAAAG GCATCGATCG TGGGTACCTA CTTCATAGCG AAGAATCCTG CGTGTTGCGT GGGAGGATCG ATTTTCCACA ATCGATAAAA CACATGCTCT TTCAGCGAGC GCAGGCGCAT TGCGAGTTTG ATGAATTGAG TTTTGATGTG CTGCATAACC GTATTCTGAA ATCGACGATT ATGCGCTTAA TTAGGACTCG TGATCTAGAT TCAGGAATTC GAGATCGTTT GCTTTTCCAA TATCGCTACT TTGCGGAGGT CGGGGACCTC GATTTATCGG TTCAGATATT TGGCAAAGTA CAGCTTTACC GTAACAACCA CTTTTATGAT TTCCTTTTAA GGGTCTGCGC GCTGCTTTTT GAGAATCTGC TTCCGACCCA AGAGCCTGGA AATTGGCGGT TTAGGTCGTT CTTGCAGAAT CGGGAACAGA TGGCGTATGT CTTTGAGCGT TTCGTACGCA ACTTTTACAA GAGGGAACTA CCAAGCGTGA GAGTTGACGG GCGATGCAAA GTCAAGCGGG AGGACATAAA TTGGGGCATG ACACCTTCAG ACGACCTCAG CTCAGCTCTG CTTCCCAAAA TGCAAACCGA TGTGTGTATC ACCACCGAGG CAAAAAGGAT CTTGGTTGAA TGCAAATACG TCGATGATCC TCTTGAGCAG CGGGAGGAGA TGGCCCCGAA GCTGATTACT ACTCATCTTT ACCAAGTGAA TGCTTACCTG GACAACTGGC CCGATTTGCC ACTCTACCGC TCTTCCCGCG CCATATTGTT ATATCCACTC GCTACACGTC CGATCGCTGT CGAGTTCACT CGTGCCGACG GGCAGCTCCT CAGCGTACGC ACCCTCAATT TGGCCCAGCA ATGGTCTGCT ATTCACCAAG ACCTTCTTAG ATTGGTGGAC AATTGA
|
Protein sequence | MEIPVANVYY LLCYAWDKLE ERDLVDIHPT EETDLVNLFA RVLTNGIDHL LKKGIDRGYL LHSEESCVLR GRIDFPQSIK HMLFQRAQAH CEFDELSFDV LHNRILKSTI MRLIRTRDLD SGIRDRLLFQ YRYFAEVGDL DLSVQIFGKV QLYRNNHFYD FLLRVCALLF ENLLPTQEPG NWRFRSFLQN REQMAYVFER FVRNFYKREL PSVRVDGRCK VKREDINWGM TPSDDLSSAL LPKMQTDVCI TTEAKRILVE CKYVDDPLEQ REEMAPKLIT THLYQVNAYL DNWPDLPLYR SSRAILLYPL ATRPIAVEFT RADGQLLSVR TLNLAQQWSA IHQDLLRLVD N
|
| |