Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3757 |
Symbol | |
ID | 4069332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4439377 |
End bp | 4440687 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637985779 |
Product | restriction modification system S subunit |
Protein accession | YP_592831 |
Protein GI | 94970783 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.386531 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTGGC CTAAGTCCAC CGTTATGGAA CTTCAGCGGG ACGGGGTCCT GCTCGTCGAA GATGGTAACC ACGGCGAAAG TCGCCCCCGC CCCGATGAGT TTGTTAAGCG TGGGGTGGCT TTTATCCGAG CAGCCGACAT GGATGCAAGC GACGTACTGT TTGATACTGC CTCGCGCATC AATGACGTCG CTCGTAAACG AATTACGAAA GGAATTGGTG CACCTGGTGA CATTTTGTTG TCTCACAAGG GAACTGTCGG GAAGGTTGCA CTGGTTCCAG ATGACGCCCC CCCGTTTGTT TGTAGTCCTC AAACGACTTT TTGGCGGACA CTGAAGGGCG ATCGACTCGA TCGACGCTAT CTCCACGCAT ATTTGCGTTC ACCTTATTTC CATCAACAGC TTGCCAGTAG GGCAGGCGAG ACGGACATGG CTCCCTACGT TAGCCTTACG TCTCAGCGTG GCCTTCATGT GCTGATGCCA GACATTGATA TCCAAAGACG AATAGGGAGC ATCGTTGGTG CGCTTGATGC AAAGATTAGC GTGGAGCGGA AAATAAAGGG TACGCTGGCA GACATTGCGC GGGCTCTATT TCAATCGTGG TTCGTTGACT TCGATCCTGT GCGTGCCAAG AGTTTAGGGA GTAGCTCCAG CTTACCTGCG TCGTTGGAAT CGTTGTTTCC CGATACGTTC GAAGAGTCTG AACTCGGTCA GATTCCGAGT GGTTGGACCG TTGGGTCTCT GGATCAAATC GCACATTTCC TGAATGGGCT TGCTCTGCAA AGATTTCCCC CAAACGAGAA CGGCTCACTC CCGGTGATAA AGATCGCGCA GTTGAAGGCT GGAAACACCG AAGGCGCTGA TCTCGCGAGC CCTAATTTGG ATCCCGGGTA CATCGTTCAG GATGGCGACG TTTTGTTTTC TTGGTCTGGG TCGCTCGAAT GCGTAGTCTG GTCGGGCGGG AAAGGCGCAT TGAACCAACA TTTATTTAAG GTCACATCCA AAGATTATCC GAAGTGGTTT TTCTATCTTT GGATACACAG GCATCTAGAT GAGTTTCGAC GAATCGCCGC AGCTAAAGCG ACGACGATGG GCCACATACA GCGCTATCAT CTCTCTGAGG CAAAGATACT TCTGCCTCAC AAGAAATTGC TAGACGCCGC AGACCGTATA ATCGGGCCGC TCATTGAGTC TATCAACGTC CGCGCTGTCC AATCGAAAAT ACTAGGACGC ATTCGGGATT TGTTGCTGCC GAAGTTGATT TCGGGAGAAC TGGCGATTGA GGATGACGCA GAGTTTGGAG TCGTCAAATG A
|
Protein sequence | MNWPKSTVME LQRDGVLLVE DGNHGESRPR PDEFVKRGVA FIRAADMDAS DVLFDTASRI NDVARKRITK GIGAPGDILL SHKGTVGKVA LVPDDAPPFV CSPQTTFWRT LKGDRLDRRY LHAYLRSPYF HQQLASRAGE TDMAPYVSLT SQRGLHVLMP DIDIQRRIGS IVGALDAKIS VERKIKGTLA DIARALFQSW FVDFDPVRAK SLGSSSSLPA SLESLFPDTF EESELGQIPS GWTVGSLDQI AHFLNGLALQ RFPPNENGSL PVIKIAQLKA GNTEGADLAS PNLDPGYIVQ DGDVLFSWSG SLECVVWSGG KGALNQHLFK VTSKDYPKWF FYLWIHRHLD EFRRIAAAKA TTMGHIQRYH LSEAKILLPH KKLLDAADRI IGPLIESINV RAVQSKILGR IRDLLLPKLI SGELAIEDDA EFGVVK
|
| |