Gene Information |
Locus tag | Acid345_0549 |
Symbol | |
ID | 4070007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 677608 |
End bp | 679524 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982554 |
Product | lytic transglycosylase, catalytic |
Protein accession | YP_589628 |
Protein GI | 94967580 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) |
TIGRFAM ID | |
|
|
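The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(677608, 679524)
print(length_bp)                  # 1917
print(protein_length(length_bp))  # 638
```

Both values agree with the Gene Length (1917 bp) and Protein Length (638 aa) fields above. Because the gene is on the minus strand, the listed gene sequence would correspond to the reverse complement of the replicon between these coordinates.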
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.357331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.13799 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATTC GTAGCTTCTG GTCGCTGTGC CTGCTGGGCG CAATGATCAT CACCACGGCG TGCGAGACCG ACAACCCGAA GAAGGCTGCT ACCACTCCAC CTCCGCAGGC GACGGCACCT ACACTGGCGA CACCGCAGAG CGATGCGGCA CCGGCAACTG CGCCGGTAGC GACTGCCGCG CCTGCGGGAG ATTCAGTCAC AGAGCTGATC GCGAAGGCCG AGAAGGAGTT CAAGGCCGGG CAGTCGAATT ACGCTGCCGG GCATCTTGAA GCCGCGAAGG ACAATTTCGA CCAGGCCTTC AAGACGATGT TGTCGAGCAA TCTCGATGTG CATTCGGATG AGCGGCTCGA GAACGAGTTC GACAAGATTG TGGAGTCCGT CCACGAGCTT GAGTTGCAGG CGCTGAAAGA GGGTGACGGC TTCACCGAAC AGAAGTCGGA ACCGGCCCCG ATTGACGAGG CCAACGAAGT CACCTTCCCG GTTGATCCCA ACATCAAGGC GCAGGCATTG GCGGAGATCC GGCAGACGCG TTCCGATTTG CCGCTGGTAA TGAACGACCA GATCGCGGGT TACATCTCGT ACTTCTCTTC GCGCGCGAAG GGCACGCTGG CGAATGGGAT GGCGCGCTCG GGAATGTATC GCGAGATGAT CCAGCGCGTG TTGAAAGAAG AAGGCGTGCC GCAGGATTTG ATCTATCTCG CGCAGGCGGA GTCGGGGTTC CGGCCGCTGG CGCTCTCACG CGCGGGAGCT CGCGGAATGT GGCAGTTCAT GGCTTCACGC GGGGGCCAAT ACGGACTGGA TCGCAACTGG TGGGTGGATG ATCGCCAGGA TCCGGAAAAA GCCACGCGCG CAGCGGCACG GCATTTGAAA GATCTCTACC ACATGTTCGG CGACTGGTAT CTCGCGATGG CTGCGTATAA CAGCGGTCCG CTGACGGTAC AACGCGCGGT ACAACGCACG GGCTACGCGG ACTTCTGGGA GTTGTATAAG CGCGGCGTGC TGCCGGGTGA GACCAAGAAC TATGTGCCGA TCATCATCGC GATCACGATC ATGTCGAAGA ACCCGGCGCA GTATGGGCTC ACCGAGGTGC AGTTCGATGC GGCGATCCAG GGCGATTCGG TGACGATTGA TTATCCGGTG GATTTGCGAC TGGTCGCGGA ATGCGTGGAC GGTTCGGTGA GCACGTTGCA GGAACTGAAT CCGAGCCTGT TGCGCATGAC CACGCCGAAA GACCAATCGT TCACACTGCA CGTGCCGACG GGGACGAAAG ACAAGTTCGA GCAGAACATC GCAGCGATCC CGCTGGAGAA GCGCGTGCTG TGGCGCTTCC ATCGCGTGCA GCCCGGAGAC ACGCTGGCGT CGATCGCGCA CAAGTATCAC GTGAGCAGCG ATGCCATCGC GGAAGCGAAC GATCTTCCTG ATGAAGAAGT ACGCACCGAT GCCAAGCTGA TTATTCCGGT GGCATCGAGC AAAGGTTCGC AGAGCGCGTC GAATGAAGGC GGCGGATATT CGAAGAAGCC GGTGTCGTAC AAAGTGCGTA AGGGCGACAC GATTGCTTCG ATCGCGGATG ACTTCGGAGT TCCTGCCGAC AAGATTCGAC GGTGGAACCA CATCAGCGGC GACTCGGTGA AACAAGGTCG CGTGTTGCAC ATCTACAAGC CGACAGGCGA TGAAGAAGAG GCGAGTTCGA CGCGTTCACG CTCGTCGTCA AAGAAAGCTG CGCCGGAGTT GTCGAGCAAA TCACAGGCGA AATCGAGTGA CGAAAAAAAG CAGAGCGCGA TGCATCACAC CGTGAAACGT 
GGTGAGACTC TGAGCAGCAT CGCGAACCAA TACAACACCA CGGTTGCGGA GTTGAAGAAG TACAATCCCA ACACTTCGAA GCTGCGCCCG GGCGATGTAC TAGTGATCAG ACGATAA
|
Protein sequence | MRIRSFWSLC LLGAMIITTA CETDNPKKAA TTPPPQATAP TLATPQSDAA PATAPVATAA PAGDSVTELI AKAEKEFKAG QSNYAAGHLE AAKDNFDQAF KTMLSSNLDV HSDERLENEF DKIVESVHEL ELQALKEGDG FTEQKSEPAP IDEANEVTFP VDPNIKAQAL AEIRQTRSDL PLVMNDQIAG YISYFSSRAK GTLANGMARS GMYREMIQRV LKEEGVPQDL IYLAQAESGF RPLALSRAGA RGMWQFMASR GGQYGLDRNW WVDDRQDPEK ATRAAARHLK DLYHMFGDWY LAMAAYNSGP LTVQRAVQRT GYADFWELYK RGVLPGETKN YVPIIIAITI MSKNPAQYGL TEVQFDAAIQ GDSVTIDYPV DLRLVAECVD GSVSTLQELN PSLLRMTTPK DQSFTLHVPT GTKDKFEQNI AAIPLEKRVL WRFHRVQPGD TLASIAHKYH VSSDAIAEAN DLPDEEVRTD AKLIIPVASS KGSQSASNEG GGYSKKPVSY KVRKGDTIAS IADDFGVPAD KIRRWNHISG DSVKQGRVLH IYKPTGDEEE ASSTRSRSSS KKAAPELSSK SQAKSSDEKK QSAMHHTVKR GETLSSIANQ YNTTVAELKK YNPNTSKLRP GDVLVIRR
|
| |
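The sequences above are printed in blocks of 10 characters separated by spaces, so they need cleaning before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC percentage like the one in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first 30 bases of the gene sequence, so its result is not expected to match the 59% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAGAATTC GTAGCTTCTG GTCGCTGTGC"
seq = clean_sequence(first_blocks)
print(len(seq))               # 30
print(seq.startswith("ATG"))  # True: the CDS begins with a start codon
print(gc_percent(seq))        # 50.0
```

Applying `gc_percent` to the full cleaned gene sequence would be the way to compare against the reported GC content, assuming that field is computed over the gene rather than the whole replicon.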