Gene Acid345_0549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0549 
Symbol 
ID4070007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp677608 
End bp679524 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content59% 
IMG OID637982554 
Productlytic transglycosylase, catalytic 
Protein accessionYP_589628 
Protein GI94967580 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0741] Soluble lytic murein transglycosylase and related regulatory proteins (some contain LysM/invasin domains) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.357331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.13799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATTC GTAGCTTCTG GTCGCTGTGC CTGCTGGGCG CAATGATCAT CACCACGGCG 
TGCGAGACCG ACAACCCGAA GAAGGCTGCT ACCACTCCAC CTCCGCAGGC GACGGCACCT
ACACTGGCGA CACCGCAGAG CGATGCGGCA CCGGCAACTG CGCCGGTAGC GACTGCCGCG
CCTGCGGGAG ATTCAGTCAC AGAGCTGATC GCGAAGGCCG AGAAGGAGTT CAAGGCCGGG
CAGTCGAATT ACGCTGCCGG GCATCTTGAA GCCGCGAAGG ACAATTTCGA CCAGGCCTTC
AAGACGATGT TGTCGAGCAA TCTCGATGTG CATTCGGATG AGCGGCTCGA GAACGAGTTC
GACAAGATTG TGGAGTCCGT CCACGAGCTT GAGTTGCAGG CGCTGAAAGA GGGTGACGGC
TTCACCGAAC AGAAGTCGGA ACCGGCCCCG ATTGACGAGG CCAACGAAGT CACCTTCCCG
GTTGATCCCA ACATCAAGGC GCAGGCATTG GCGGAGATCC GGCAGACGCG TTCCGATTTG
CCGCTGGTAA TGAACGACCA GATCGCGGGT TACATCTCGT ACTTCTCTTC GCGCGCGAAG
GGCACGCTGG CGAATGGGAT GGCGCGCTCG GGAATGTATC GCGAGATGAT CCAGCGCGTG
TTGAAAGAAG AAGGCGTGCC GCAGGATTTG ATCTATCTCG CGCAGGCGGA GTCGGGGTTC
CGGCCGCTGG CGCTCTCACG CGCGGGAGCT CGCGGAATGT GGCAGTTCAT GGCTTCACGC
GGGGGCCAAT ACGGACTGGA TCGCAACTGG TGGGTGGATG ATCGCCAGGA TCCGGAAAAA
GCCACGCGCG CAGCGGCACG GCATTTGAAA GATCTCTACC ACATGTTCGG CGACTGGTAT
CTCGCGATGG CTGCGTATAA CAGCGGTCCG CTGACGGTAC AACGCGCGGT ACAACGCACG
GGCTACGCGG ACTTCTGGGA GTTGTATAAG CGCGGCGTGC TGCCGGGTGA GACCAAGAAC
TATGTGCCGA TCATCATCGC GATCACGATC ATGTCGAAGA ACCCGGCGCA GTATGGGCTC
ACCGAGGTGC AGTTCGATGC GGCGATCCAG GGCGATTCGG TGACGATTGA TTATCCGGTG
GATTTGCGAC TGGTCGCGGA ATGCGTGGAC GGTTCGGTGA GCACGTTGCA GGAACTGAAT
CCGAGCCTGT TGCGCATGAC CACGCCGAAA GACCAATCGT TCACACTGCA CGTGCCGACG
GGGACGAAAG ACAAGTTCGA GCAGAACATC GCAGCGATCC CGCTGGAGAA GCGCGTGCTG
TGGCGCTTCC ATCGCGTGCA GCCCGGAGAC ACGCTGGCGT CGATCGCGCA CAAGTATCAC
GTGAGCAGCG ATGCCATCGC GGAAGCGAAC GATCTTCCTG ATGAAGAAGT ACGCACCGAT
GCCAAGCTGA TTATTCCGGT GGCATCGAGC AAAGGTTCGC AGAGCGCGTC GAATGAAGGC
GGCGGATATT CGAAGAAGCC GGTGTCGTAC AAAGTGCGTA AGGGCGACAC GATTGCTTCG
ATCGCGGATG ACTTCGGAGT TCCTGCCGAC AAGATTCGAC GGTGGAACCA CATCAGCGGC
GACTCGGTGA AACAAGGTCG CGTGTTGCAC ATCTACAAGC CGACAGGCGA TGAAGAAGAG
GCGAGTTCGA CGCGTTCACG CTCGTCGTCA AAGAAAGCTG CGCCGGAGTT GTCGAGCAAA
TCACAGGCGA AATCGAGTGA CGAAAAAAAG CAGAGCGCGA TGCATCACAC CGTGAAACGT
GGTGAGACTC TGAGCAGCAT CGCGAACCAA TACAACACCA CGGTTGCGGA GTTGAAGAAG
TACAATCCCA ACACTTCGAA GCTGCGCCCG GGCGATGTAC TAGTGATCAG ACGATAA
 
Protein sequence
MRIRSFWSLC LLGAMIITTA CETDNPKKAA TTPPPQATAP TLATPQSDAA PATAPVATAA 
PAGDSVTELI AKAEKEFKAG QSNYAAGHLE AAKDNFDQAF KTMLSSNLDV HSDERLENEF
DKIVESVHEL ELQALKEGDG FTEQKSEPAP IDEANEVTFP VDPNIKAQAL AEIRQTRSDL
PLVMNDQIAG YISYFSSRAK GTLANGMARS GMYREMIQRV LKEEGVPQDL IYLAQAESGF
RPLALSRAGA RGMWQFMASR GGQYGLDRNW WVDDRQDPEK ATRAAARHLK DLYHMFGDWY
LAMAAYNSGP LTVQRAVQRT GYADFWELYK RGVLPGETKN YVPIIIAITI MSKNPAQYGL
TEVQFDAAIQ GDSVTIDYPV DLRLVAECVD GSVSTLQELN PSLLRMTTPK DQSFTLHVPT
GTKDKFEQNI AAIPLEKRVL WRFHRVQPGD TLASIAHKYH VSSDAIAEAN DLPDEEVRTD
AKLIIPVASS KGSQSASNEG GGYSKKPVSY KVRKGDTIAS IADDFGVPAD KIRRWNHISG
DSVKQGRVLH IYKPTGDEEE ASSTRSRSSS KKAAPELSSK SQAKSSDEKK QSAMHHTVKR
GETLSSIANQ YNTTVAELKK YNPNTSKLRP GDVLVIRR