Gene Acid345_4495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4495 
Symbol 
ID4070173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5334938 
End bp5336494 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content61% 
IMG OID637986534 
ProductD-alanyl-D-alanine carboxypeptidase 
Protein accessionYP_593569 
Protein GI94971521 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2027] D-alanyl-D-alanine carboxypeptidase (penicillin-binding protein 4) 
TIGRFAM ID[TIGR00666] D-alanyl-D-alanine carboxypeptidase, serine-type, PBP4 family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.9123 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCT CGCGGACCAT TCCGGCTCTC TTCTTCCTCG TTCTCATCGC GCTGTACGCG 
CCTGCGGCCG ACAAACTCTC GTCGAAGATC GACAGAGTCC TCTCCGCGCC CGACGTCTCA
CGCGCATTCT GGGGCATCGA GATCGTCTCG CTCGACAACG GCAAAACGCT CTATTCGCGC
AACAGCGACA AGCTCTTTAC GCCCGCATCC AACACCAAGC TCTTCACCAC CTCGACAGCG
TTCGCGCTGC TCGGTCCCGA CTTCCGCTTC CATACCACGG TCGAGACCTC AGGCACAGTC
GACAAGCGTG GCCGCCTCGA TTCTGACCTC GTCATTGTGG GTCGCGGCGA TCCGAACCTC
TCCGGTCGTA CTCTGCCCTA CAACCTGCGC ACCGAACGTA AGCAACCGCC GATTGCTGCG
CTCGAGAACC TCGCCGACCA GCTCGTCCAA AAGGGCGTTC GCTACGTAGA TGGCGACATC
GTTGCCGACG ACTCCTACTA CGCCTTCGAG CGCTACGGTG AAGGGTGGGC GCAGGATGAC
CTTGTCTGGG AGTGGGGAGC GCCGGTATCA GCGCTGACCG TCAATGACAA CGTGATCTTC
GTCAGCATTC AGCCCGCCGA TCGCGTCGGC GAACGCGCCT TTGTGGACAT CACTCCGTTT
CCGGCGTACT ACCGCGTGGA CAACCGCGTT ATGACCACTC CACAGGGAAC CGGCCCGCGC
AAGATCTACA TCAACCGTGA GCCGGGTTCG AACCAGCTCA CGTTGTGGGG GAACATTCCG
GTCGATGACC AGGGCGCTAA CGAAGCGCTT GCCATCGAAG ACCCTGCCGA CTTCACCGCG
AAGCTCTTCC GCGAACTCCT CGACAAGCGC GGCGTCACCG TATACGGACG CCCGAAGACC
AAGCACACCG AGTTGGCATC GCTCTCCACG TTCAGCATCA CCGCGACGGC CTCAGCCGGA
GGCGGCGCCG ATCGCGCGCC CGCTCCGGTT TCACGCCTGC CTCTGGTGCT CGGACAGTAC
GATTCGCAGC CGCTCTCCGC CGATCTCAAA GTCATCAACA AGGTCAGCCA GAACCTTCAC
GCCGAATTAC TGCTGCGTTT GCTCGGCAAG GAGAAGGGAA CTGCCGGCAC GATCGAAGGC
GGCCTCGAAG TTGAGCGGGC CTTCCTCGCC TCCGCCGATA TTCGTCCCGA GGAATACACG
CTCTACGACG GAAGCGGTCT TTCACGGCAG GACCTCGCCA CGCCGCACGC GTTCGTGAAG
CTCCTGACCT ACGCGCACAA GCAACCATGG GGCGCGACCT TCGAGGATAC GCTGCCCGTC
GCAGGTGTGG ACGGTTCGCT CGTGGAACGC TTTACCAAGT CGACCGCGCA AAGCCGCGTA
CACGCCAAGA CCGGTTCTCT CGACCATGTG AATTCGTTGT CGGGCTATTT AACTACCGAG
AAGGGCGAGC ACGTGGTGTT CTCGATCCTG TCGAACAACC ACAACCTCAC CAACAAGCAT
GCCATTGAGA CGATCGACGC GATTGTGCAG GCGGTCGTGG ATAGCGGGAA GAAGTAG
 
Protein sequence
MSRSRTIPAL FFLVLIALYA PAADKLSSKI DRVLSAPDVS RAFWGIEIVS LDNGKTLYSR 
NSDKLFTPAS NTKLFTTSTA FALLGPDFRF HTTVETSGTV DKRGRLDSDL VIVGRGDPNL
SGRTLPYNLR TERKQPPIAA LENLADQLVQ KGVRYVDGDI VADDSYYAFE RYGEGWAQDD
LVWEWGAPVS ALTVNDNVIF VSIQPADRVG ERAFVDITPF PAYYRVDNRV MTTPQGTGPR
KIYINREPGS NQLTLWGNIP VDDQGANEAL AIEDPADFTA KLFRELLDKR GVTVYGRPKT
KHTELASLST FSITATASAG GGADRAPAPV SRLPLVLGQY DSQPLSADLK VINKVSQNLH
AELLLRLLGK EKGTAGTIEG GLEVERAFLA SADIRPEEYT LYDGSGLSRQ DLATPHAFVK
LLTYAHKQPW GATFEDTLPV AGVDGSLVER FTKSTAQSRV HAKTGSLDHV NSLSGYLTTE
KGEHVVFSIL SNNHNLTNKH AIETIDAIVQ AVVDSGKK