Gene Acid345_4516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4516 
Symbol 
ID4070194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5357140 
End bp5359194 
Gene Length2055 bp 
Protein Length684 aa 
Translation table11 
GC content57% 
IMG OID637986555 
ProductPgPepO oligopeptidase 
Protein accessionYP_593590 
Protein GI94971542 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3590] Predicted metalloendopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.861511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATCC CTAAGATCGT TGGCATTTTT CTTCTGTGCG GCGCGGCTGC CTTTGCCCAA 
ACGTCCACAG ATTCAGCTAA GCCACCTGCG CCCGAAAAAA TCTTGAGCTT CGATGTCGAG
GCAATCGACA CGAGCGTTAA CCCTTGCGAG AACTTCTATC AGTTCGCCTG TGGCAACTGG
AAAAAGACCA ACCCCATTCC CGGCGACCAG ACCCGATGGG GACAGTTCAA CAAGCTCGCC
GAGAACAATC GGCTGGTTCT TTATGAATTG CTGACCAGCG CCGCGAAGCC CGGAAAGCAC
AACCCCATCG AACAAAAAGT CGGCGATTAC TTCGCGGCTT GTATGGACAC TAAGACCATC
GAAGCCCGCG GAGCGGAGCC GCTAAAGGCA CAGCTCGACG CCATCTCCAA GATCTCGAAC
CGCACCCAAC TCATGGAAGC AGTCGCGAAC CTGCAGCAGA ACGGCGTCCG AACTTTGATG
GCCTTCTACG CCTCGGCCGA TATGCACGAC GCCACCACGC AGGTTGCCAA CATCGACCAG
GGTGGCATCA CTCTTCCCGA CCGCGATAAC TACCTGAAGG ATGACGCTGC CAATGTCGAA
ATGCGCGCCA AGTATCTCGA ACACGTACAG AAGATGTTCG AACTTCTCGG CGACAGCTCC
GACACTGCGA AGAAAGAAGC GCAGACGGTC ATGACGATCG AGACTGCGCT GGCTAAGGCC
TCGATGGACC GCACCTTGCG CCGCGATCCC AAGAACCGCG ACCACATGAC CGAAGTTTCC
GTAGCTGAGA AGGACGCTGC GAACCTGGAA CTCGCCAAGT TCTTCGCAAC CACAAAGGCC
CCTGAGTTCA GCAAGGTCAA CGTCGGCAAT CCGGACTTCT TCAAGCAGAT CAACGACCTC
GTCGCTGGCA CTCCGGTTGA CGACATGAAG GTCTACCTCC GCTGGAAGGC GCTGCACGAT
GGCGCGTCTG CGCTCTCCGA TAAGTTCGTG AATGAGGATT TCAACTTCTT CAACGCCTAT
TTGCGCGGCC AGAAAGAAAT CGCGCCGCGC TGGAAGCGTT GCGTGGAATA CACCGACGGT
TCGCTCGGCG AGGCCCTTGG CCAGCTCTAT GTCGAGAAGG TCTTCGGCAA AGAGCAGAAG
GAGCGCACCC AGAAGATGGT GAAGGCGATC GAAGAAGCCA TGAACGACGA CCTCAAGTCG
CTCGAATGGA TGACGCCCGA AACCAAGAAG GCTGCCTACA CCAAGCTCGA ATCCATCGTG
AATAACATTG GCTATCCCGA GAAGTGGCGC GATTACAGCT CGGTGAAGGT CACGCGCGAC
GACTTCTTCG GTAACTCCCA GCGCGCCGAT TATTTTGAAG TCCACCGCAA CTGGAACAAG
ATCGGCAAGC CCACCGACAA GAAAGAATGG GGAATGACCC CTCCGACGGT GAACGCCTAC
TACAATCCAT CGCGCAACGA CATCAACTTC CCAGCCGGCA TCTTGCAGTC GCCGTTTTAC
GCCGGCGGCG CGGATGACGC CGTGAACTTG GGCGGCATCG GCGTGGTGAT CGGACACGAA
CTTACGCACG GCTTCGACGA CCAGGGCCGC AAGTTCGATG CGCAGGGCAA TCTTCGCGAT
TGGTGGACCG CGGAAGACGG CAAGGCCTTC GAAGAGCGCG CCAAGTGCGT TTCCGATGAG
TACTCCAGCT TCGTCTCCGT GAAGGACGAC AAAGGAGAAG TTCATCTCAA CGGCAAACTC
ACACTCGGCG AGAATACCGC CGATAACGGT GGACTTCGCC TCGCCTACGC TGCCCTGATG
AAGCTGATCA ACAACGACGA TTCGAAAAAG GTTGACGGCT ACACGCCCTC GCAGCGCTTC
TTCATCTCGT TCGCGCAGGT CTGGTGCCAG AACGTAACGC CTCAGCAAGC GCGCCAATTG
GCTCTTGTCG ACCCACACTC TCCGGGTGAG TGGCGCGCCA ACGGCACTGT CCGCAACTTC
GAGGGCTTCT ACAAGGCCTT CGGCTGCAAA GAAGGCCAAC CGATGGTTCC CACTCAGGGC
TGCCGCGTTT GGTAA
 
Protein sequence
MQIPKIVGIF LLCGAAAFAQ TSTDSAKPPA PEKILSFDVE AIDTSVNPCE NFYQFACGNW 
KKTNPIPGDQ TRWGQFNKLA ENNRLVLYEL LTSAAKPGKH NPIEQKVGDY FAACMDTKTI
EARGAEPLKA QLDAISKISN RTQLMEAVAN LQQNGVRTLM AFYASADMHD ATTQVANIDQ
GGITLPDRDN YLKDDAANVE MRAKYLEHVQ KMFELLGDSS DTAKKEAQTV MTIETALAKA
SMDRTLRRDP KNRDHMTEVS VAEKDAANLE LAKFFATTKA PEFSKVNVGN PDFFKQINDL
VAGTPVDDMK VYLRWKALHD GASALSDKFV NEDFNFFNAY LRGQKEIAPR WKRCVEYTDG
SLGEALGQLY VEKVFGKEQK ERTQKMVKAI EEAMNDDLKS LEWMTPETKK AAYTKLESIV
NNIGYPEKWR DYSSVKVTRD DFFGNSQRAD YFEVHRNWNK IGKPTDKKEW GMTPPTVNAY
YNPSRNDINF PAGILQSPFY AGGADDAVNL GGIGVVIGHE LTHGFDDQGR KFDAQGNLRD
WWTAEDGKAF EERAKCVSDE YSSFVSVKDD KGEVHLNGKL TLGENTADNG GLRLAYAALM
KLINNDDSKK VDGYTPSQRF FISFAQVWCQ NVTPQQARQL ALVDPHSPGE WRANGTVRNF
EGFYKAFGCK EGQPMVPTQG CRVW