Gene Acid345_2619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2619 
Symbol 
ID4072028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3088942 
End bp3090786 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content60% 
IMG OID637984636 
Productpeptidase M61 
Protein accessionYP_591694 
Protein GI94969646 
COG category[R] General function prediction only 
COG ID[COG3975] Predicted protease with the C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.84974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGCGCT ACGCCGCGGT TCGACTCGTA TGCAGCGTGT TGTGGTTCAT CGTTTATTGT 
TCGCGATTCT CGGCCGCTGC GGTCACGTGC GCTTCGCCGG AGCCGCTGCC GGGTAATCCG
CAATACGAGT ACTTCGTCTC GGTCGCCGAT CACGACCGCC ATCAACTGCA CGTGTCCATC
CGATACCGCG CCACGAAGCC GACGGTTTTC CAAATGCCGG TATGGAACGC GCTCTACCAG
GTACGTGACT TCGCGCAGTA CGCGACCGAA CTACAAGCAC ACGATGGCAA TGGAACCGCG
TTGAGCGTCG AGGCGGAGGG GAATTCCGCT TGGAGAGTTC CGCCGTCAAA CGGCTGCGCG
GTCATTGAAT ACAACCTGAA TGCGAATGTT CCCGGGCCGT TCAGCGCGCA GGCGAGCAGC
GATCACGTTT TTCTAAACTG GGCGCAGGTG TTGCTCTACG GAGATCGCAA CGCTCCGTTG
ATGCTGGCCG TTTCTGATCT TCCTGCGACA TGGTCGTTAC GCGACCTTGG TTTGTTCGAT
GAAACCGCGC ACCGGCTCGC GCGTCCCGTG AGTTACGACG CTCTCGTGGA CAGTCCGGTG
GAGATGTCGG CGAGCAAGAT CGCGGCCTTC GACGAAGATG GTGCCCAATA TCGGATTGTT
GTGGATGCGG ATGACGCTGA TTACAACCTC CCCGCCATCC AGGATGCGCT GCGTAAAGTC
GTCCACGCTT CGGTTGACTG GATGCACGAT CGTCCGTTCG ACCAATACAC GTTCCTGTAT
CACTTCCCAC GCGGGCCCGT CGGCGGCGGC ATGGAGCACA GCTACGGTAC CGCGATCTCC
GCGCCGGCGG ACCGCATGCA CGAAAACGCG CTTGCTCCCA TCAGTACCTC GGCGCACGAG
TTCTTTCATC TGTGGAACGT GAAGAGGATC CGGCCGCAAT CGCTCCAGCC CGTCGACTTT
CAGCATGAGC AGTACACACG CGCACTGTGG TTTGCCGAGG GCGTGACAAG TACCGCGTCG
GAACTGATGC TGGTGCGGGC GGGGCTGGAG AATGAACGCG GGTATCTGTC GCATCTCTCG
GCAGTGATTA GCGACTTTGA GGCTCGTCCC GCGCACAAAT TCCAGTCGCC TGAGACCTCG
AGCCTGGAGG CATGGATGGA AGGCCACGCC TACTATCGGC GTCCGGAGCG CAGCGTTTCG
TACTACACCA GCGGAGAATT GTTAGGCGTT TTGCTTGATC TCGAGATGCG CAAGCGGACG
CGGGGAACAA AGTCGTTACG CGACCTGTTC ATTTATCTCA ACGCCGAATA CGCTAAAAAG
CACCGCTACT ACGACGATTC CAACGCGGTC CAGCAGGCCG CGGAAAAAGT CGCCGGTGGC
AGCTTCCAAT CCTTCTTCGA TAAGTACGTT CGCAGCACGG TGCCAATCCC GTACGATGAC
TATCTCCGTT TCGTCGGGCT GACGCTGCAG CCGTTTGCGA TCCTGGGCGT AGATGCTGGG
TTCGACGCAT CCGTGAACTT CACCGGCCTG CCGGAGGTGA CCAAGGTGAC GCCGGGAAGC
GCGGTGGAAG CCGCGGGCGT GCATGCCGGA GATACATTGA CGGCCATCGA TGAACACGAG
TACATGGGTG ATCTCTCGCA CTACCTCGTC GGCCACAAGG CGGGCGACAC GGTGACGTTT
CGATTCGCCT CGCGCACCCG AACCATGGCC GTAAAGGTGA CGCTGGCGGA ATCAAAGGGC
CCTGCGTTTT CGGTCGTCGA AGAGCCGGCC GCCAGCGTGG AACAGCGCGC TCAACGCGCG
GCGTGGATCC GTGGCGACGA CATGGAGAGC GGAGCACACA AATGA
 
Protein sequence
MRRYAAVRLV CSVLWFIVYC SRFSAAAVTC ASPEPLPGNP QYEYFVSVAD HDRHQLHVSI 
RYRATKPTVF QMPVWNALYQ VRDFAQYATE LQAHDGNGTA LSVEAEGNSA WRVPPSNGCA
VIEYNLNANV PGPFSAQASS DHVFLNWAQV LLYGDRNAPL MLAVSDLPAT WSLRDLGLFD
ETAHRLARPV SYDALVDSPV EMSASKIAAF DEDGAQYRIV VDADDADYNL PAIQDALRKV
VHASVDWMHD RPFDQYTFLY HFPRGPVGGG MEHSYGTAIS APADRMHENA LAPISTSAHE
FFHLWNVKRI RPQSLQPVDF QHEQYTRALW FAEGVTSTAS ELMLVRAGLE NERGYLSHLS
AVISDFEARP AHKFQSPETS SLEAWMEGHA YYRRPERSVS YYTSGELLGV LLDLEMRKRT
RGTKSLRDLF IYLNAEYAKK HRYYDDSNAV QQAAEKVAGG SFQSFFDKYV RSTVPIPYDD
YLRFVGLTLQ PFAILGVDAG FDASVNFTGL PEVTKVTPGS AVEAAGVHAG DTLTAIDEHE
YMGDLSHYLV GHKAGDTVTF RFASRTRTMA VKVTLAESKG PAFSVVEEPA ASVEQRAQRA
AWIRGDDMES GAHK