Gene Acid345_0098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0098 
Symbol 
ID4069473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp99032 
End bp102703 
Gene Length3672 bp 
Protein Length1223 aa 
Translation table11 
GC content60% 
IMG OID637982098 
Productprotease-like 
Protein accessionYP_589177 
Protein GI94967129 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.34808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTTG GAGTATCCCT GCCGAAAATC CGGCATAGTC TCGTCCCCTG CTTGCTCGCA 
ACTGTTCTCT CCATCGCCTC CTCCGCACAA ACCGCTCCAA ATCCATCCCG CATTCAATCT
CCAATTTTAG AGACCGATCG CATTACCTTA CATAGCAGTT CGCCGGCATG GACAAAGGCC
GCCCGCGACG CCGGAGCGGT GGATTCCAAC CTTTCGCTGG ATCGCATTCT GGTCTTGCTC
CAGCCCAGCG CCAAACAAAG CGATGCGTTG AAACTTTATC TCGACGATTT ACAGAACACT
GATTCGAAGA ACTATCACCA GTGGCTCACG CCTGAGCAGT ACGCGCAGAA ATTCGGCCCC
ACCGACGCCG ACATGCAGAA GGTCACCGAC TGGCTGCAGT CACACGGCTT CAAGATTTCT
GAGGTTGCGG CCGGCCGGCA GTGGATCGAG GTCGCCGGAA CTGCAGCCCA AGTCAGGACC
GCATTCAATA CCGAGATTCA CAAGTACCAC CTCAACGGTA AGGATTACTT CGCGAATGCG
TCAGATGTCT CGCTGCCGCG CGCGCTGTCG TCAGTGGTGC GCGGCGTGCT CTCGCTGAAC
AACTTCCAGA AGCGTTCCTT CACTTCGCAC GGCGTCCTGG TGTCGAGAGG CACGAGCGGG
AAATTGCAGC CTGTCGCGGG AGTCGTACCT GACAGCAAAG GCAAGCTGAC GGTAGATCCC
TCCTACACCC TCACAACCGG GAATGGCTCA TTCCATTTCC TCGCACCCGG AGACTTGCAG
AAGATTTACG ACGAAACGCC TCTGCTCACG GACGGCAACA ACGGTCGCGG AGTGTCGATT
GCAATCGTGG GACGAACCAA CATCGAATTG TCCGACGTAC ATTCCTTCCG CCAGATCTTC
GGGCTTTCGG AAAACGATCC TGAAGTCCTC ATCAACGGTA CGGACCCGGG AATCACCCCG
GACGAACTCG AGGCCGACCT CGACCTGGAG TGGGCAGGCG CCTCGGCGCC GGGTGCAAAA
CTGAAGTTCG TCACTTCCAG CTCGACGGCC TCTACAGACG GCGTGGATCT CTCGGCGGCA
TACATCGTCG ACCACCGTAT CGCGCCCATC ATGAGTACTA GCTATGGACA GTGCGAAGCG
TTTCTCGGTC CAAGCGGCAA TGCGTTTTAT TCGTCGTTGT GGCAGCAAGC CGCTGCTGAG
GGCATTACTG CCTTCGTGAG CTCGGGCGAT AACGGTGCTG CCGGCTGCGA TCCTGCCGCG
TACTTCTTGC CCGAACAATA CGGCAAAATG GTGAGCGGGC TCGCTTCCAC CCCATACAAC
GTTGCTGTGG GAGGCACCGA GCTCAACGAG AACGGAAACG ACTCCACCTA CTGGTCGGCG
AACAATGCAG CCGACCAAGC CTCTGTCCTG GGATACATTC CCGAAGTTAC CTGGAACGAG
ACCTGCGATC CGCGCACCAG CACTTCCTGT TCGCAGTACA TCAATTACTC GAGCAGCGGC
GGTCCCAGCA ACTGTTCGGA TGTAACCCAG AACGGCAGCC GGTTCAACTT CACCTGCAAC
GCCGGTTACC CGAAACCTGC CTGGCAAACC GGCGTCGGCG TTCCAAACGA CGGCGTCCGT
GATCTGCCTG ATGTCTCGCT GGCAGCCGCC GGTGGACACG ATGGCTACCT GCTGTGCGTC
GAAGGCAGTT GCCAGACCAC GACTGTGAAT GGAAAGATCG TCCTGACGAA CGCCGTCGTC
GTTGGTGGGA CCTCCGCCGC TTCTCCCTCC ATGGCCGGCA TCCTCGCCCT TGTGGAACAA
AAGAACGGAC AATACCAGGG CCAGGCAAAC TACACCTTCT ACCAACTCGC TGCCGCCGAG
CAAGCCGCGA ACTGCAACGC GTCGCAGAGG GTGGATCCCA CGCACACCAG CCAATGCATC
TTCAATGACG TCACCTCCGG CAACAACGGC GTGCCCAACC TCACCGGGTT CAACGCCGGC
ACCGGGTACG ACTTGACCAC CGGACTCGGC TCGGTCAGTG CCGCAAACCT GGCCGCGAAT
TGGAACACCG GCAGGAAGCA CCTCACCGAA ACCTTCCTTT GGGCTTCACG CTTCAAAGCG
CAGCACGGAC AACCCATCGA TCTCAACATT CGCGTTCATG CGTCGCGAGC GTCGAGCGCG
CCCACGGGAG CTGTTGCACT CGAGGCAGGT TCGAACCGCT ACCCGACGAG CGTGCCTCTG
ACGCACGGTG CATTCTCCGG TCCGGTCGCA AGTCTGCCGG CTGGCCACTA CCTGTTGACC
GCGCACTATG GCGGCGACGG CACCTATAGC CAGAGCACCT CCAACCCCAT TCCGATCGAT
ATCACGCCGG AGGACAGCAA GATCACAGTC ATTCCTTACA ACGTGAACCT CGTGGGCCAG
TACCTGCCGA CGACGGGACC GATCACTTTC GGGACGGAAG CCGCGCTCCA GATCAACGTT
CAGGGCCTCT CCGGGCAAGG ACAAGCGACA GGATCGGTGA CCATCACGGT CGACGGCAAG
AACGCGGGCA CTGCGACCAT CACCGCGGGC AATGTCTTCG TGACCCTGGA TTCCCTGGTT
TCACAAACGT TGTCCGTAGG CACGCACTCG TTCGGCGCGA CCTACTCGGG CGACACCAGT
TTCCACGCCT CGTCCTCTCC GCATGCCGCA TCGATATCGG TGGCGCGAGG CTACGTCGGT
CTTACGCGCA TCAGCTCCGA TCTCCAAACC GTTGCCGTTG GAGCACCGCT GACGTTCTAC
ATCAGCGTGC TCGCCCCCGG ATCCTCTCGC CCGTCGGGTA CGGTTCAGGT CTATGACAAT
GGCGCAGCCA TCAGTGGTCC GATCGCACTC GCAACAAACG TTCCATCAGA CGCGGTGCAG
GCAGAATTTG TGCACGCCTT CACCACGACC GGAACTCACA TCATTCGTTT GAGCTACTCG
GGCGACAAAA ACTTCTTCCC GGTCGCGCCC GACGAATTCC GCTCTTCGCA GTTCTTCCTC
ACCGTGAATT CCGCGAAGGG AGCGGCCACA GTGACGCAGA TCTCGCAGTC GAACCCAACT
CTCACGGTCG GCGGCACAGA CACCTTCACG GTTTCGGTGG CGCCTCAGAA GTCGGGAGGT
GCAGCGCTCA CGGGCACAGT CACGCTCGTC AGCATGTACA ACTCGATCAT CGCTGGACCG
GTGGCGCTCA CCAACGGTAA AGCAAGCTTC GTGGTGCCCT GGTCGAAGTT CCTGAACGTT
GGAACCAGTG AATTGCTCGC GTCGTACTCG GGCGATGCAA ATTACGCGCC CAGTGCCAGC
GGTAATATCG AGACCACCGT AAACCCTGCA ACCCCCGCTA TAACACTGTC CGCGGATGCC
TCTGAGGTGC GCGCGGGCGC CACCAGCGAG TTGGCCGTCA TTGTGAAACC GACGCTGAGC
GGTGATTCCA GCATCGTTCT GCCGTTCGGT AAAGTGCAGT TCTACGATGC CGTCAACGGG
CGGGCTCCGC AGCCACTTGG GCCTGCATAT GGCCTCACCC AAGGCAACGG CAACTTCACC
ACGTTCCTGT TCGCAACCCA GTTGCCGGCA GGGCACAACG TGATCACGGC ACGCTATCTC
GGAAATGGCG AGTGGGGCCC TGCGGCCTCG AATCCGGTGG TCGTCCTGGT AGGACGCGCA
CACCGCGACT AA
 
Protein sequence
MRLGVSLPKI RHSLVPCLLA TVLSIASSAQ TAPNPSRIQS PILETDRITL HSSSPAWTKA 
ARDAGAVDSN LSLDRILVLL QPSAKQSDAL KLYLDDLQNT DSKNYHQWLT PEQYAQKFGP
TDADMQKVTD WLQSHGFKIS EVAAGRQWIE VAGTAAQVRT AFNTEIHKYH LNGKDYFANA
SDVSLPRALS SVVRGVLSLN NFQKRSFTSH GVLVSRGTSG KLQPVAGVVP DSKGKLTVDP
SYTLTTGNGS FHFLAPGDLQ KIYDETPLLT DGNNGRGVSI AIVGRTNIEL SDVHSFRQIF
GLSENDPEVL INGTDPGITP DELEADLDLE WAGASAPGAK LKFVTSSSTA STDGVDLSAA
YIVDHRIAPI MSTSYGQCEA FLGPSGNAFY SSLWQQAAAE GITAFVSSGD NGAAGCDPAA
YFLPEQYGKM VSGLASTPYN VAVGGTELNE NGNDSTYWSA NNAADQASVL GYIPEVTWNE
TCDPRTSTSC SQYINYSSSG GPSNCSDVTQ NGSRFNFTCN AGYPKPAWQT GVGVPNDGVR
DLPDVSLAAA GGHDGYLLCV EGSCQTTTVN GKIVLTNAVV VGGTSAASPS MAGILALVEQ
KNGQYQGQAN YTFYQLAAAE QAANCNASQR VDPTHTSQCI FNDVTSGNNG VPNLTGFNAG
TGYDLTTGLG SVSAANLAAN WNTGRKHLTE TFLWASRFKA QHGQPIDLNI RVHASRASSA
PTGAVALEAG SNRYPTSVPL THGAFSGPVA SLPAGHYLLT AHYGGDGTYS QSTSNPIPID
ITPEDSKITV IPYNVNLVGQ YLPTTGPITF GTEAALQINV QGLSGQGQAT GSVTITVDGK
NAGTATITAG NVFVTLDSLV SQTLSVGTHS FGATYSGDTS FHASSSPHAA SISVARGYVG
LTRISSDLQT VAVGAPLTFY ISVLAPGSSR PSGTVQVYDN GAAISGPIAL ATNVPSDAVQ
AEFVHAFTTT GTHIIRLSYS GDKNFFPVAP DEFRSSQFFL TVNSAKGAAT VTQISQSNPT
LTVGGTDTFT VSVAPQKSGG AALTGTVTLV SMYNSIIAGP VALTNGKASF VVPWSKFLNV
GTSELLASYS GDANYAPSAS GNIETTVNPA TPAITLSADA SEVRAGATSE LAVIVKPTLS
GDSSIVLPFG KVQFYDAVNG RAPQPLGPAY GLTQGNGNFT TFLFATQLPA GHNVITARYL
GNGEWGPAAS NPVVVLVGRA HRD