Gene Acid345_0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0450 
Symbol 
ID4071697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp534074 
End bp537697 
Gene Length3624 bp 
Protein Length1207 aa 
Translation table11 
GC content59% 
IMG OID637982454 
Productprotease 
Protein accessionYP_589529 
Protein GI94967481 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.470359 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATTC GAAATCGGCT CTTAAGCGCG ACAACACGCG TTGCGCTCTT GCTGTGTGCA 
TGCTCCATGG TTTTCGCCCA AGCGCCCTTA ATTCAAAATC GCGTCACAGC CCCTATCGAA
AACAGTAAGA CAATCAAGAT TCGTCAGACA GTCTCGCCGT TGGTCGCGAA GAGTGCAGAC
AAAGGCCGAC TCGCTGGCGA TCGTAATCTC GGGCAGATGC TGCTCATGCT GTCCCCGACG
AAGGAACAGA ACACCGCGCT GGAAGCCGTC ATTAAAGCGG AGCACACTCC GGGATCGGCC
AAGTACCATC ACTGGCTCAA GGCTTCGGAG ATTGCGACGA AGTATGGCGT CTCCGAACCG
GACACGACTG CGGTTCGGGG ATGGCTGGCG TCGCAGGGAT TCGAGGTGAA GCACGTCGCC
AACAGCCGAC GTTTTGTGGT TTTCAGCGGT ACGGTGGCGC AAGTGGAAGC GGCATTCCAC
ACCCAGATGC ACCAGTACGA GCTCTCAGGG AACTCGTTTA TTGCAAACTC CCAGGAAGTC
CAGATCCCTG CGGCGTTGGC GCCGGTGGTT CGTGGCGTCG TCCGCCTGAC CAGCACGCCG
AAAAACAACA ACGTGAAGAT CGTAGGCAAG GCCGCGTTCG ATAAAGAGAA AGGCCAGATC
ACCTTCACGA ACGGTGAGCA TGCAATCACG CCCGCGGATT TCGCGACGAT CTACAACCTC
AATCCGCTTT ATCAGGCAGG CATCAACGGC GCAGGGCAGT CGATTGCGAT CGTGGCGCGC
AGTGATATTT ATTCGCGCGA TGTCTTCGAC TTCATGAGCA TCTTTGGAGT TTCGTTTGGC
GGCTTCTACT ACACCATCAA CGGAGACGAT CCGGGCTATG TTTCGGGATC GGACGTTGAG
GCCACCCTCG ACCTCACCTG GGCGGCTGCA ATTGCACCGG GTGCAACCCC AAACATCGTG
ATTTCGCAGA GCAACTTCGC TGACGGCGTC GATATCTCCG CCGCATTCAT TGTGGACAAC
AATCTCGCGC CGGTAATGAG CACGAGCTTC AGCTCCTGCG AGCAGCAGAT GGGGCCGGTC
GGCACTGAGT TCTATTACTC GCTCTGGGCG CAGGCAGCCG CCGAGGGCAT TACCGCCGTG
GTCTCCAGTG ACGACAGTGG CGGCGCAGGT TGCGATCTTC CGGGAAGCGG TACCTTCGCG
CAGAACGGCC TCGCCGTGAA TGCGCTCGCG TCCACGCCGT TCAACGTCGC CGTCGGTGGT
ACGCAGTTCG ACGACACAGC GGATCCGAGC AAGTATTGGT CCTCCACGAA CGATTCCACG
ACCAAAGCGT CCGTGCTTTC CTACATCCCC GAAAAAGCCT GGAACGAGAG CAGCATTGAT
TCGGGCAACG TCAGCCTATG GGCGGGCGGT GGTGGTGTAA GCACGCTTTG GACGAAGCCT
GAATGGCAGA TCGGAACTGG CGTTCCCGCC GATGGAATGC GTGACTTGCC CGACGTCTCG
CTGACTGCGG CCGGTCACGA CGGATACGTA TTGTGCTTCG GTGGTTCTTG CGAGAGTGGA
GGCATTTATA CGGTGGGCGG AACATCGGCG TCGGCGCCGG CGTTTGCCGC GATCATGGCA
CTGGTCAACC AGCAGACCGG ATCTCCGCAA GGAAATCCGA ACTACGTGAT CTACCAACTC
GCGGCGCAGC ACCCGGAATT CTTCCACGAC ACCACAGTTG GAGATAACAA AGTTCCGGAT
ATGAACGGCG AGTTCACTGT CGGATATTCG ACCGGTGTGG GCTACGACCT TGCGACCGGC
TTGGGATCAT TCGATGCGAA CTCGCTGGTG ACGAACTGGA ACAACGTCAC CTTCAGCGGA
ACCAACACGA CGTTGAGTGG CCCGGCGGGC GGCTTGACCT TCGTGCATGG CGCAGGCGTG
CCGGTCACGG CGAGCGTCAG CGCAGCGTCA GGCAGCAAAC TCCCGACCGG CAATGTCGCA
TTCTTTACGG ACAACCCGCT CGGCCTCGCC ACTCCATTCG GCGTCGGTGC CGCTGCGCTG
GATAACACGG GCGACGCGAC TACCTCCCTC GCCGCGATTC CCGGCGGCAC GCACTCGCTT
ACAGCTCGGT ACGGAGGCGA CGCAACCTTC ACGGCCAGTA CCTCGAACGC GGTTACAGTG
ACGGTTACGC CCGAGCCTTC GAATACGTAT TTCGTGGCTG GCGTGGGTGG AAGCACCGTC
ACCTCGGCTG AAGCAAAGTA CGGCGACCCT CTGGTGATGG CCGTTCTTGT GCAAGGCAAT
TCGCTCGTCG GACACCCGAC TGGCTCCGTT TCGCTGAGTG AAGGCAGCAC CGACCTTGGC
ACGCGCTACC TTAATTATGG TGAACACGAG GATGCGGAGC AGGGATCGAG TTCGGTCTTC
GGAGTGATCG GTTTCCCAGT CGGCGTACAC CAATTGACCG CGAGCTACAC CGGCGATCCC
AGCTTCAATC CGAGTACGTC CACGAACTTC CAACTCACCA TCGTTAAAAG CGATTCAACC
ATCTCATCCC TGCAATTCCA GGGCTCGGCA CTCTCCGGTG CGCCACTCCC TGTCTTCGGA
CAGGTCTCCC TGGCGTCGGG CACTCTCATG CCGATCTCCG GTTCGGTAAC TTTCACAGCA
GCTTCCGACA AAACAACTGT GAACCTCGGG AGCTTAACCA TCGATGCGAC CAGCGGCACG
TTCGCAGGCA GGGTCAGCTT CCCATCTGCT GGAAGCTGGG TGTTGACTGC GGTTTATGGC
GGTGACAGTA ACGTAACCGG CACTCAAACC CAGACTCGCG TCGCCGTCGA CAGCAGCGAA
GCCACGACGA TGTCGCTGAG TTCAAATGCA CCCTCAGTTC CTGCCGGAGG TTCGGTGACA
TTCACCGCGC AGGTAAGTTC TCCCGTGGTG CTCCGGCTAC CGACCGGCAC GGTGACATTC
ATGGACGGCA CGGCTTCGCT CGGCACCGCA ACGTTGGATG GATACGGAAT CGGAAAATTC
ACGACCACCA GCCTGACCGG TGGATCACAC TCGATTACCG CGAACTACGG TGGCGATGCG
ATCTTCCGTG CTACCTCGGC AAGTGTCAGT CAGTCGATCA GTGACTTTGC GGTGCAGCCA
ACGACCGCGG CTGTGTCCAT CAAGGTCGGA CAATCCGGCA CTGCGTTGAT CGCACTCACT
CCGCAAGGTG GGTTTAATCA AGCCGTCACC TTTAGCTGTT CTGGTCTGCC TTCCGGCGCA
AGTTGCACGT TTGCACCAGC AACGCTAACG CCAACAGGCA CGGATGTTGC GACTGACACG
ATGACAATTG CGACCAGCGG AAGTGGCGCG GCTGCACATC GTGCTGAGAA CCGACGGATG
AATTGGCTCG CTAGTTCCGG CTTTGGTCTG GCTGGCGTGC TGTTGCTGGT ACCGATCTGC
AATCGCAAGC GGCGGGCGCG TCTCGTCGTT CTGGCGGGAC TCATGCTGAT GCTCGGACTG
TGGGGATGCG GTGGCAGTTC CTCCTCATCG CCCAAGCCGC CGCCTCCGAA CCCGATGGTC
GGAACTTACA GCGTAACGGT GACAGCGACC TCAGGCACTG GATCTGCGCA TGCGGCAGAT
CTGTCGGTCA CCATCACTCA GTAG
 
Protein sequence
MSIRNRLLSA TTRVALLLCA CSMVFAQAPL IQNRVTAPIE NSKTIKIRQT VSPLVAKSAD 
KGRLAGDRNL GQMLLMLSPT KEQNTALEAV IKAEHTPGSA KYHHWLKASE IATKYGVSEP
DTTAVRGWLA SQGFEVKHVA NSRRFVVFSG TVAQVEAAFH TQMHQYELSG NSFIANSQEV
QIPAALAPVV RGVVRLTSTP KNNNVKIVGK AAFDKEKGQI TFTNGEHAIT PADFATIYNL
NPLYQAGING AGQSIAIVAR SDIYSRDVFD FMSIFGVSFG GFYYTINGDD PGYVSGSDVE
ATLDLTWAAA IAPGATPNIV ISQSNFADGV DISAAFIVDN NLAPVMSTSF SSCEQQMGPV
GTEFYYSLWA QAAAEGITAV VSSDDSGGAG CDLPGSGTFA QNGLAVNALA STPFNVAVGG
TQFDDTADPS KYWSSTNDST TKASVLSYIP EKAWNESSID SGNVSLWAGG GGVSTLWTKP
EWQIGTGVPA DGMRDLPDVS LTAAGHDGYV LCFGGSCESG GIYTVGGTSA SAPAFAAIMA
LVNQQTGSPQ GNPNYVIYQL AAQHPEFFHD TTVGDNKVPD MNGEFTVGYS TGVGYDLATG
LGSFDANSLV TNWNNVTFSG TNTTLSGPAG GLTFVHGAGV PVTASVSAAS GSKLPTGNVA
FFTDNPLGLA TPFGVGAAAL DNTGDATTSL AAIPGGTHSL TARYGGDATF TASTSNAVTV
TVTPEPSNTY FVAGVGGSTV TSAEAKYGDP LVMAVLVQGN SLVGHPTGSV SLSEGSTDLG
TRYLNYGEHE DAEQGSSSVF GVIGFPVGVH QLTASYTGDP SFNPSTSTNF QLTIVKSDST
ISSLQFQGSA LSGAPLPVFG QVSLASGTLM PISGSVTFTA ASDKTTVNLG SLTIDATSGT
FAGRVSFPSA GSWVLTAVYG GDSNVTGTQT QTRVAVDSSE ATTMSLSSNA PSVPAGGSVT
FTAQVSSPVV LRLPTGTVTF MDGTASLGTA TLDGYGIGKF TTTSLTGGSH SITANYGGDA
IFRATSASVS QSISDFAVQP TTAAVSIKVG QSGTALIALT PQGGFNQAVT FSCSGLPSGA
SCTFAPATLT PTGTDVATDT MTIATSGSGA AAHRAENRRM NWLASSGFGL AGVLLLVPIC
NRKRRARLVV LAGLMLMLGL WGCGGSSSSS PKPPPPNPMV GTYSVTVTAT SGTGSAHAAD
LSVTITQ