Gene Acid345_3141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3141 
Symbol 
ID4070256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3733333 
End bp3735351 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content60% 
IMG OID637985161 
Productphysarolisin II 
Protein accessionYP_592216 
Protein GI94970168 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4934] Predicted protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCC CGCCTCGCTT CCTGCCATGC CTGCTGCTTG CGACTGCAAC GGTCGCAAGC 
TTGCCGGCAC AAACCGCCAC CAAGACCGTT CTCCCCAACA ACGTTCCGAA ATTCACCGCA
TCGAGCGTCG ATCTCGGTCC GGCTGATCCC ACCCAGCAAA TCACGGTGAC GATGACGCTT
GCGTCCAAGA ACGCCAGCGG ACTCCAGCAG TTCGTCAGCG ACATCCGCAC TCCCGGTACT
GGTTCCTATC ACGAATTTCT AACGCCTGCG CTGTTCGCCA CCAAGTACGG CGCAGCCGAC
GCGACGCTTA CCGCCGTCAA GACCTTCGCA GCCGCGAATG GTCTCACCAT TACGCACACC
GCGCCCAACA AACTCGTGAT GTCGTTGCGC GGAACTGTCG CCGCCGTCGA AAACGCGTTC
TCGGTTCCGA TCCACAACTA CAAGAAGAAC GGCGAGACGC TGCGCGTGAA CGTGACCAAC
CCGCAGATCT CAACTTCGCT GGTTGGAAAG GTGACGGGCG TTCACATTGC CGACTTCAAC
TTCAAGTCGC ATGCCGTGAT GCCGCTCGAC CCGAACGGCA AAACGCAGAA GCCGGTTCCG
CTCTCTATCT CTCCGCACGG CCTCTTCTTC GCCAGCGGCT GCTTCCGTAA TCCGCAGACC
ATTACCGCCA GCGGTGGTGG TGCGACGGCC ACGTATGCCG GCAACCGATA TGGATCGGAC
ATCACCAGTG GACCTCCGAA TCTTCCTCCC TGCGGCTATG ACGTTGCCGA TGTGTATGCC
GGCTACAACC TGTGGCCGAT GTACAACGCT GGCCTCGATG GCACCGGCGA AACTATCGTC
ATTATTGACG CTTTCGGCTC ACCGACGATC CAGGCCGACG CCAATACCTT CTCGGCGATC
AACGGTCTTC CCGCCCTGAA CTCCACCAAC TTCCAGGTTG TCGGCGCCAA TGCCGGCGGC
AACGCCAGTT GGGCGGGTGA GACCACGCTC GACGTGGAAT GGGCGCACGC AATCGCTCCC
AACGCGAAGA TCGTCCTTGA GGTCGCGCCG ACCAACAGTT TCGTGGACCT CTTTTACGCC
GAAGTGGACG CCATTGCGAA TCACCGCGGT ATCGTGATCT CCAATAGCTG GGGCGGCTTT
GAAACTTTCA CCGATTCCTC GCTCCGCGGT GCGTTCGACT TCATCATGAT GGAAGCGATC
TCCGTGGGTA TCGACGTTAA CTTCTCCACC GGCGACTACG GCGATAACGT ATCCGTGCTT
GGCTACGCCG ACGTGCAGTA CCCGGGCAGC TCACCATTCG CGACGGCCGT AGGCGGCACC
AGCCTCGCGC TCACCAACAC CAAAACCAAG ACGATGAAGT TCCAAACCGG ATGGGGCAAC
AACATCACCC GCCTGGTGGA CGGCACCACT GGGGCGCCGG ACGATCCGCC GCTTATGGAA
GGATTCATCT TCGGCGCCGG CGGCGGGAAC AGCAACGTCT ACACCAAGCC GAGCTGGCAG
GTGGGAACCA ACCAGCCTCG TCGCGCGCTG CCTGATATCG CATGGCTCGC CGATCCTTAC
ACCGGTGTCG AGATTATCCA GACCATCAGC GGTAGCCAGT ACATCGAGGT CATTGGCGGA
ACCAGCCTCG CTGCGCCGAT GTTCTCTGGT ATTTGGGCGA TCGCCAACCA GAAAGCAAAT
ACTACGATCG GTCTCGGCGA TGCGGCATCG CAGCTCTACA GCATGCCGTC CGGCTCGATC
AAAGACGTCG TGCCCTTTAA CACCGCGAAC AACGTGCGCG GCGTTCTGAC CGATGCTTAC
GGAACGTACG AAGAGAGTTC AACTACTCTT GCAGCTCCGC TCGCCTACAC CCGCGGCTTC
TACAGCGCGC TGTACCAGGG CGCGAGTTCG CACAGCTGGT ACGACCTGAC GTTCGGTACC
GACTCCACGC TCTTCACCAA GCAAGGGTGG GACAACGTAA CCGGCTGGGG CACTCCCAAC
GGCCTCAACT TCGTGACCGC CATCGCCAAC AAGAAATAG
 
Protein sequence
MKIPPRFLPC LLLATATVAS LPAQTATKTV LPNNVPKFTA SSVDLGPADP TQQITVTMTL 
ASKNASGLQQ FVSDIRTPGT GSYHEFLTPA LFATKYGAAD ATLTAVKTFA AANGLTITHT
APNKLVMSLR GTVAAVENAF SVPIHNYKKN GETLRVNVTN PQISTSLVGK VTGVHIADFN
FKSHAVMPLD PNGKTQKPVP LSISPHGLFF ASGCFRNPQT ITASGGGATA TYAGNRYGSD
ITSGPPNLPP CGYDVADVYA GYNLWPMYNA GLDGTGETIV IIDAFGSPTI QADANTFSAI
NGLPALNSTN FQVVGANAGG NASWAGETTL DVEWAHAIAP NAKIVLEVAP TNSFVDLFYA
EVDAIANHRG IVISNSWGGF ETFTDSSLRG AFDFIMMEAI SVGIDVNFST GDYGDNVSVL
GYADVQYPGS SPFATAVGGT SLALTNTKTK TMKFQTGWGN NITRLVDGTT GAPDDPPLME
GFIFGAGGGN SNVYTKPSWQ VGTNQPRRAL PDIAWLADPY TGVEIIQTIS GSQYIEVIGG
TSLAAPMFSG IWAIANQKAN TTIGLGDAAS QLYSMPSGSI KDVVPFNTAN NVRGVLTDAY
GTYEESSTTL AAPLAYTRGF YSALYQGASS HSWYDLTFGT DSTLFTKQGW DNVTGWGTPN
GLNFVTAIAN KK