Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0105 |
Symbol | |
ID | 4069480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 109913 |
End bp | 114349 |
Gene Length | 4437 bp |
Protein Length | 1478 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637982105 |
Product | protease-like |
Protein accession | YP_589184 |
Protein GI | 94967136 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.34808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGTT TCTTCGTTCG GGCCCGACTG CTCTCAATTT GCCTCGCACT GCTTTGGATT GCTTCCTACG CCGGCGCACA GGCTGTAAAA AATCCTTCTC GCATCACGCA AAAAATCGAC AACACGGCGC GTGTCACCTT AAGTGGAACG GTACCCAAAG CGGCGAAGAA CGCGCACGAT CTCGGTGAAG TGGACGGCGG CATGAAGTTG CAGCGCATGA TGCTCGTGCT GAAGCCCAGC GCAGAGCAAC AGGCCTCGTT GCAACGCCTG CTCGACAGCC AGCAGAACAA GAAATCTCCC AACCATCACA AGTGGCTGAC GCCCGAAAAA TTCGCGGCGA GTTACGGACC TTCCGAAGAG GATGTTGCGA AGGTGAAGAA CTGGCTCGAG TCGCAGGGAA TGTCGGTCAG CAAAATCGGC AACGGTCGCC AGTGGATTGA GTTCTCCGGA TCCGCGCACC AGGTGGGAAC TGCGTTCGGC ACATCGCTTC ACTATTTCGA AACCAATGGC GAGCGCCACA TGGCCAACGC CACCGAGATC TCGATACCGC GTGCGATCTC GCCGGTGGTG AGCGGCGTTC TCTCGCTGCA TAACTTCCAC AAGCAGCCGA ATACCTCGAA GCTGTCGAAA GTGAAGTTGG GCGACGATGG CAAGTTGGCG CCCGTCGATC CGGCGTGGAC CTATCGTGAC TACAACCTGA ATCCCTACTA CTACCTTGCA CCGACGGACG CGCAGAAGAT CTACAACGCT TCGCCGCTGT TGAACGATGG CGTGGACGGA ACCGGAATCT CGATTGCAGT GGCGGGCCGC AGCAACATTT ATCTCAGCGA CGTGCAGTTG TTCCGGAACG TTTTCGGCCT CAAACAGAAC GACCCCAATT TCATCGTCAA CGGGCCTGAT CCGGGATATC CGTTCGGCGA CCTGGTGGAG AACACGCTGG ACGTGGAATG GGCCAGCGCG ATGGCTCCGG GCGCGACGAT CAACTTCGTC GCCAGCGGCA GCACGGACAC CACGGACGGT GTTGATCTCT CAGCCGCATA CATTGTGGAC AACGCCGTAG CACCGATCGT CACCGTGAGT TACGGGCTTT GCGAAGCGCT CATGGGCCCG GCGAGCAACC AGTTCTACAA CTCGTTGTGG GCGCAAGCGG CGGCGCAAGG TATGACGGTG TTTGTATCTA CCGGCGACGT GGGCGCGGCG GAATGTGATG GCGATCTGCA GCGCGCCGGC TATGACCCGC CGGGGCCGGC GCAGTATGGC CCCACCATCA GCGGATTGTC TTCCACTCCT TACAACGTCG CCGTCGGTGG CACACAGTAC AACGAGGGCA GCAACTTCGG ACAGTACTGG TCCCCTAACA ACGACTCCAC GTTCGGCTCG GTACTCGGCT ACGTGCCGGA ACAGGCGTGG AACGAAAGCT GCGATCCGAA CCTTCCGCAG GAAGGAACGA ACTGCGTCTA TGGACAGACA AACTACAACC TCGATGGCGG TGGCGGTGGT CCCAGCAATT GCAGCCAATC GACGGTGGAC GATCAGGGGC TCATCACCTG CGTTGCGGGA ACTCCGAAGC CGTCGTGGCA GACCGGATTA GGTGTGCCGA ACGACGGTGT GCGCGATACG CCGGACCTCT CGCTGAACGC ATCGCCGGAT GACGATGGTT TCCTGTTCTG CCTGATCGGA GGCTGTCAGA CCACGACGGT CAACGGCCAG ACGATATTGA CCAACGCCAG CACGATTGGA GGCACATCGG CATCGACACC GGCGATGGCC GGCATCATGG CGCTGATCGA GCAGAAAAAC GGCGCCTTCC AGGGGCAGGC GAACTACATC TTCTACAAAC TCGCCGCGAT GGACGATCAA GCGGCTTGCG ATTCGTCGAA GCGGACGGAT CCAACAGCAA CGAGCACGTG CAACTTCAAC GACGTAACGA TGGGCAGCAA CAGCGTGCCC GGACTGCCCG GTTATGGCAC GGACAGCGCG GAGTGGAGCG CGACGACCGG CTATGACATG GCTACCGGCC TCGGCACGGT GAATGCAGCG AACATCGCAA CCAACTGGGC GAAGGTGACG TTTGCAGCCT CGTCCACAGC ACTCACGGTC GGCGGTGGCA CGGTCGCTCA CGGCGATCCG GTAACGGTAA ACATCGCGGT GACAGCGACG GACGGCGGCA GTTCTAAGCC GACGGGGCCG GTGTCGCTGG TCACGGACAA GTACGGTGCA GTTGGCCAGG TCACGCTCGA TGCGAACGGC AGCTACAGTG GTCCGGTAGC GAACCTTCCG GGCGGTAGCT ACAGCTTGAC CGCGCAGTAC GGCGGAGACG GCACCTTCGG CGCGAGCACA TCGGCGCCAG CTGCGGTGAC GGTCACGCCC GAAGACAGCA CGACGACGAT CATCGGTCTC TACACGGTTG ATCCCAATAC GAGCCGCGTG ATTCCGTACA CCGGTTCGGC GCAATGGGCC TACCCGCTGT GGATTAGTGT GAAGGTCGAC GGCAAGTCCG GCGAGGGACG GGCAACCGGC ACCGTCAACG TGCTTCGGAA CGGAACCGTC GTGATGAGCG CACCGCTCAA CAGCGATGGC TCCGCGTACA TCCAAACCGG CAACGGAATG TCGTACACGT TCCCTGCGGG CGATAGTGAT CTCTCGATTC AGTACTCCGG CGATAGCGGC TTCAACGCGA GTACGTCGGG GGTGACGAAA ATCTCGTTCA CGCCACAGAA GGTCTGGTCC ACCATCCAAA TCAGTTGGTG GCAGGTCCAG GCAGGACAGC CGGTGCTCCT CACCGCAGGT GTCCGTGCAT TCGGCACACC GGTACCGACT GGGACCATGA CCTTCTACGA CAACGGCAAG AAGCTGAGTG ATGCAATTCC CCTCGCGACC GACGGACCTT ACGGTCCAAC TGTTCCCGAG GTGACGTACA CGGCGAAACT CACCACGGTG GGAGACCACT GGATCACTGC GGAATACAGT GGCGATGCCA ACTACGCGGC CGTTGCTCAG GATGATCCGA CGTATTCTTG GGGCAGCAGC TACACGGTCA TCCCCGCTGC GGGTGAAACG ACGACGACCA CCGTCACTCA GTATCCGGCG GCAGTTTCGT TCGGACAATG GATCAACTTC CTCGTGAACG TGAAGCCGGC GAAAGCAGGC GGAGCGGCGC CGACCGGCGA GGTCGTGCTC ACTTCCAACG GCCAGGTGAA CGGGCAGGGC AACCTGGTCA ACGGACAAGT CACGATCTCG GTACAAGCTG GTGCTCAAAC TGCCGAAGTC TATGCGCAGT ACCAGGGCGA CAGCACGTAC GCCTCGTCTT CGAGCGGCGT TTTCAAAACG ACGATTGCGA AACTCGATTC GACCGTATCA TTGACGACCA CCGGCGCCTA CGTTCTGGCT GGACAACAGA CCAGCCTCAA CTTCGTGGTG CAGGGCTACT ACTACAACTC CACGTCGTGG TATCAACCGC AGGGCAGCGT GCAGTTCTTC GACGCGGTGA ACGGCGGAGC ACCCCAGGCC ATCACCGCAC AATTGGGGAT GACCGGGATG AATCCGTGGG CGAACAGTGG CTTGAGCCTT CGCGAGACTC TGCCCGCGGG CACGAACGTC ATCACCGCGC AGTACTCAGG CGACTCCTAC TTCAATCCGG CGACCACGGC TGCAGTAACC GTGGTGGTTT CGCCACCGGA CTTCACGGTC AGCTCCGATC CTTCGGCGTT GACCATTTCG GCCGGAGGCA CGACGTCAGC AATACTTTCA GTGGCTCCGA TCCTGGGATT CTCAGGAGCG GTGACGCTGA CTTGCGGCGA CGGCTTGCCC GCGGGAACAA CCTGCAGCTT CTCACCCGCC ACGCTCGACG CCAGCGGTGG ACAATCTACG ATGACCGTCA CGATGAAGGG CCCGTTCACC AACGCGGCTG CAAATCATGT TTCGGGATGG TGGATGCTTA CCGGCGGCTC CGGCGCTCTC GGGTTCTTCC TGCTCGGGAT TTCCGGAAAA CGCCGGAAGT ACCTCGCGGG CATGCTGGCT ACGATCGCGC TCTTCGGGCT GATGATGGCT TGCGGCGGCG ACAGTCATCC ACCGGCGGCA ACTACGTCGG TGATGCTGGA GTCTTCGCAA CCGAAGGTGG CAGCAGGTGC CAGCGTGACG TTTACAGCCG ATGTGAGCGG CGGCAATAAC GGTGCGACCG GCAGCGTCAC CTTCTATGAC GGAACCACCG CACTGGGCAA TGCCGTGGAT GTTTCCAACG GACAAGCGAC TCTGGCAGTA AACACGCTGA CCGTCGGAAC CCATGCGATC ACCGCCAAGT ACACCGGCGA TTCTTCGCAT GCGGCATCGG TTTCGCAGCC CATGTACCAG GCCATCACCG GCACCACGAC GTTGGCCGTC ACAGCAACCT CGGGTTCGAC GAGCCACGTG TTGAACCTCA ACCTGACCGT GCAATAA
|
Protein sequence | MNRFFVRARL LSICLALLWI ASYAGAQAVK NPSRITQKID NTARVTLSGT VPKAAKNAHD LGEVDGGMKL QRMMLVLKPS AEQQASLQRL LDSQQNKKSP NHHKWLTPEK FAASYGPSEE DVAKVKNWLE SQGMSVSKIG NGRQWIEFSG SAHQVGTAFG TSLHYFETNG ERHMANATEI SIPRAISPVV SGVLSLHNFH KQPNTSKLSK VKLGDDGKLA PVDPAWTYRD YNLNPYYYLA PTDAQKIYNA SPLLNDGVDG TGISIAVAGR SNIYLSDVQL FRNVFGLKQN DPNFIVNGPD PGYPFGDLVE NTLDVEWASA MAPGATINFV ASGSTDTTDG VDLSAAYIVD NAVAPIVTVS YGLCEALMGP ASNQFYNSLW AQAAAQGMTV FVSTGDVGAA ECDGDLQRAG YDPPGPAQYG PTISGLSSTP YNVAVGGTQY NEGSNFGQYW SPNNDSTFGS VLGYVPEQAW NESCDPNLPQ EGTNCVYGQT NYNLDGGGGG PSNCSQSTVD DQGLITCVAG TPKPSWQTGL GVPNDGVRDT PDLSLNASPD DDGFLFCLIG GCQTTTVNGQ TILTNASTIG GTSASTPAMA GIMALIEQKN GAFQGQANYI FYKLAAMDDQ AACDSSKRTD PTATSTCNFN DVTMGSNSVP GLPGYGTDSA EWSATTGYDM ATGLGTVNAA NIATNWAKVT FAASSTALTV GGGTVAHGDP VTVNIAVTAT DGGSSKPTGP VSLVTDKYGA VGQVTLDANG SYSGPVANLP GGSYSLTAQY GGDGTFGAST SAPAAVTVTP EDSTTTIIGL YTVDPNTSRV IPYTGSAQWA YPLWISVKVD GKSGEGRATG TVNVLRNGTV VMSAPLNSDG SAYIQTGNGM SYTFPAGDSD LSIQYSGDSG FNASTSGVTK ISFTPQKVWS TIQISWWQVQ AGQPVLLTAG VRAFGTPVPT GTMTFYDNGK KLSDAIPLAT DGPYGPTVPE VTYTAKLTTV GDHWITAEYS GDANYAAVAQ DDPTYSWGSS YTVIPAAGET TTTTVTQYPA AVSFGQWINF LVNVKPAKAG GAAPTGEVVL TSNGQVNGQG NLVNGQVTIS VQAGAQTAEV YAQYQGDSTY ASSSSGVFKT TIAKLDSTVS LTTTGAYVLA GQQTSLNFVV QGYYYNSTSW YQPQGSVQFF DAVNGGAPQA ITAQLGMTGM NPWANSGLSL RETLPAGTNV ITAQYSGDSY FNPATTAAVT VVVSPPDFTV SSDPSALTIS AGGTTSAILS VAPILGFSGA VTLTCGDGLP AGTTCSFSPA TLDASGGQST MTVTMKGPFT NAAANHVSGW WMLTGGSGAL GFFLLGISGK RRKYLAGMLA TIALFGLMMA CGGDSHPPAA TTSVMLESSQ PKVAAGASVT FTADVSGGNN GATGSVTFYD GTTALGNAVD VSNGQATLAV NTLTVGTHAI TAKYTGDSSH AASVSQPMYQ AITGTTTLAV TATSGSTSHV LNLNLTVQ
|
| |