Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0915 |
Symbol | |
ID | 4069126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1153410 |
End bp | 1154384 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637982922 |
Product | TPR repeat-containing protein |
Protein accession | YP_589992 |
Protein GI | 94967944 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0051049 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGCTG CGGCGCAGAC CACGGCCACT GCCACGGACC CGCAGAAGGT CATCGCCGAT GCTCATAGAC AACTAGATCA TGGACAGTAT GACGAAGCGA TTGTGCAACT GCAGAAACTT CAACAGGAAC AGTCGGCAAT TGAAGGCCTC GTGAGGGAAA TTGGAATTGC CTATTACAAG AAGGGCGATT ACCTAAGCGC GATTCGTTAT CTCAAACAAG CGACGAAACA AAATGAAAAA GACAGCGAAG CGGTGCAACT GCTGGGATTG TCGTACTACT TGTCTGGGAA GCCGCAGGAC GCGATCCCTC TGTTGGAGAA GGTGCAGACG TGGTTTCCGA GCGCGAATGT TGATGCGTCG TACATCCTGG GAATCTGCTA CATCCATTTG AAGAATTACG ATCAGGCGCG CGGCGCGTTC GCACAGATGT TCGATGTGAA GCGAGACTCG GCTGCCAGTT ATCTATTCAC AGCACGGATG TTGCTTCGAC AGGAATATGA TCCGGTAGCA GAGGAATATG CGCAGAAGGC GATTGCGCTC GAGCCTACGC TTCCACTGGC GCACTATTTA TTAGGCGAGC TTCACCTCTA TAAGTCACGC GTGCCAGAGG CAATCGAGGA TTTCCGGAAG GAGTTGAACC TCAATCCGTC CTACGCGCCG GCGTATTACA AGCTGGCGGA CGCCTACTCG CGCGTTCAAA AGTACGATGA TGCGGAGCGA CTGCTGCAAC GCTCGATCTG GCTGGATGCA ACGACGACAG GGCCGTACAT ACTTCTCGGC AAAGTGCTGG AAAAGAAGGG AGAGCCGCTA CTAGCGTTGC GCGCGTTGCA ACATGCAGCG AGCATGGATC CGAAAAACCC GATCACCCAC CACTTACTCG GGCAGTCGTA TCGCGATCTA GGTAAGAGTG ATGACGCAGA GCGGGAATTA AAACTCGCCG AGGAATTGCA GAACAAGCAG AACGAGAATC GCTAA
|
Protein sequence | MSAAAQTTAT ATDPQKVIAD AHRQLDHGQY DEAIVQLQKL QQEQSAIEGL VREIGIAYYK KGDYLSAIRY LKQATKQNEK DSEAVQLLGL SYYLSGKPQD AIPLLEKVQT WFPSANVDAS YILGICYIHL KNYDQARGAF AQMFDVKRDS AASYLFTARM LLRQEYDPVA EEYAQKAIAL EPTLPLAHYL LGELHLYKSR VPEAIEDFRK ELNLNPSYAP AYYKLADAYS RVQKYDDAER LLQRSIWLDA TTTGPYILLG KVLEKKGEPL LALRALQHAA SMDPKNPITH HLLGQSYRDL GKSDDAEREL KLAEELQNKQ NENR
|
| |