Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3298 |
Symbol | |
ID | 4072710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3905994 |
End bp | 3907529 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637985319 |
Product | peptidase S1C, Do |
Protein accession | YP_592373 |
Protein GI | 94970325 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTGA ACAATCTGTT GGGAACATTG AAGGCGAAGA TCGGCCGCCG TTTCTATGCC AGCGTGCTGG CAGGAGCGGT AGCGTTTTCT CTCGCGAGCT ATGAATTTGC TGGCCACGCA CGCGCAGCAA CACCGAGTCC GGCAGCCGCA GCGCTCGACG ACAACAGTGT GGGCGCGCTG CTCTCGCTCG ACAAAGCCAT GGAGAACCTC GCCGCGCGCG TAACGCCGGC GACAGTGAAC GTAACCGTTA CGTCGAAACG TTCCGCACAT AACGCATCCA TGCAAGGCAC GGAGGATGGC GACGACGATG GCAATCCGAT GCAGCAGTTC GGACCATTTC AGTTCGGACC GCAGCGTCGC CAGCCGCAAT ACGAGCATGG CCTCGGTAGC GGCGTGATCA TCTCGCCGGA TGGATACATC GTCACCAACA ACCACGTGAT CGATGGTGCG ACTGACATTC GCGTCACCCT CACGGACAAA CGCATCCTGC CCGCGAAATT GATCGGCGCC GATCCGCTGA CTGACCTGGC TGTAATCAAA GTCGAGGGCA GCAATATGCC AAGCGTGCCG CTTGGTGATT CGACTTCCCT GCATCCGGGC CAGACGGTGC TTGCCTTCGG CAATCCGCTT GGCTTCCGCT TCACGGTGAC GCGCGGCATC GTCAGCGCAT TGAATCGGCC GAATCCCTAC GCGCAGGACC GTCGTTCTCC GGGACAGTTC ATTCAGACCG ACGCGGCGAT CAATCCCGGC AACTCCGGTG GGCCGCTGGT GAACGCCCAC GGTGAAGTGA TCGGGATCAA CACGTTCCTC ATCTCGGAGA CCGGCGGATT CTCGGGAATG GGATTCGCGA TTCCCACGCA GATCGTGAAG CCGACGGTGG ACAGCCTGAT CAAGTACGGC AAAGTGAACC ATGGATACAT GGGCATCGGG ATCAGCGATG TATCGCCGGA CGAGGCGAAG TTCTTCAACG TGACCGACGC AAACGGCGCC GTGGTAACGC AGGTGGAACC GAATTCGCCG GGCGCGAAAG CCGGCTTGAA GGTTGGTGAC ATCATCACTG CTGTGAACGG CAAGCAAGTC GCAGACGCCG GCGCACTGCA AGTGGAAGTG GGCCAGCAGC AGCCCGGGAC CAAACTCGAC CTGACGGTGA AGCGCGACGG CAAAGCCTCG ACGCTGAACG TAACGCTGGC CTCGATGGAC AAAGGTGATC GGGACAACGA AACGGCGAGC GCAGGTCATG GCAAGCCGCG GTGGGGAATC GGATTAGCCG ACCTGTCGCC GGAAGCGCGT CAGCAGTTGC AGGCGGGTGA TTCGGTGCAG GGTGCGCTGG TAGGACAAGT AACGCCCGGC AGCCCAGCCG ACAACGCGGG ATTGCAGCCC GGCGATGTGA TCACCGAGGT GAATCGCAAG CCGGTGAAAT CTGCGAGTGA CGCAAAGGAC GCGCTGAGCG GAATCGCGAA TGGCGGTGAC GCGCTGGTAC TGGTGTGGTC GCGCGGCGGC AGCAGCTTCC GGGTGTTGCA CGCGAGCCAG GGATAG
|
Protein sequence | MKVNNLLGTL KAKIGRRFYA SVLAGAVAFS LASYEFAGHA RAATPSPAAA ALDDNSVGAL LSLDKAMENL AARVTPATVN VTVTSKRSAH NASMQGTEDG DDDGNPMQQF GPFQFGPQRR QPQYEHGLGS GVIISPDGYI VTNNHVIDGA TDIRVTLTDK RILPAKLIGA DPLTDLAVIK VEGSNMPSVP LGDSTSLHPG QTVLAFGNPL GFRFTVTRGI VSALNRPNPY AQDRRSPGQF IQTDAAINPG NSGGPLVNAH GEVIGINTFL ISETGGFSGM GFAIPTQIVK PTVDSLIKYG KVNHGYMGIG ISDVSPDEAK FFNVTDANGA VVTQVEPNSP GAKAGLKVGD IITAVNGKQV ADAGALQVEV GQQQPGTKLD LTVKRDGKAS TLNVTLASMD KGDRDNETAS AGHGKPRWGI GLADLSPEAR QQLQAGDSVQ GALVGQVTPG SPADNAGLQP GDVITEVNRK PVKSASDAKD ALSGIANGGD ALVLVWSRGG SSFRVLHASQ G
|
| |