Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2635 |
Symbol | |
ID | 4072044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3106467 |
End bp | 3108128 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984652 |
Product | serine protease, kumamolysin |
Protein accession | YP_591710 |
Protein GI | 94969662 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.250819 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTATGGGA ATCCGTGGCT CGCGTCAACC CGCGCTCAGC AAACTGCACG GTTCAGCATC GCAGTGCTAC TCTCATCGCA CCCGGTTAAC CCCATGACAG ACGCCATTCC CCCAACTCGT CTCGATCTTG CCGATCTGAA TCGCGCTCCT CGCTCCGAGG AACAAGTCAT CGGCCGCACC GCTCCTGATA CCCAACTCTC GGTGACTATC GTCTTGCGCC GTGCTACTGA CGCGGCGATG CGCGCCGCCG ATCTCGCCGC CCTGCGCGAC TTCTCCATAC GACATAAGCT CGATCTCGAA GACTCCGGAG ACCCGGACGA CTTCGTAACC CTGTGCGGAC GCGCGGCCGA TTTCGAATAT GCGTTTCACT TTGAGTTGCT CGACGTGGAA CAGGATGGCG ACCGTTATCG CCGCTATACA GAAACGCCGT CTCTACGGCC GGGAATCCGC GAAGTGGTCG TCGGCATTTT CGGACTTCGC GACCGTCCCG CGCGTCCGCG TCCGCGGGTC GATCACGGCG GAACCACCGC GCCGTTCTGG ACTGCAACCG ACCTCGAGCG TGCGTACTCG TTTCCCGAAG GGACCGACGG TGCCGGCCAG ACAATCGCAC TCATCGAACT CGGTGGCGGC TACGACCCGC AAGACATAGC AGACTTGCTC GCGAGCCTGG GCCGCCCGCT GCCACAGGTG ACTTTCCGGC CCGTTGCGAA CGCCCTCAAC CAGCCTTGCG ACGCGGACAC GATTCAGCAG TGGCTCGATG TGATCGAAGG GCGCCTGCAA TTGTCCGCTG TCGATCCGAA GGTACTCGAG GCCGCACAAG CCACCGCGGA AGTCACCATG GACATCGAAG TCGCTGCCGC ACTCGCTCCC GGCGCCCACC TCGTCGTCTA CATGGCGCCG CCTACCGAGC AGGGCCTCTA CAAAGCGCTT GACGCAGCGA TCCACGACAC GCCTCCGCTT GTGGATGTCG TCTCCATCAG TTGGGGCGAA GCTGAGCTCT ATGTCTCCGA CGCCTACAGA AAATCGCTCA CGCAACTGCT GGAAGACGCC GCCGCGCGCG GGATCACTGT CTGCGCGTCG TCGGGAGATA ACGGCGCGTA TGACGATCCG CCCAATCAGA CTCTCTGCGT GAACTTTCCC GCCAGTAGCC CACTCGTACT CGCCTGCGGC GGCACCACGA TCGCAAGTTA TAGTTCCGGA ATCCAAAAAG AAGTCGTATG GAATTGCGGT GTGCATGGCA TCCACGCTGC CACCGGCGGT GGCGTCAGCG AACACTTCCC GCTGCCAACT TGGCAGGACG CGAAACTCGT CCCAGCATCC GCGAACGGAT ATCGCGGTCG CGGCGTGCCC GATGTCGCTG CTGTTGCCGA TCCTCATAAC GGCTGCGAGA TTCTCGTGCG TGGAATTCGT TGCAGTTCGT TCGGCACCAG CGCCGTCGTG CCCTTCTGGG CTGCGCTCAT CGCGCGTTGC AACCAAGCTT TGGGAAAACG CAGCGGCCAA ATCCAGCCAA AACTGTACGA ACTCGCCAAG TCCGAGAGTT CACCGTTCCG AGCGATTTTA GAGGGAGACA ATTTCTTCTA TCGTGCGGCG GCGGGATGGA ACCCTTGCAC AGGATTGGGA GCTCCAGATG GAAGCCGACT ACTCACTGCT TTACGGAGTT GA
|
Protein sequence | MYGNPWLAST RAQQTARFSI AVLLSSHPVN PMTDAIPPTR LDLADLNRAP RSEEQVIGRT APDTQLSVTI VLRRATDAAM RAADLAALRD FSIRHKLDLE DSGDPDDFVT LCGRAADFEY AFHFELLDVE QDGDRYRRYT ETPSLRPGIR EVVVGIFGLR DRPARPRPRV DHGGTTAPFW TATDLERAYS FPEGTDGAGQ TIALIELGGG YDPQDIADLL ASLGRPLPQV TFRPVANALN QPCDADTIQQ WLDVIEGRLQ LSAVDPKVLE AAQATAEVTM DIEVAAALAP GAHLVVYMAP PTEQGLYKAL DAAIHDTPPL VDVVSISWGE AELYVSDAYR KSLTQLLEDA AARGITVCAS SGDNGAYDDP PNQTLCVNFP ASSPLVLACG GTTIASYSSG IQKEVVWNCG VHGIHAATGG GVSEHFPLPT WQDAKLVPAS ANGYRGRGVP DVAAVADPHN GCEILVRGIR CSSFGTSAVV PFWAALIARC NQALGKRSGQ IQPKLYELAK SESSPFRAIL EGDNFFYRAA AGWNPCTGLG APDGSRLLTA LRS
|
| |