Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2116 |
Symbol | |
ID | 4069542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2530555 |
End bp | 2532501 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984131 |
Product | peptidase S9, prolyl oligopeptidase |
Protein accession | YP_591191 |
Protein GI | 94969143 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.577161 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTTC GTTCCGCCAT CATCACTGCC CTTCTCGCCA CGTTCGTCAT AACGGCTTCC GCCCAGCAAG ACCTTACTCA CGCTGCGGCG GTTGTGGACT CGATCCCTAA AACCAAGCGC ATTGACCAGG TCGCCATCTC GCCCGACGGT CAACAGGTGG CAGCCATCGT CGAGGGCCAG CTTACCGTTT CGTCGGCATC GGGCGGCGAT TCCCGCTCGA TCCCGCTGCG CAGCAAACAG CAGGCCCGCG AAGTCGCTTG GAGCAACGAC AGCCGCCAAC TCGCCATCAT CGGCGATCTC GATAGCGACG TCCCGCAGTC CGATATCTAT CTCTATAACG GTGGCGCCGC AAAGAGCATT GCCTCGCTCA AGGGTTACGT GCAAACCCCG CGCTTCTCGC CCGATGGCAG CAAACTCGCA CTCCTGTTCA TCGAGGACCT GCCGCGCGTG GCGGGACCGC TGCAGCCGAT GACGCCGTTA GCGGGCGTGA TTGACGAGAA GGTCTACGAG CAGCGCATTA CTACGATCGA CGTCGCGAGC AAGGCGATCA AGCAGGTCAC GCCTGCAGAC GTCTACATCT ACGAGTACGA CTGGCTTCCC GATGGCAGCG GCTGGGCGGC GATCGCGGCG CATGGCTCCG GTGACAACAA CTGGTGGATC GCGCGTCTCT ACCGCGGCGA TGCGGCAACC GGCGAACTGC ATGAGATCTA TGCGCCGAAG CTCCAGCTCG CGATGCCGCG GGTTTCGCCC GACGGCAAGA CGGTCGTGTT CATCGAGAGC CTGATGAGCG ACGAAGATGT AGTCGGCGGC GACATCTATA TCGTTCCAAT CGGCGGCGGA GAGGCGCGCA ATCTCACGCC TGGCATCAAG ACTTCGCCTG CTTCTCTGCG CTGGACCAAG GACGGGCACA TTCTCTTCGG GCAGAACGTA GACGGCGAAT CGGGCTTCGG GACCGTCAAC GCCAGCGGCG AGATCGCGAA GCTGTGGCAA GGTCCGGACG AAGTATCGGA TGCAAGCACT TCGGGAACCA TCGGCGGCTC GTTCTCAGCC GACGGCGCAC TGAGCGCGAT CGTCCGTCAG TCCAAGTCGG AAGCTCCCGA GATCTGGGTG GGCGCGATCG GCAAGTGGAC CAAGTTGACC TCGGTGAACG AGGAGGCGCA AGCGACGTGG GGCAAGAGCA ATAGCGTGCA CTGGATGAAC GGCACACAGC GCATCCAGGG CTGGCTCACC GCACCGAAGG AAGTGAAGCA GGGCGAGAAA TATCCGCTGG TCATCAGTGT TCACGGCGGT CCGTCGGCCT CGTGTAAGAA CAGTTGGGAT GTGCACTACG CTGCGCCGCT CTCGCTGATG GGGTACTACG TTCTCTGTCC GAATCCGCGC GGCAGCTACG GTCAGGGCGA AGCGTTCACC CGCGCCAACG TGAAGGATTT CGGCGGCGGC GATTATCACG ATATCGTCTC CGCGATTGAT GCGCTCGCCA AGGAATATCC GATTGATACC AAGCGCGTCG GCATCACCGG ACACAGTTAC GGTGGCTACA TGACGATGTG GGCAGAGTCG CAAACCACGC GCTTCGCCGC AGCGGTTTCA GGCGCAGGCC TTTCGCACTG GCTGAGCTAT TACGGTCTCA ACGATATCGA CGAGTGGATG ATTCCCTTCT TCGGCGCATC GGTGTACGAC GATCCTGCGG TTTATCTGAA GAGCGATCCC ATGCACTTCG TGAAGCAAGT AAAAACGCCA ACGCTGATTC TCGTCGGCGA TCGCGACGGC GAAGTGCCGA TGGAACAGTC GGTCGAGTGG TGGCATGCGC TGAAGACGTT CAACGTTCCG ACGACGCTCG TGGTGTATCC GAACGAAGGG CACGCGATCG GAAAACCTGC GGACCGTCGT GACTACGCGG TGCGAACCGC TGCGTGGTTT GAAGAGTGGT TTGCGAAGGT GAAGTAG
|
Protein sequence | MRVRSAIITA LLATFVITAS AQQDLTHAAA VVDSIPKTKR IDQVAISPDG QQVAAIVEGQ LTVSSASGGD SRSIPLRSKQ QAREVAWSND SRQLAIIGDL DSDVPQSDIY LYNGGAAKSI ASLKGYVQTP RFSPDGSKLA LLFIEDLPRV AGPLQPMTPL AGVIDEKVYE QRITTIDVAS KAIKQVTPAD VYIYEYDWLP DGSGWAAIAA HGSGDNNWWI ARLYRGDAAT GELHEIYAPK LQLAMPRVSP DGKTVVFIES LMSDEDVVGG DIYIVPIGGG EARNLTPGIK TSPASLRWTK DGHILFGQNV DGESGFGTVN ASGEIAKLWQ GPDEVSDAST SGTIGGSFSA DGALSAIVRQ SKSEAPEIWV GAIGKWTKLT SVNEEAQATW GKSNSVHWMN GTQRIQGWLT APKEVKQGEK YPLVISVHGG PSASCKNSWD VHYAAPLSLM GYYVLCPNPR GSYGQGEAFT RANVKDFGGG DYHDIVSAID ALAKEYPIDT KRVGITGHSY GGYMTMWAES QTTRFAAAVS GAGLSHWLSY YGLNDIDEWM IPFFGASVYD DPAVYLKSDP MHFVKQVKTP TLILVGDRDG EVPMEQSVEW WHALKTFNVP TTLVVYPNEG HAIGKPADRR DYAVRTAAWF EEWFAKVK
|
| |