Gene Information |
Locus tag | Acid345_0056 |
Symbol | |
ID | 4069990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 52837 |
End bp | 55080 |
Gene Length | 2244 bp |
Protein Length | 747 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982056 |
Product | peptidase S9B, dipeptidylpeptidase IV-like |
Protein accession | YP_589135 |
Protein GI | 94967087 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
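The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the reported gene length includes the stop codon; the function names are hypothetical and not part of the source database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 52837
END_BP = 55080


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 2244 747
```

Both results agree with the Gene Length (2244 bp) and Protein Length (747 aa) fields of the record.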
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAGCAC ATCTAGCTAC GTTCCTTCTC TTCACTTCCT CTTTTACGTT GGTAAGCGCG CAGGAAGCGC CGAAGCCGAA ACAGCTAACC ATTGAAGCGA TCTTCGCGAA GGGCGGAGTG CTCGGCCGCG CGCCGGAGTC GGTGGAGTGG AGTCCGGATG GAACGAAGGT CTCGTTCGTG CAGCGTGATG ATTCGGGCGA TAACGGTGCG CTCTATTACG TGGATGTGAC CACCGGAGCG AAGCCAGCTG TGCTTGTCGC GCAGGAGAAG CTGACCGAGA TGAAGCCGCC GGCGAAGACG AAGTCCGATG ACCGCGAGAA GGACAATCGC GAGCGGTATT CGGTCGCGGC CTATCATTGG GCGCCGGATT CCAAGCACAT CCTCTTCGAC TCGGGCGGTG CGCTGTGGAA CTATGACCTC GCGGCGGGGA AGTCGGCGAT GATTGCGTCG GCCGAAGGTG GGCTTGGCGA TCCGAAGTTC TCGCCGTCGG GAGATCGCAT TTCCTATCTG CGCGAGCACG ATCTGTATGT ATCGGGACTT GATGGAAAGG CGAAGCGAAT TACCGAAGGT GGCAATGCGA ACGTCCTGAA CGGCGAAGTG GATTGGGTTT ACGCGGAGGA GCTGAACGTC CGCAGCAATT ACTTCTGGTC ACCGAATGGG AAGCAGATTG TGTATCTGCA AATGGACCAG ACGAAGGTGC CGACCTATCC GATTACCGAT TACATCCCGA CGCAGGCGAC GGTGGATGAG GAGAAATTCC CCAAGCCGGG CGATCCGAAC CCGTCGGTGA AACTTGGCGT GGTGAGCGCG ACGGGCGGGA AGACGAAGTG GATCGAGCTG CCGGCAACGG ATGCATATAT CTCGCGCTTT GGGTGGGTGC ACGATGGACT GCTGTATGCG TTCGTGCTGA ACCGTCCGCA GAACAAGCTG GATTTGTACC TGGTGGATGC GAAGTCGGGG CGCACGCAGG TAGCGATGAC GGAGACGAGC CCGTCGTGGA TTGAGACCAA TGATGAGTAC AAGTTCATTG CTAATGGAGA GAAGCTGCTG TGGACGAGCT GGCGCGATGG GCACACGCAC ATCTACTTGT ATGACCTCGA CAAGAGCAAT GCGCTGGCGC CGCTGAAGCT GGAGCGGCAA CTGACGCGCG GGGATTGGGA CGTCGTCTCC ATCGATGGCG TGAATGAGAA GACGGGGATC GTCTACTACA GCTCGGACCA GGAAGACGAG CGGCAGCGGC AGGAGTATCG CGTGAACCTG GCCGATGGCG TGAGCGAGAA GATCACCAAG GACCACGGCA CGCATGAAGC GAAGTTTGCA CCCGAGGCGA ACTGGTTCGT GGACAACTAC TCGGCGCTGA CGACTCCGCC GGCGCTTGCG GTGTGCACGC TGAAGGATGA GTGCACAACG TTCTGGAGCG GACGGAGCGT GAAGGACTAC GCTCTGCTGG TACCGCAGTT TGTAGATGCA AAGGCCGATG ATGGCACGGT GATGCACGGG GTTCTGTTGA TGCCAACAGA AGGCGTAGCG ATGGTGAACG GCAAAGTGCC GTTGATCACC AATCCGTACG GCGGACCGGG CGTTCCGGGG GATTGGGATT CCTGGGGATC GGTTGATCTC TTCGACCAAT ATATGGCGAA GCGCGGCTAC GCGATCCTGA AGATGGAGAA CCGCGGCATG GCGGGACGCG GCGAGAAGTT CGCGGCGCCG ATTATGCACC ACATGTGCGA GCTTCCGCTG AAAGACCAAC TGGCTTCCGT CGAGCAGGTG CTGAAGCAGT TCCCGCAGCT CGATCGCGCC 
CGGCTTGGGT GGTGGGGATG GAGCTATGGC GGCACCATGA CCGCCTGGGC GCTCGAGCAT TCCGACTGGT TCAAGGTGGG CGTGAGCGTG GCGCCGGTGA CCGACTGGCG CAACTACGAC TCGATCTACA CCGAGCGCTA CATGGGCATG CCGAAGGAAC AGGCGGCGGA CTATGACCGG ACGTCGGTGG TGCTGAACGC GAAGCAGATT CATGGCAGGC TGCTGGTGGT GCACGGCACG AGCGACGACA ACGTGCACAT GCAGAACTCG ATGCAATTCA TGTATGCGTT GATCAATCAC GGCGTGCCGT TCGATGTGCA GATTTATCCG CGAAAAACGC ATTCGATCTC GGGAGAAGAA ACGAGGGTGC ATCTCTTTCA CCGCATTCAA AGGCAGTTTG ACGATGTGCT GATGCCGAAG ACGGCTGCGT CATCTACTCC GTAA
|
Protein sequence | MKAHLATFLL FTSSFTLVSA QEAPKPKQLT IEAIFAKGGV LGRAPESVEW SPDGTKVSFV QRDDSGDNGA LYYVDVTTGA KPAVLVAQEK LTEMKPPAKT KSDDREKDNR ERYSVAAYHW APDSKHILFD SGGALWNYDL AAGKSAMIAS AEGGLGDPKF SPSGDRISYL REHDLYVSGL DGKAKRITEG GNANVLNGEV DWVYAEELNV RSNYFWSPNG KQIVYLQMDQ TKVPTYPITD YIPTQATVDE EKFPKPGDPN PSVKLGVVSA TGGKTKWIEL PATDAYISRF GWVHDGLLYA FVLNRPQNKL DLYLVDAKSG RTQVAMTETS PSWIETNDEY KFIANGEKLL WTSWRDGHTH IYLYDLDKSN ALAPLKLERQ LTRGDWDVVS IDGVNEKTGI VYYSSDQEDE RQRQEYRVNL ADGVSEKITK DHGTHEAKFA PEANWFVDNY SALTTPPALA VCTLKDECTT FWSGRSVKDY ALLVPQFVDA KADDGTVMHG VLLMPTEGVA MVNGKVPLIT NPYGGPGVPG DWDSWGSVDL FDQYMAKRGY AILKMENRGM AGRGEKFAAP IMHHMCELPL KDQLASVEQV LKQFPQLDRA RLGWWGWSYG GTMTAWALEH SDWFKVGVSV APVTDWRNYD SIYTERYMGM PKEQAADYDR TSVVLNAKQI HGRLLVVHGT SDDNVHMQNS MQFMYALINH GVPFDVQIYP RKTHSISGEE TRVHLFHRIQ RQFDDVLMPK TAASSTP
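The relationship between the gene sequence and the protein sequence can be illustrated by translating the first few codons by hand. This is a minimal sketch, assuming the gene sequence is listed in coding orientation (it begins with TTG, an alternative start codon under translation table 11) and using a hand-built codon table rather than any library; `translate` and its `initiator` parameter are hypothetical names introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11; an initiator codon is
# translated as methionine regardless of its usual meaning.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq: str, initiator: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Spaces (as used in the 10-base blocks of the record) are ignored.
    If `initiator` is true, a leading start codon is read as 'M'.
    """
    dna = seq.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
print(translate("TTGAAAGCAC ATCTAGCTAC GTTCCTTCTC"))  # MKAHLATFLL
```

The output matches the first block of the protein sequence. The same function applied to the final 24 bases of the gene (with `initiator=False`, since that fragment does not begin at the start codon) yields the last seven residues and stops at the TAA codon.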