Gene Information |
Locus tag | Acid345_1930 |
Symbol | |
ID | 4071041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2320039 |
End bp | 2321979 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983942 |
Product | peptidase S9, prolyl oligopeptidase |
Protein accession | YP_591005 |
Protein GI | 94968957 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
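The coordinate and length fields above are consistent with one another, and the relationship can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2320039
END_BP = 2321979


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1941 646
```

Under these assumptions the result matches the listed gene length of 1941 bp and protein length of 646 aa.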
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTCTCAC TACGTCGCAT TGCAATACTC GCCCTGTGCA CTCTGTTTGT TTTGCCTGCG TTCGGGGAAG AGAAGTTCAC GACCGAGTTG ATTCCGCGCT CGGTGTTGTT CGGCAACCCA GAACGGGCCG ATCCGCAAAT CTCGCCGGAC GGGAAACAGC TTGCGTATCT CGCGCCGGTG AATGGCGTAC TGAATGTGTG GGTGCGTACG CTCGGCAAGA CTGACGATCG TGCCGTGACC AGCGACACCA ATCGCGGCAT CCGAAATTTC ATTTGGCAAT ACGATGACCA ACACCTTCTC TATCTTCAGG ACGCGGGAGG CGACGAGAAC TGGCGCCTCT ATCAAACTGA TATCGCCACC AAGCAGACGA AAGATCTCAC TCCGTTCGAC AAAGTTCGCG TGGATATCGT CGCCTATTCC TGGAAAACGC CCGACGCGAT CCTCGTCCAG ATGAACCAAC GCGACCCGAA GGTCTTCGAC GTGCACCGCG TTGATCTTAA GACCGGCAAG GTGACGCTCG ACACGCAGAA CCCTGGCGAT GTGGCGAGCT GGCAGGCCGA CAACTCTCTC GAAGTTCGCG CTGCGCAGGT ATCAACCGAC GACGGCGGGA CCATCATCCG CGTGCGTAAC GACGTCAAGT CGCCATGGCG CGACCTGATG AAATGGGGAC CCGATGAGAC CTTCGGCGGC GTCGGTGGGT TCACGCCCGA CAATCGTTCG CTGTGGGTGA TGACCAGCCT CGATGCAAAC GCCGCGCGTT TGCTGGAGAT CGACATCGCC AGTGGCAAGC AGAAGGTGGT TTCGGAAGAT CCGCAGTTCG ACGTTCGGGC GACGATCAAC AATCCTAAGA CGAATGCACT CGAAGCCATC AGCTATGCCA AAGACAAAAC CGATTACATC TTTGTAGATC CCAAGGTAAA GAGCGACTTC GCCGTGCTTT CCAAGGTGCA TGATGGCGAG ATCGACAGCC TTTCGCAAAG CCTCGACGAC AATCGCTGGA TCGTCGGCTA CATCAGCGAC GACGCTCCTG AGTACTGGTA CCTCTACGAT CGCCCGACGC AGAAAGCTAC GCTGCTCTTC AGCAACCGTC CGCAACTGGA GAAGTACAAG CTCTCGAAGA TGCAGCCGAT CGAGTACACC GCGCGCGACG GCATGAAACT CTATGGCTAC TTGAGCACGC CGGCCGGCAT GGAAGCAAAG AACCTGCCGA TGGTCGTCTT CGTTCACGGT GGCCCGTGGG GCCGCGACGA GTGGGGGTAC AACCGCTACG CGCAGTGGCT CGCCAATCGT GGATATGCAG TGCTACAAGT GAATTTCCGC GGTTCAACCG GCTATGGCAA GAAGTATGTC AATGCTGGAG ATCGTCAATG GGCAGGATCG ATGCATACCG ATCTCCTCGA CGGCAAAGAC TGGGTTGTGA AGCAGGGCAT CGCCGATCCT GCGAAAGTTT GCATTATGGG CGGCAGCTAC GGCGGCTATG CAACTCTGGC CGGCGTGACC TTTGCGCCCG ATGCATTCGC CTGCGGCGTG GACATCGTTG GTCCGTCGAA CCTGAACACG CTATTGAAGA CGATTCCGCC TTATTGGTCC ACGATATTGT CCACCTTCCA CAAACGCATG GGAGATTCGG AAGCGGTGCT CACTTCGCAG TCGCCGCTCT TCAAAGCCGA CCAGATCAAG GTGCCGCTGC TGATCGGGCA GGGCAAAAAT GATCCCCGCG TAAACGTGGC GGAGAGCAAT CAGATCGTGG CCGCTATGCG CAAGAACAAT AAGCCGGTGG AATACTACAT CTTCCCCGAC 
GAAGGCCACG GCTTCGCCAA ACCAACCAAC AACATGGCCT TCAATGCGGC GTCAGAGGAG TTCCTCGCCA AATACCTCGG AGGCCGTGCC GAGCCGCCAA GCGAGGCAGA AAGCAAGCTG CTGGCCAGCA TCAAGCAGTA A
Protein sequence | MFSLRRIAIL ALCTLFVLPA FGEEKFTTEL IPRSVLFGNP ERADPQISPD GKQLAYLAPV NGVLNVWVRT LGKTDDRAVT SDTNRGIRNF IWQYDDQHLL YLQDAGGDEN WRLYQTDIAT KQTKDLTPFD KVRVDIVAYS WKTPDAILVQ MNQRDPKVFD VHRVDLKTGK VTLDTQNPGD VASWQADNSL EVRAAQVSTD DGGTIIRVRN DVKSPWRDLM KWGPDETFGG VGGFTPDNRS LWVMTSLDAN AARLLEIDIA SGKQKVVSED PQFDVRATIN NPKTNALEAI SYAKDKTDYI FVDPKVKSDF AVLSKVHDGE IDSLSQSLDD NRWIVGYISD DAPEYWYLYD RPTQKATLLF SNRPQLEKYK LSKMQPIEYT ARDGMKLYGY LSTPAGMEAK NLPMVVFVHG GPWGRDEWGY NRYAQWLANR GYAVLQVNFR GSTGYGKKYV NAGDRQWAGS MHTDLLDGKD WVVKQGIADP AKVCIMGGSY GGYATLAGVT FAPDAFACGV DIVGPSNLNT LLKTIPPYWS TILSTFHKRM GDSEAVLTSQ SPLFKADQIK VPLLIGQGKN DPRVNVAESN QIVAAMRKNN KPVEYYIFPD EGHGFAKPTN NMAFNAASEE FLAKYLGGRA EPPSEAESKL LASIKQ
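Both sequences are printed in space-separated blocks of ten characters, so they need cleaning before use. The sketch below strips the whitespace, computes GC content, and translates a nucleotide string using the codon assignments of translation table 11, which are the same as those of the standard code for non-start codons. The helper names are hypothetical, and the snippet uses only the first three and last two blocks of the gene sequence as sample input rather than the full record.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (allowed by table 11) are not special-cased;
    this record starts with ATG, so that does not matter here.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


head = clean("ATGTTCTCAC TACGTCGCAT TGCAATACTC")  # first three blocks of the gene
tail = clean("AAGCAGTA A")                        # last two blocks of the gene
print(translate(head))  # prints: MFSLRRIAIL
print(translate(tail))  # prints: KQ
```

The two printed fragments agree with the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the reported GC content.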