Gene Information |
Locus tag | Acid345_4003 |
Symbol | |
ID | 4071139 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4729802 |
End bp | 4731487 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986030 |
Product | peptidase S10, serine carboxypeptidase |
Protein accession | YP_593077 |
Protein GI | 94971029 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
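The coordinate and length fields above agree with each other, and this can be checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start/End coordinates are 1-based and inclusive (the convention the 1686 bp figure implies) and that the CDS ends in one stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Acid345_4003.
length = gene_length(4729802, 4731487)
print(length)                  # 1686
print(protein_length(length))  # 561
```

Both results match the "Gene Length" and "Protein Length" rows of the record.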
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.927019 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAT CCTGGCTGAA CGCCCTACTG TGCTGCGCGA TTTTGGGCGC AACCGCAGGG GCACAGCAGA AGAATCCTCC GAAACCTCAA GTTGAAAAGA GTTCGCCCAG CGAAAACATC GCAAACCGAC CGGCGAACGC GCCGGAAGAA CCGCGCGCTC AGCGCGAAGA ACGACGGCAG GAAGACGCTC CGCAACAACC ACGCGGTGAG GGGCAGAAGC CGTCGATGAA GTGGGACATG ACGGAGACAG CTCCGGTGGT GACCCACCAC GAAATCAACG TAAACGGCCG GGCTCTGCGA TACACCGCCA CGGTCGGACG GTTGCCGATT AAAGACCTCA CTGGCACCAC CGAAGCGCTG ATGTTCTACG TCGCTTACAC GCTCGATGGA CAGGATGCGA CGAAGCGTCC AGTCACGTTT GCGTTTAATG GCGGTCCGGG ATCGGCGTCG ATTTGGCTGC ACATGGGAGC GCTCGGCCCG CGCCGCGTCG CGCTGCAGCA GGACGGCATG ATGCCGCCGT CGCCGTATCA CCTCATCGAC AACCCCGGCA CGCCGCTCGA AAAGACCGAC CTGGTATTGA TTGACGCCAT CGGCACCGGC TTCAGCCGCC CCGCGGACCT GGAAAAGGGC AAGAAGTTCT GGAGCGTGAA GGGCGACATC GAAGCGTTTG GCGAATTCAT TCGCCTCTAC ATCACGCGGA ATGAACGTTG GGCTTCGCCG CTCTACATCT TCGGCGAAAG CTATGGAACC ACGCGCGCAG CGGGAATTTC GGGCTATTTA GTGGATCGCG GCATTGCCTT CAACGGCATT TGCTTGCTCT CGGAAGTGCT GAACTTCGAG ACGTTGGAAT TCAGCAAGAG CAACGACCTT GGCTATCAGC TCACGCTGCC GTCGTACACC ATGATCGCCG GATACCACAA GATGCTCGCC CCGGAGCTTC TCCAGAACAT GGAGAAGACG AAGTCTGAGG TGGAGCAGTT TGCGAATGGT GAATACGCGC AGGCGCTGCA AGCGGGCGAC AGTCTCACTG CCGACCAGCG GGCGCACATC GTTGAGCAGC TCGCGAAATA CACGGGACTC AAGAAGGACT TCATCGAGCA GTCGAACATG CGGATCGATG TCCGCGGCTT CACGCATAAT TTGCTCATCG ACCAAAAGCT GCGCGTCGGA CGCCTCGACG GCCGCTATAC CGGACCGGAT CCAAACGGCC TGATGGATAC GCCGTTCTAC GATCCGACGG GCTCGGCAAC CGATCCACCG TTCACCGCGA CGTTTAACAA CTATCTGCGG AATGACCTGG GCTACAAAAC CGACATGCCT TACTACGTAT CGGCACGCGA CATGGCGGGC GCAACCGAGC CCGGGCAGCG CGGTGGTGGA CCGTTCCAGT GGGAGTGGGG ATCCGCAATT GAAGGCTATC CCGACACTGC GACCGCATTA CGTGCAGCGA TGGTGAAGGA CCCGTACTTG AAGGTGTTGG TGATGGAGGG CGATTACGAC CTCGCCACGC CGTATTTCGC GGCGAACTAC ACCATGAATC ATCTCGACCT GACGCAGCAG TACCGCAAGA ATATTTCGTA CGCGCGGTAT GCGGCGGGTC ACATGGTTTA TCTGCCGATG GATGGGCTTG CGAAGATGAA GAAGGACTAC GACTCGTTCC TCGATCAAAC CGCGAACCGT CAGTAA
|
Protein sequence | MKQSWLNALL CCAILGATAG AQQKNPPKPQ VEKSSPSENI ANRPANAPEE PRAQREERRQ EDAPQQPRGE GQKPSMKWDM TETAPVVTHH EINVNGRALR YTATVGRLPI KDLTGTTEAL MFYVAYTLDG QDATKRPVTF AFNGGPGSAS IWLHMGALGP RRVALQQDGM MPPSPYHLID NPGTPLEKTD LVLIDAIGTG FSRPADLEKG KKFWSVKGDI EAFGEFIRLY ITRNERWASP LYIFGESYGT TRAAGISGYL VDRGIAFNGI CLLSEVLNFE TLEFSKSNDL GYQLTLPSYT MIAGYHKMLA PELLQNMEKT KSEVEQFANG EYAQALQAGD SLTADQRAHI VEQLAKYTGL KKDFIEQSNM RIDVRGFTHN LLIDQKLRVG RLDGRYTGPD PNGLMDTPFY DPTGSATDPP FTATFNNYLR NDLGYKTDMP YYVSARDMAG ATEPGQRGGG PFQWEWGSAI EGYPDTATAL RAAMVKDPYL KVLVMEGDYD LATPYFAANY TMNHLDLTQQ YRKNISYARY AAGHMVYLPM DGLAKMKKDY DSFLDQTANR Q
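The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The sketch below is illustrative only and is not the database's own tooling. It strips the block spacing, computes a GC fraction, and translates codons with a hand-built table. It assumes the amino-acid assignments of translation table 11 match those of the standard code (the two tables differ in their permitted start codons, which this sketch ignores). To stay short it uses only the first 30 bases of the gene sequence, and all function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
prefix = clean("ATGAAGCAAT CCTGGCTGAA CGCCCTACTG")
print(translate(prefix))  # MKQSWLNALL
```

The translated prefix matches the first block of the protein sequence in the record. Running `gc_fraction` on the full cleaned gene sequence would give a value to compare with the "GC content" row.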