Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2819 |
Symbol | |
ID | 4071822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3346492 |
End bp | 3348765 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984837 |
Product | glutamate carboxypeptidase II |
Protein accession | YP_591894 |
Protein GI | 94969846 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.140178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGTTC GTCGCGCATT GATCGCGTCG CTGCTTTGCT GCGCGGTGGT TCCGCTACCG GCGCAGACTT CGGCCCCGGC GCTCGCAGGA TATTCGGCGC AGAATTCGGC GTCTGAGCAG CAGTGGGAAG AGAAATTCCG GGCGATTCCC TCCCCCGATA ACCTGAAGAA CTACATGCAG CGCCTGAGCG CGCGTCCGCA CCACGTGGGC TCGCCCTATG ACAAGGACAA TGCCGAGTGG ATGCTGGCGC AGTTCAAGTC GTGGGGACTG GATGCGCATA TCGAAGAGTT CGACGTGCTC TTCCCGACGC CGAAGGAGCG CTCTGTCGAG TTGATTGAGC CGACACACTT TGTGGCGAAA CTGCAGGAGT CGGCGGTGAA TGGAGATCCG ACTTCGAGCC AGCAGAAAGA ACAGTTGCCG ACATATAACG CGTACTCGGC GGATGGCGAC GTCACCGGGC CGCTGGTGTA TGTGAACTAC GGCATGCGCG AGGACTACGA GAAGCTGGAG CGGCTGGGCG TTTCGGTGAA GGGCGCAATC GTCATCGCAC GTTACGGCGG CGGCTGGCGC GGCATTAAGC CGAAGGTTGC GGCGGAGCAC GGCGCGGTGG GCTGCATTAT TTATTCCGAC CCGAAAGACG ATGGATACTC GGAAGGTGAT GTATTTCCGA AGGGCGCGTT CCGTCCGTCG GATGGCGTTC AACGCGGAAG CGTCGAGGAC ACGGACTATC CGGGCGATCC GCTGACGCCG GGCGTTGGCG CGACGAAAGA TGCGAAGCGG CTGACGGTGA AGGAGTCGCC GGTAATCCAG AAGATCCCGG TGCTGCCGAT ATCCTATGGC GATGCGCAGC CGCTGATGCA AGCGATCCAG GGGCCGGATG TTCCGCAGGA ATGGCGCGGC GGGCTGCCGA TTACCTATCA CGTGGGGCCG GGGCCGGCGA AAGTGCACCT CAAGATGTTC TCCAACTGGG ACATCAAGCC GGTGTATGAC GTGATCGCGA AGATTCCGGG CTCGGAATTT CCGGATGAGT GGGTTATTCG CGGGAACCAT CATGATGCGT GGGTGAATGG CGCGGAAGAC CCGCTCTCGG GCATGGTCGC GCTTATGGAA GAAGCGCGCT CGATGGGCGA ACTGCTGAAG CAGGGATGGA AGCCGAAGCG CACGATCATC TTCTGCGCGT GGGACGGTGA GGAACCGGGA CTGCTCGGTT CGACCGAGTG GGTGGAGACG CATGCCCAGG AACTGACCGA GCACGCGGTG ATGTATGTGA ACTCGGATTC CAACGGTCGC GGCTGGTTGT TTGCTTCGGG CTCGCACTCG CTTGAGCATT TCGTGAACGA CGTCGCGAAG ACCGTGCAGG ACCCTGAGAC CAAGAAAACT GCGTGGGAGC GGTCGCGGCT GGTGCGCATA TCGCGCGCGA AGGGCGACGA ACGTGGTGAG GTCAGGAGCC GTCCGGACAC GCGAATTGGC TCGCTGGGCG ATGGTTCGGA TTACGCATCG TTCCTCGATC ACCTGGGCGT GGCCGCGCTG AATATTGGCT ACGGCGGCGA GACCGATGGC GGTATCTATC ACTCCATCTA CGACGATTTC TACTGGTATA CGCACTTCGG CGATCCGAGC TTCCAGTACG GACGTGCGTT GTCGCAGACC GGTGGAACGC TGGTGATGCG GATGGCGGAT GCGGATGTGC TTCCCTTCCA GTTCACAAAT GCCGCCGAGA CCGTTGGCAG GTTCGTGAAT GAATTGAAGA AGCAGCTGAA GGAGAAGCAG GACGAAATCG CCGAATTGAA CACAGAGATT GATGAAGGTA TGTTCACGGC GACGGCGGAC CCGACCAAGA AATTTGTGCC TCCGCCGAAG GAGCAGGTGC CTCCGTTCTT GAACTTCGCG CCGCTGGATA ACGCGGTGGT GACCTTCAAG AAGAGCGCCG AACGTTACGA CAAGGCAATG AAGGCGCTCG TCAAGGCCGG CTTGCCGACT GGCGACAAAA CAGCCGAGCT GAACAAGTTG CTCATCCAGA GCGAGCGTGC GTACACAAAT GCACAGGGGC TTGCGGAGCG GCCGTGGTTC AAGCACATGA TCTATGCACC GGGCGCTTAC ACCGGCTACG GCGTCAAGAC GATTCCAACC GTGCATGAGC CGATGGACGC GAAGAAGTGG GCGCAGGCAG ATGCCGGCGT ACCAGCAGCA GCCAAGGCAA TTGAGGACGA AGCCAAGCTG GTGGACTCCG CGGCGGAAGC GGTGGAGAAG TTGGCGGGCG AGGCTGGAAA GTAG
|
Protein sequence | MQVRRALIAS LLCCAVVPLP AQTSAPALAG YSAQNSASEQ QWEEKFRAIP SPDNLKNYMQ RLSARPHHVG SPYDKDNAEW MLAQFKSWGL DAHIEEFDVL FPTPKERSVE LIEPTHFVAK LQESAVNGDP TSSQQKEQLP TYNAYSADGD VTGPLVYVNY GMREDYEKLE RLGVSVKGAI VIARYGGGWR GIKPKVAAEH GAVGCIIYSD PKDDGYSEGD VFPKGAFRPS DGVQRGSVED TDYPGDPLTP GVGATKDAKR LTVKESPVIQ KIPVLPISYG DAQPLMQAIQ GPDVPQEWRG GLPITYHVGP GPAKVHLKMF SNWDIKPVYD VIAKIPGSEF PDEWVIRGNH HDAWVNGAED PLSGMVALME EARSMGELLK QGWKPKRTII FCAWDGEEPG LLGSTEWVET HAQELTEHAV MYVNSDSNGR GWLFASGSHS LEHFVNDVAK TVQDPETKKT AWERSRLVRI SRAKGDERGE VRSRPDTRIG SLGDGSDYAS FLDHLGVAAL NIGYGGETDG GIYHSIYDDF YWYTHFGDPS FQYGRALSQT GGTLVMRMAD ADVLPFQFTN AAETVGRFVN ELKKQLKEKQ DEIAELNTEI DEGMFTATAD PTKKFVPPPK EQVPPFLNFA PLDNAVVTFK KSAERYDKAM KALVKAGLPT GDKTAELNKL LIQSERAYTN AQGLAERPWF KHMIYAPGAY TGYGVKTIPT VHEPMDAKKW AQADAGVPAA AKAIEDEAKL VDSAAEAVEK LAGEAGK
|
| |