Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4664 |
Symbol | |
ID | 4070709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5521295 |
End bp | 5522875 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637986704 |
Product | peptidase M28 |
Protein accession | YP_593738 |
Protein GI | 94971690 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.062927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGCG AAATTCCGTC TTGCACCCTG TCAGTATATT TAGAATCTCT TACAGCTGGC GGTCAAGAGC AACGCCAAAT TTTACGCGCC CTTGCTCAGG GCTGCGGAAA CGGCATCCAA CCGGATGTTT GCCGCGGAAA GATTCGAATG TCCTTCGTAT CTCGATTGAA CTCTGCATGT CGAACCAGCG TTTCTTTGAT GCTTGCCAAC GCTATTGGTT GCGGGCTTGT GATGTTCAGC GGCGTATCCG CCGTTGCCCA GTCCAAGCCA GTGAAGAAAC CATCTCAAGT ATCGCCGGAA ATCCAGGCAA TACTGCGCGA TGTTTCAGCA AAGCAGATCG AGGCCAACAT CAACAAGCTC GTAAGTTTCG GCAACCGTTC CACGCTTTCG TCGGATGTAC CTGCCGATAG CGGTAAAGGC ATTACCGCGG CGCACGAATG GATCAAGAGC GAATTCGAGC GCTACTCGAA AGAATGCGGC GGTTGTCTCG AGGTAAAGAC GGACGACTTC ACCGAAAGTC CCATGGACCG CATCCCGAAG CCCACACAAA TTACCAATGT TTACGCGGTT CTGAAGGGCA CGGACCCGGC GAATGCCGAC CGCATCGTGC TCGTCACCGG ACATTACGAT TCACGCAATT CGACGAACGA GAACACCACG GATCCCGCGC CAGGAGCCAA TGACGATGGC AGCGGAACGG CGGTTTCGCT CGAATGTGCG CGAGTGCTGA GCAAGCACAA GTTTCCGGCG ACGATCATCT TCCTCACCGT CGCGGGCGAA GAGCAGGGAC TGAATGGCAG TAAACACTTC GCCAAGATGG CGCGCGCGCA AGGCTGGCAG ATCGAAGCAG CGTTGAATAA CGACATCGTC GGCGGCAACA AAACACCGGG CGATACGACG CAGAACCCGC ACACGGTGCG GGTGTTCTCC GAAGGCGTTC CCGCAAACGC TACGGAAGCC GACCTCAGGT TGATCCGCGC CACTGGTACG GAGAATGACT CGCCGTCGCG CGAACTGGCG CGCTACGTCG GTGAAGTGGG CAAGGCCGAC TTGCCGAAAA CCTTTCAACC GACACTGATC TACCGCCGCG ATCGCTTCCT CCGCGGTGGG GACCATAGCA GCTTCAATAT GGAAGGATTT GCCGCCGTCC GGTTCACGGA ATGGCGTGAG GACTTCCACC ATCAGCACCA GAACCTGCGG ACGGAAAACG GCATCGAATA TGGTGATTTG CCGAAGTTCG TGGACTTCGA GTACGTTGCC AACGTCGCGC GGTTGAATGC CATTACGCTG GCTACACTGG CGATGGCGCC TGCTCCTCCA GACAATGCCC GGTTACAGAC CAAAGACCTG GAGAACGGCA CCACGATTAC GTGGAAGCCC TCGCCGGGCG GGCTCGCGAC GGGCTACGAG GTGGTTTGGC GCAACACTTC ATCGCCCGAT TGGGAAGAAT CGAAGGGCTT CCCAGCCGAT GCGAACAGCG CCAAGATCGA CGTGTCGAAA GACAACGTCA TTTTCGGGAT CCGCGCCGTG GGCAAGAACG GGCTGAAGAG CCCAGTCGTA ATTCCTCCGC CGGAGAGGTA G
|
Protein sequence | MSREIPSCTL SVYLESLTAG GQEQRQILRA LAQGCGNGIQ PDVCRGKIRM SFVSRLNSAC RTSVSLMLAN AIGCGLVMFS GVSAVAQSKP VKKPSQVSPE IQAILRDVSA KQIEANINKL VSFGNRSTLS SDVPADSGKG ITAAHEWIKS EFERYSKECG GCLEVKTDDF TESPMDRIPK PTQITNVYAV LKGTDPANAD RIVLVTGHYD SRNSTNENTT DPAPGANDDG SGTAVSLECA RVLSKHKFPA TIIFLTVAGE EQGLNGSKHF AKMARAQGWQ IEAALNNDIV GGNKTPGDTT QNPHTVRVFS EGVPANATEA DLRLIRATGT ENDSPSRELA RYVGEVGKAD LPKTFQPTLI YRRDRFLRGG DHSSFNMEGF AAVRFTEWRE DFHHQHQNLR TENGIEYGDL PKFVDFEYVA NVARLNAITL ATLAMAPAPP DNARLQTKDL ENGTTITWKP SPGGLATGYE VVWRNTSSPD WEESKGFPAD ANSAKIDVSK DNVIFGIRAV GKNGLKSPVV IPPPER
|
| |