Gene Information |
Locus tag | Acid345_3483 |
Symbol | |
ID | 4069059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4108115 |
End bp | 4109911 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985505 |
Product | peptidase M28 |
Protein accession | YP_592558 |
Protein GI | 94970510 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
|
|
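The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS includes its terminal stop codon (the gene sequence in this record ends in TAA), the reported gene and protein lengths can be reproduced as follows. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a complete CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(4108115, 4109911)
print(length)                  # 1797
print(protein_length(length))  # 598
```

Both values agree with the Gene Length (1797 bp) and Protein Length (598 aa) rows of this record.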
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTGC GCAGAGTTTC GTTGGTTGCG ATCGCTCCGC TCCTGGTGAT TTTCGCCGCG TCTGCTGATA CGCCTACGGT TTCGCCGGCC GATCCGAACC GGTATATCGC GGATATCAAG GTGCTCGCGT CGCCTGAATT CGAGGGGCGA GGCGCGGGCA CCAAGGGCAT TGAGCGCGCG ACGAAGATGC TCGAACAGCG CTATAAGAGC CTGGGGCTGG AACCTGCGGG GATGAAGGGC TTTTTGCAGC CGTTCACGGT GACAACGGGC GCGAAGCTGA AATCGGACAA TCGCGCCAGC GTGAACAAAG CCGAGTTGCA GCTGACTAAG GACTACGTGC CGTTCAGCTT TTCATCTTCT GGAACGGTGA CGGCGCCGTT GGTGTTCGTG GGTTACGGTG CGAGCGCGGA TGAGTTCCAG TACGACGACT ATGCTGGTCA GGATGTGAAG GACAAGATCG TTGTCGTCTT GCGCTATGAG CCCGACAGTT TTTCGAAGAA CGGCAAAGAT GGGCTGACGC AACATTCGCA CCTGATCACG AAGGCGATCA ATGCGCGCAA CCACGGCGCG AAGGCTGTAA TCATCGTGAA CGGCAAGCTG GCGGACAACG AAGAAGACGT CCTGCCGCGG TTTGGAAGCA CGAGTGGGCC GCAGGATTCC GGTATTTTGC TGATCCAGGT AAAGAACGCG GTCGCGGATG AATGGTTCAA GGCGGCGGGA AAGTCGCTCA CCGAGGTGCA GATGCAGATT GCACATTCCG GCAAGCCGAA CTCCTTCGCC TTCCCTGCGG ACATGCAAGC CACGATCAAA GTGGACATTG AAGGAACGCA TGCGCGCGTA AATAACGTGC TCGCCTATCT GCCGGGCAAG AGAGATGAGT ATGTGATCAT CGGCGCGCAC TACGACCATC TGGGATATGG CGATTCGAAT TCGCTGGCGC CGTCGCAGAT TGGGCACATC CATCCGGGAG CAGATGACAA CGGTTCGGGT ACGGCGGGTG TGCTGGAGTT GGCGCGGGTG CTGGCGCCGT TAAAGGGGCA GCTTCCGCGC GGGATTTTAT TCATGTCGTT CGCGGGCGAA GAGCTTGGGC TGTTGGGATC GGCAGAGTGG GTGAAGCATC CGACGAAGCC GCTGGACAAG GCAGTAGCGA TGCTGAACAT GGACATGATC GGCCGCATTA AGGACGACAA GGTTTTCATC GGCGGCGTGG GTACGGGTTC AAGCTTCAAG CAGTTCCTTG AGGAAGACCA GCCGAAGTCA GGATTCAAGG TGGAGTATTC GCAAGGTGGG TATTCGGCGA GCGATCACAC ATCGTTCGTG ACCGCGAAGA TCCCGGTGCT GTTCTTCTTC TCGGGATTGC ACTCGGACTA TCACAAGCCA TCGGACACCT GGGACAAGAT CAATGCACCG GAGGCGGCGA AGCTGCTGAA CCTTGTGGAT CAGGTGGCGC TGCAAATCGA CGAGGACGCC AAGCGGCCGG AGTTTAAGAC GGTCATCGAA GACAAGCCTG TTGGCGGCGG GGGGAGCGGC TATGGGCCGT ACTTCGGATC GATTCCTGAT TTCGGCGAGG TGAAAGAGGG AGTGAAGTTC TCCGATGTGC GGCCGGGATC GCCGGCGGCG AAGGCGGGGT TGAAGGGCGG CGATATTCTC GTCCAGTTCG GTGATAAGCC GATTAAGAAC CTTTACGACT TCACGGATGC GTTGCGGCGA AGCAAGGTGG GGGATGTGGT TAAGGTGAAG GTTTTGCGCG ATGGGCAGCC GATTGAGGCA GACGTTAAAC TCGAACAACG CAAATAA
|
Protein sequence | MNVRRVSLVA IAPLLVIFAA SADTPTVSPA DPNRYIADIK VLASPEFEGR GAGTKGIERA TKMLEQRYKS LGLEPAGMKG FLQPFTVTTG AKLKSDNRAS VNKAELQLTK DYVPFSFSSS GTVTAPLVFV GYGASADEFQ YDDYAGQDVK DKIVVVLRYE PDSFSKNGKD GLTQHSHLIT KAINARNHGA KAVIIVNGKL ADNEEDVLPR FGSTSGPQDS GILLIQVKNA VADEWFKAAG KSLTEVQMQI AHSGKPNSFA FPADMQATIK VDIEGTHARV NNVLAYLPGK RDEYVIIGAH YDHLGYGDSN SLAPSQIGHI HPGADDNGSG TAGVLELARV LAPLKGQLPR GILFMSFAGE ELGLLGSAEW VKHPTKPLDK AVAMLNMDMI GRIKDDKVFI GGVGTGSSFK QFLEEDQPKS GFKVEYSQGG YSASDHTSFV TAKIPVLFFF SGLHSDYHKP SDTWDKINAP EAAKLLNLVD QVALQIDEDA KRPEFKTVIE DKPVGGGGSG YGPYFGSIPD FGEVKEGVKF SDVRPGSPAA KAGLKGGDIL VQFGDKPIKN LYDFTDALRR SKVGDVVKVK VLRDGQPIEA DVKLEQRK
|
| |
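The gene sequence is shown in space-separated blocks of ten bases, and the record lists translation table 11. The sketch below is one way to strip the grouping, compute a GC fraction, and translate codons using the amino acid assignments of NCBI table 11. It is a simplified illustration: the helper names are hypothetical, it assumes an unambiguous A/C/G/T sequence on the coding strand, and it does not apply the table's alternative start codon rule (which does not matter here, since this gene starts with ATG).

```python
# Codon assignments for NCBI translation table 11, in the conventional
# TCAG ordering (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean_sequence(grouped)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(grouped: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean_sequence(grouped)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGAATGTGC GCAGAGTTTC GTTGGTTGCG"))  # MNVRRVSLVA
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content row.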