Gene Information |
Locus tag | Acid345_4338 |
Symbol | |
ID | 4071756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5146227 |
End bp | 5147816 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986371 |
Product | peptidase M28 |
Protein accession | YP_593412 |
Protein GI | 94971364 |
COG category | [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases |
TIGRFAM ID | |
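The coordinate and length fields above agree with one another, and a short check makes the relationship explicit. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (which the reported Gene Length of 1590 bp implies) and that Protein Length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5146227, 5147816)
print(length)                  # 1590
print(protein_length(length))  # 529
```

Strand does not affect this arithmetic: a gene on the minus strand spans the same interval, only read in the reverse-complement direction.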
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.387318 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.135899 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAAG AGATATTGCT CTCTTCTTTA TTGGTAGCGA TTGCGGCGGG TGGACAGCAG ACTGGGCCGG TTGCGTTCGA TGGCAAGCAG TGGTGGACAT ACGTCAAGGT ACTTGCGGAC GACAACATGG AAGGGCGAAA CACCGGCAGC GATGGCGAGA AGCGTGCCGA AGCGTACGTA GTGGAGCAGG CGAAGGCGAG CGGTCTGCAA GCGGCCGGAA CGAATGGTTT TTATCAGCCG GTGAAGCTGG TAGAGAGCAA GCTGGATGAG GCGGGTTCAA GCTTTGCACT GGTGAAGGAT GGAAAGAGTG AGTCGTTGAC GTTGGGCGAT GACTTGACTC TAAGTGCGCG GCTGGACGGC GGCGACGTGG AAGCACCGCT GGTGTTCGTG GGATACGGGC TGACGATTCC GGAGAAAAAC TACGATGACC TCGCGGGACT CGACCTGAAG GGGAAGGTCG CGGTGATCTT CAGTGGATCG CCGGCTTCGA TTCCGACGGA GTTGGCCTCG CATGCGCAGT CTGCCGCCGA ACGGTGGAAG GCGTTGAAGG CGGCGGGAGT GGTGGGAGTG ATCTCGATCC CGAACCCAAA GGCGATGGAT ATTCCGTGGG AACGCATTAA GGGGAACCGG CTACAGGCGT CGATGCGGCT GGTGGAACTG AATGAGACGG CGGACGAGAA GCTGGGGGGA TATTTCAATC CGGCGTCGGC ACAGAAGCTT TTCGAAGGAT CAGGACATAG CTTCGACGAA ATTGCCGCGC TGGGAGCGAA CCGGGAGGCG TTGCCGCATT TCGCGCTCAA GGTTTCGGTG AAGGCCAAGA CGACGATCGA ACGCAAGGAG ATCGAATCGG CAAACGTGGT CGCGAAGCTG GTGGGATCCG ACGCAAAGTT GAAGAACGAG TATGTGGTGG TGTCGGCGCA TATAGACCAC CTGGGAATGG GAGAGCCGGT GAACGGCGAT CGCGTATACA ACGGCGCGAT GGACAATGGC TCAGGCAGTG CACTGCTGCT CGACCTGGCG CGGTCGTTCA AGGAGCATCC GGAGAACCTG AAGCGGTCGG TGCTGTTTGT GTGGGTGACG GGCGAAGAGA AAGGTCTGCT CGGTTCGCGC TATTTCGGAC TGCATCCGAC GGTATCGCGG CGGGCGATGG TGGCGGACAT TAATACAGAT ATGTTCCTGC CGATTGAGCC GATGAAAGTG ATTACAGCGT TCGGATTGAA TGAGACAACG CTGGGCGACG CATTGAAGAA GTTGGCGGGC GAGCGGAACG TGCAGGTACA ACCCGACCCG CAGCCGCTGC GGAACATTTT TATTCGCAGC GATCAGTACA GCTTCGTGCG AGTGGGAGTG CCTTCGATCA TGTTCATGGG GGGAAGTCCG GCCGATCCGG TACTGGAGCA GTGGTTGAAA GAGCGATACC ACGGGCGCAG CGATGACACG AACCAGCCGG TGGACTTGGT TGCGGCGGGA GAGTTCGAGG CGATATCGCG GGCGCTGGTG GTGGATGTGG CGAATGCGAG CGCAAAGCCG GAGTGGAAGG CGGAGAGCTT CTTTAAGAGA TATGTGGCGG AGCCGGATTT GGGTAAGTAG
|
Protein sequence | MRKEILLSSL LVAIAAGGQQ TGPVAFDGKQ WWTYVKVLAD DNMEGRNTGS DGEKRAEAYV VEQAKASGLQ AAGTNGFYQP VKLVESKLDE AGSSFALVKD GKSESLTLGD DLTLSARLDG GDVEAPLVFV GYGLTIPEKN YDDLAGLDLK GKVAVIFSGS PASIPTELAS HAQSAAERWK ALKAAGVVGV ISIPNPKAMD IPWERIKGNR LQASMRLVEL NETADEKLGG YFNPASAQKL FEGSGHSFDE IAALGANREA LPHFALKVSV KAKTTIERKE IESANVVAKL VGSDAKLKNE YVVVSAHIDH LGMGEPVNGD RVYNGAMDNG SGSALLLDLA RSFKEHPENL KRSVLFVWVT GEEKGLLGSR YFGLHPTVSR RAMVADINTD MFLPIEPMKV ITAFGLNETT LGDALKKLAG ERNVQVQPDP QPLRNIFIRS DQYSFVRVGV PSIMFMGGSP ADPVLEQWLK ERYHGRSDDT NQPVDLVAAG EFEAISRALV VDVANASAKP EWKAESFFKR YVAEPDLGK
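Both sequences above are printed in space-separated blocks of 10 characters, so the spaces must be stripped before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction. It is an illustration only: `clean_sequence` and `gc_fraction` are hypothetical helper names, the demo uses just the first two blocks of the gene sequence, and the record does not state how its GC content value was calculated.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-nt blocks of the gene sequence in this record.
first_blocks = "ATGCGTAAAG AGATATTGCT"
seq = clean_sequence(first_blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.35
```

Passing the full Gene sequence field through `clean_sequence` should give a string whose length matches the Gene Length field, and the same function applies to the Protein sequence field.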