Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3224 |
Symbol | |
ID | 4072559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3817505 |
End bp | 3818839 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637985245 |
Product | aminopeptidase P |
Protein accession | YP_592299 |
Protein GI | 94970251 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.638198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0560599 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGTA AGCTGCTGGT CCTCGCTCTG CTCCTTGCGC CGTTCTCCGG CGCCATGGAA CGCCAAAATA ACGCCGACTA CCGTGCTCGG CGCCAGAAAC TTGCCGCCGA ATTGAAGGGC GGTGTTCTGG TGCTCTTCGC ACCCACCGAG CCCTCCGCCG GAAACGCCAC CAGCGGCTTT CGCCAGGACG ATAACTTCTA TTACCTCACC GGCTGGTCGG AGCCCGGCGC CGCGATCATG ATCGCCGCCG AAGTCGTGGC GAAAGATGAG CATCCCGCGC GTGCCTATAC GGAAGTGCTC TACCTTCCGG CGCACAACAC CGTGCAGGAA AAGTGGACTG GCCCGAAACT CGGCCCTGAG AACCCGCAAG CCCGCGACCT CACCGGATTC GACCGCGTCG AACTGCTCGA CAAAATGCGC GACGACATCG CCGACCTTCT CCAAAAGGAT CCTCGTGCGC CGATCTATTC CGACATCTCG ACTGGCGACG AAGTCTCGCC TTCCGCCGAC GGACTAGCCT GGCTGAAACG CGCCAACGCG TTTCCCGTCG TCCGCTTCGC CGACTTCAAG CCGATCGTCA GCGACCAGCG CCGTGTTAAG GACGCTGGCG AAATCGAGTT GATCCGCAAA GGTACGAATG CCTCCATCGC TGGCCATTTG GCCGCATTCA AAGCCATACA TCCCGGCGTA ACCGAGCGCG AAATCGCCGC GCTGCAGATG TACGAGTTCG GCAAGCGCGG CTGCGAGCGA CCGGCCTACG CGCCCATCGT CGGCTCCGGC TACAACGGCA CCGTGCTGCA CTACTCCGAA GATTCCGGCA CGCTGAAAGA TGGCGACCTC GTCGTCATGG ACGTAGCTGG CGAATACAGC ATGTACGCCT CCGACATCAC CCGTACGGCT CCGGTCAACG GCCATTTCAC GGCCCGCCAG CGCGAGATCT ATGAAATCGT CCTTGGCGCA CAGCGCGCGG CCATCGAAGC ATTCGTCTCA GGCAAGTCTG TGCTGCTCGG CAAGACTGAC GACTCGCTCT ACAAAGTCGC CTACGACTAC ATCAATACCC ACGGCAAAGA CCTGCACGGC GAGCCGTTAG GCAAGTACTT CATCCACGGC CTCGGCCACT ACGTTGGGCT TGAGGTCCAC GACCCCGGTT CCTACGCCAC GCCGTTGCAG CCAGGCATGG TCTTCACCAT CGAACCGGGT GTTTATATCC CCGAAGAGAA GCTCGGCGTA CGCATCGAAG ATATTGTGTA CGTTGACGCC AACGGCAAAC TCGTGGACTA CACCGCCGCG CTCCCGCACA CCGTCGAAGA AGTCGAAAAG GCAATGAAGA AATAG
|
Protein sequence | MIRKLLVLAL LLAPFSGAME RQNNADYRAR RQKLAAELKG GVLVLFAPTE PSAGNATSGF RQDDNFYYLT GWSEPGAAIM IAAEVVAKDE HPARAYTEVL YLPAHNTVQE KWTGPKLGPE NPQARDLTGF DRVELLDKMR DDIADLLQKD PRAPIYSDIS TGDEVSPSAD GLAWLKRANA FPVVRFADFK PIVSDQRRVK DAGEIELIRK GTNASIAGHL AAFKAIHPGV TEREIAALQM YEFGKRGCER PAYAPIVGSG YNGTVLHYSE DSGTLKDGDL VVMDVAGEYS MYASDITRTA PVNGHFTARQ REIYEIVLGA QRAAIEAFVS GKSVLLGKTD DSLYKVAYDY INTHGKDLHG EPLGKYFIHG LGHYVGLEVH DPGSYATPLQ PGMVFTIEPG VYIPEEKLGV RIEDIVYVDA NGKLVDYTAA LPHTVEEVEK AMKK
|
| |