Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2102 |
Symbol | |
ID | 4069701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2514108 |
End bp | 2515646 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984117 |
Product | leucyl aminopeptidase |
Protein accession | YP_591177 |
Protein GI | 94969129 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCC ATCTCTCGAC CCTCGATCCC GCGCAGTTGG AAACCGACGC TCTCATCGTG CTCGCCATTG ACGGCGGCGA CAAAGACAAC AACAAGCCGC AACTCCAGGC CAAGTCCGAC GCATTCGCGA AAGCTGCCGC CGACCTCATC GCCTCAAAAG AGATCACCGG CAAGCTGCTC GAGATCGCGA CTCTGCATAA GCCCGAGGGC GTGAAGGCCA AGCGCTTGAT CGTCGTGGGG GTTGGCAAAG CCAAGAGCTT TACGTCGTAC GAATTGCGCA AAGCTGCAGG TGCCGCCGTG CGCGCACTGA AGAAGAGCGT GAAGTCCGCG GCCATCGTTG CCCCCGAAAA CTGGGGCGGA GCCGCCGATC CCTCCACGAC CAGCACGTTG ATGTTTGAAC GCGGCGGATT ACCTGAAGCG GTGAAGGCGA TTGCGGAAGG CGCGGTGGTT GCCAACCCGG ATTACAACTA TTACCACTCC GATCGCAAGA CCTACGAACT CGACGAGCTC ACCATCCTCG TCCCTGCCAA TGGCCACGCC AACGATCTTG AAGCGGCCAT GAAGGAAGGC CATGTCATCG GCGAGTCGCA AAACTTCACT CGCGATCTCG TCAACGAGCC CGGCAACCGC ATGACGCCAA CCATCCTCGG CCAGCGCGCC AAGAAGATGG CGGAAGAAGT CGGCATCCAG TGCGATGTCT ACAGCACCGA CTTCCTGCAC GAAAAGAAGA TGGGCGCGTT CTGGAGCGTC TCGCAAGGTT CCGAGGAACC GCCCGCGCTG ATCGTGATGA AGTACGAGCC AGCAGGCGCG CCGCAATCGC CGGTTCTCGG TCTCGTCGGT AAAGGCATTA CCTTCGACAC CGGCGGCATC TCCATCAAGC CCGCCGACGG CATGGAGAAG ATGAAGTACG ACATGGCCGG CGGTGCCGCG ATGATCGGCG CTATGCGCGC CATCGCGCTG CTCAAGCCGA ATGTGCGCGT GATCGGCGTC GTTTGCGCGG CCGAGAACAT GCCCAGTGGC AAAGCGCAGA AGCCCGGCGA TGTCCAGATC GCGATGAGCG GCAAATCCAT CGAGATCATC AACACCGACG CTGAGGGCCG CCTCGTCCTC GCCGACGGCC TGCACTACGC CAAACAACTC GGCGCAACTC ATCTCATCGA CGCCGCCACC CTCACCGGCG CCTGCATGGT CGCCCTCGGC GGCATCAACG CCGGTGTCTT CGCCAACGAC GAAGACTACT TCAACCGCTT CGCCGAGGCC TTGAAGAAAT CGGGCGAGAA GATGTGGCGG CTGCCCATCG ACGACGACTA CAAAGAACTC ATCAAGTCGC CCATCGCCGA CATCAAGAAC ACCGGCGGCC GTTACGGCGG CGCCATCACG GCAGCAATGT TCCTGAAAGA ATTCGTCGGC GAAACCCCAT GGATCCACCT CGACATCGCC GGCGTTGCCT GGCAGGAAGA AGCGGTGCCG TTCCTCGCGA AAGGGCCGTC GGGGATTGCG GTGCGATCGA TTATTGAGTT GGTGCAGAGC TTCGGATAG
|
Protein sequence | MKIHLSTLDP AQLETDALIV LAIDGGDKDN NKPQLQAKSD AFAKAAADLI ASKEITGKLL EIATLHKPEG VKAKRLIVVG VGKAKSFTSY ELRKAAGAAV RALKKSVKSA AIVAPENWGG AADPSTTSTL MFERGGLPEA VKAIAEGAVV ANPDYNYYHS DRKTYELDEL TILVPANGHA NDLEAAMKEG HVIGESQNFT RDLVNEPGNR MTPTILGQRA KKMAEEVGIQ CDVYSTDFLH EKKMGAFWSV SQGSEEPPAL IVMKYEPAGA PQSPVLGLVG KGITFDTGGI SIKPADGMEK MKYDMAGGAA MIGAMRAIAL LKPNVRVIGV VCAAENMPSG KAQKPGDVQI AMSGKSIEII NTDAEGRLVL ADGLHYAKQL GATHLIDAAT LTGACMVALG GINAGVFAND EDYFNRFAEA LKKSGEKMWR LPIDDDYKEL IKSPIADIKN TGGRYGGAIT AAMFLKEFVG ETPWIHLDIA GVAWQEEAVP FLAKGPSGIA VRSIIELVQS FG
|
| |