Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0950 |
Symbol | |
ID | 4077344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1014475 |
End bp | 1015950 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006253 |
Product | leucyl aminopeptidase |
Protein accession | YP_612945 |
Protein GI | 99080791 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.111685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGC TCACCCCGAT CCGTTTTTCC GCCTTTGATC CCAAATCCCT GGCCGATGTC GAAGGTCATC TGGCTGTGGT GGTCACGCCC GAGGGCACAA TGGATCAGGC CGCCCGCAGT GCCAATCGGC TGACCAAGGG CGCGCTTGCC CGACTGATCG CCTCCGAGGC CTTTGCCAAA GCCAAGACCG GGCAGGTGAT TACGCTGGCC TGGCCGGCCG GTCTTGCGGT CGAGGCGCTG CATGTCGTTG TCTTGCCGCG TCGCCTCACG CCGATCGAGG CCCGCAAGGC CGGGGCGAAA CTTGCCGAGG CAAAAGGGCG CAAGCCGCTG GTTGTGAAGG CGGGGAACAT GGCGCGCGCC GCGGATCTGG CGCTGGGTCT CGCGCTGCGC AGCTACAGCT TTGATGCACA TAAATCCGCC GAGACCGAAA AGGCGGGCAC CATCACGATC GGTCACAACA AGCCCGAAGA GATGGAGGCT GCCTTTGCGC CGCTCCTTGC GGTGGCCGAG GGCGTGCATA TGACCCGTGA TCTGGTCAGC GAGCCTGCAA ACATCCTCAC CACAACCGAG TTTGCCAACC GCCTCGAAGA GATGACCGAG CTTGGTCTCG AGGTCGAAGT CCTTGAGGAA GCCGAACTCG AAAAGCTTGG CATGCGCACG CTGCTCTGCG TGGGGCAGGG GTCTGCCAGC CCGTCCAAAG TGGTCGTGAT GAAATGGAAC GGCGCCGATG ATAAAGAGGC CGCGCCTCTG GCCTTGGTCG GGAAGGGCGT GGTCTTTGAC ACTGGTGGCA TCTCGTTGAA ACCCGCCGCG GGCATGGAAG ACATGACCAT GGATATGGGC GGGGCCGGCG TTGTTGCTGG CACAATGCGC ACGCTTGCGC TGCGCAAGGC CAGAGCCAAT GTGGTGGGAC TTGTGGGGCT TGTTGAGAAC ATGCCCTCAA GCACCGCCAT CCGCCCCGGT GACGTGGTTA AATCCATGAA GGGCGATACC GTCGAGATCA TCAATACCGA TGCCGAAGGG CGACTGGTGC TCTGTGATGT GCTCTGGTAC GCGCAAGAGC GTTTCAAGCC CGCCGGGGTG ATCGATCTGG CGACTCTGAC AGGCGCAATC ATCATTGGCC TTGGCCATGA AAACGCAGGT GTCTTCTCAA ATGACGATGC GCTCTGTGGA GCCTTCCTCA AAGCCGCCGA CGCCGAAGGA GAAGGGGCTT GGCGCCTACC GCTCGGTCAA GCCTATGACG ATCAGTTGAA ATCCAATGTG GCGGATATGA AAAACGTTGG TGGTCGTCCG GCGGGGTCAA TCACTGCGGC GCAGTTCCTC CAGCGTTTCA TCAAGGAAGG CACGCCCTGG ATTCATCTCG ATATTGCAGG TGTGGCCTCG GTCAAGGCGG CAACGGACTA TGCGCCCAAA GGGGCGACCG GCTGGGGGGT GATGGCACTC AACCGTCTGG TGCAAGATCA GTTCGAGGCC GAGTAA
|
Protein sequence | MSALTPIRFS AFDPKSLADV EGHLAVVVTP EGTMDQAARS ANRLTKGALA RLIASEAFAK AKTGQVITLA WPAGLAVEAL HVVVLPRRLT PIEARKAGAK LAEAKGRKPL VVKAGNMARA ADLALGLALR SYSFDAHKSA ETEKAGTITI GHNKPEEMEA AFAPLLAVAE GVHMTRDLVS EPANILTTTE FANRLEEMTE LGLEVEVLEE AELEKLGMRT LLCVGQGSAS PSKVVVMKWN GADDKEAAPL ALVGKGVVFD TGGISLKPAA GMEDMTMDMG GAGVVAGTMR TLALRKARAN VVGLVGLVEN MPSSTAIRPG DVVKSMKGDT VEIINTDAEG RLVLCDVLWY AQERFKPAGV IDLATLTGAI IIGLGHENAG VFSNDDALCG AFLKAADAEG EGAWRLPLGQ AYDDQLKSNV ADMKNVGGRP AGSITAAQFL QRFIKEGTPW IHLDIAGVAS VKAATDYAPK GATGWGVMAL NRLVQDQFEA E
|
| |