Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3455 |
Symbol | |
ID | 4075089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 477875 |
End bp | 479293 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638004964 |
Product | leucyl aminopeptidase |
Protein accession | YP_611689 |
Protein GI | 99078431 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCCC ATTTTGCCGT GCCCAATCTG GCCGTACCGA CCTTTGCAGC CGCAACCGAT GCGGCCCTTC CCCTTCACAT CGTGACCGAA CGCACGCTGG ACGCCTGGCT TGCCGAACAG GACGGCGCCA CCGCAGGCTG GGTGCGCAGC ATGGGATTTC GCGCGTCCTG GGGACAGGTG GTTGTTCTGC CGGGACCGGA TGGCAGCCTT CGTGGCGCGC TGATCGGCGC AGGCACCCCC AAGAGTCGCC AGCGTCAGCG CTTCCTGCTG GCGGCTGCGG CGGCCAAACT GCCGGAGAGC ACCTATTCGC TGGCCTCTGA ACTGCCCGCA GATCTGTCGC TCGAGGTTGA GTGTTTTGGC TGGCTCATGG CGCAGTATGC CTTTGATCGC TACGCCAAAA ACACCGGTGC CAAAGCACGA CTTGTGGCCC CGGATGGCGT GGACGCAGCC CGGATCGAGG CGATGGCGGC GGGCGAGACC TTTACCCGCA CGCTCATCAA CACGCCCGCC GAACACATGG GACCCGCAGA GCTGCAAGGG GCCGTCGAGT CGCTGGCCGA AGCCCATGGG GCAGAGGTGC GCAGCATTGT GGGCGCGGAT CTTCTTGAGC AAAATTTTCC GCTCATTCAC ACGGTGGGCC GCGCGGCCGA GCGCGCGCCG CGCCTGATCG ATCTGCGCTG GGGCAGCGAA GGCCCAACGC TGACGCTGGT GGGCAAGGGG GTGTGTTTTG ACACCGGTGG TCTCAATCTC AAACCCGGAG CCAGCATGGC GTTGATGAAG AAAGACATGG GCGGGGCCGC CAATGTCCTG GGCCTTGCGC ATATGATCAT GGCGACGGGC CTCAAGCTTC GGCTGCGGGT GCTGATCCCG GCGGTGGAGA ACGCGATCTC CGGCCCGGCC TTCCGCCCGG GTGATATTCT GACCTCGCGC AAGGGGCTCA CGGTCGAGAT CAACAATACC GATGCCGAGG GGCGGCTGGT GCTCGCCGAT GCGCTGGCGT TTGCCCAGGA AGAGACGCCC GATCTGCTGA TCTCCATGGC GACGCTGACC GGTGCGGCGC GGGTGGCCGT GGGGGCCGAC ATAGCGCCCT TCTTCACCGA CCGCGAAGGC GATGCGGTGA CCCTTTCGAG CACGGCACAG GACGCGGCGG ATCCGGTCTG GCGCCTGCCC TTTCACGACG CTTACGAGGC CATGATCGAA CCCGGGATTG CCGATCTCGA CAATGCGCCA GCAGGCGGCT TTGCCGGGGC GATCACAGCG GCCCTGTTCC TGCGCCGTTT TGCGGATGAG GCGCGCTACA TGCATTTTGA TGTCTTTGGC TGGAGCCAGT CGAACCAGCC CGCCCGCCCG AAGGGCGGCG TCGGTCAAGG CACGCGGGCG CTTTATCAGG CTCTGCCAGA ACTGCTGGGT CTGTCATGA
|
Protein sequence | MQPHFAVPNL AVPTFAAATD AALPLHIVTE RTLDAWLAEQ DGATAGWVRS MGFRASWGQV VVLPGPDGSL RGALIGAGTP KSRQRQRFLL AAAAAKLPES TYSLASELPA DLSLEVECFG WLMAQYAFDR YAKNTGAKAR LVAPDGVDAA RIEAMAAGET FTRTLINTPA EHMGPAELQG AVESLAEAHG AEVRSIVGAD LLEQNFPLIH TVGRAAERAP RLIDLRWGSE GPTLTLVGKG VCFDTGGLNL KPGASMALMK KDMGGAANVL GLAHMIMATG LKLRLRVLIP AVENAISGPA FRPGDILTSR KGLTVEINNT DAEGRLVLAD ALAFAQEETP DLLISMATLT GAARVAVGAD IAPFFTDREG DAVTLSSTAQ DAADPVWRLP FHDAYEAMIE PGIADLDNAP AGGFAGAITA ALFLRRFADE ARYMHFDVFG WSQSNQPARP KGGVGQGTRA LYQALPELLG LS
|
| |