Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2907 |
Symbol | |
ID | 4078585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 3074527 |
End bp | 3075507 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 638008236 |
Product | prolyl aminopeptidase |
Protein accession | YP_614901 |
Protein GI | 99082747 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAAAT ACCCGGATCA AAAGCGCGCA GTGCAGTTTC TTTATCCGCC GATCGAGCCT TTCGATCAGC GCATGCTGGA TGTGGGGCAA GGTCACAGGA TCTACATGGA GCAATGCGGA AATCCCGATG GGGTGCCGGT CATTGTCTGC CATGGCGGCC CCGGTGGTGG TACCAGCCCG GCGATGCGTC GGTATTTTGA TCCAAAGGTC TATCGTATCA TTCTGTTTGA TCAGCGTGGG TGCGGTCGCT CCAGACCTTA TGCGTCCTGC GAAGACAACA CGACATGGCA TCTGGTCGCG GATATGGAAC TGATCCGCGA GCTTTTGGAG ATCGACTCTT GGATGCTGTT TGGCGGCAGC TGGGGTGCAA CATTGGCGCT GATCTATGCG CAGACTCATC CTGAGCGCAC AACGCAGTTG ATCCTGCGCG GGGTGTTCTT GATGACCGAA GCTGAACTCG ACTGGTTCTA TGGTGGGGGT GCAGGGAAAT TCTGGCCGGA AACCTGGGCC AAGTTCACCG ATCTCGTGCC CAAGGACGAA CGGGACGATC TGATTGCGGC TTACCACAAG CGGCTGTTCT CTGGTGATCG CGACGAGGAA ATCCGCTTTG GTCGTGCTTG GTCGGCCTGG GAAAATGCCT TGGCGTCGGT TTATTCCAAC GGTTTGTCGG GTGAGAGTCC TGGAGATTAC GCGCGGGCTT TTGCGCGGCT TGAGAACCAC TATTTCTCCA ATGGTGGCTT CCTGCATATG GATGGACAGA TCCTTGCCAA CATGGGGCGC ATTGCCCACA TCCCTGGCGT GATCGTGCAG GGGCGGTATG ATATGATTTG CCCGCCGCAA GCGGCCTACA GCATTCATCA GGCTTGGCCG AACTCCGATC TGATCATGGT ACGAAACGCG GGACACGCGC TTTCGGAGCC GGGAATCAGC GCGGCTTTGG TCAGATGTAT GGATCAGCTG GCAGAGGAAA TGACCCAATG A
|
Protein sequence | MDKYPDQKRA VQFLYPPIEP FDQRMLDVGQ GHRIYMEQCG NPDGVPVIVC HGGPGGGTSP AMRRYFDPKV YRIILFDQRG CGRSRPYASC EDNTTWHLVA DMELIRELLE IDSWMLFGGS WGATLALIYA QTHPERTTQL ILRGVFLMTE AELDWFYGGG AGKFWPETWA KFTDLVPKDE RDDLIAAYHK RLFSGDRDEE IRFGRAWSAW ENALASVYSN GLSGESPGDY ARAFARLENH YFSNGGFLHM DGQILANMGR IAHIPGVIVQ GRYDMICPPQ AAYSIHQAWP NSDLIMVRNA GHALSEPGIS AALVRCMDQL AEEMTQ
|
| |