Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2074 |
Symbol | |
ID | 8535233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 2219729 |
End bp | 2220700 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646384452 |
Product | proline iminopeptidase |
Protein accession | YP_003263939 |
Protein GI | 261856656 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGAAC TGTATCCCGC GATTGAACCC TTGGTGACGC ACAGTATTCC CGTGGAGGCC CCACATATAC TTCATGCCGA GGAATGTGGA CGGCTCAAGG GTATTCCCGT GGTGTTCTTG CATGGTGGCC CCGGCGCCGG GTGCACGCCT GCCCATCGCC GTTTTTTCGA TCCTGACCGG TATCGCATTA TCCTGATCGA CCAGCGCGGT GCCGGTCGAT CCACGCCCCA CGCCCATTTG GAAGGCAATA CAACACAACA TCTGATTGCT GATCTTGAGC GGGTGCGCGT TCATCTGAAT ATCGAGCGAT GGCTTGTGTT TGGCGGTTCC TGGGGCTCGA CGCTGGCGTT GGCCTATGCG GCCACTCATC CAGAGCGGGT ACTGGGACTG ATCTTGCGCG GGATATTTCT TTGCCGCGAT GAGGATGTTT CCTGGTTCTA TCAGCGCGGA GCGGATCGCC TGTTTCCCGA TTATTGGGCC GACTATCTTG CTCCCATTCC CGAAGACGAG CGAGACGATC TGGTGGCAGC GTATCACCGT CGCCTGACGG GGAGCGACGA GTTGGCGCGG ATGCAGGCAG CCAAGGCCTG GTCGACCTGG GAGGGGCGAA CCGCAACGCT CCTGACTGAT CCGGCAACGG TCGATTTCTT TGCCGATCCG CATCATGCGC TCTCGATCGC TCGGATCGAA AATCATTACT TTATGCACGG CGCGTTCTTG CGCGAGCAAC CCTTGCTGGA ACAGGTTGAT CGACTGGCGG GTATCGAAGG GGAAATCATT CATGGACGGT ACGATGTGGT GTGTCCGGTG GATCAGGCGT TTTCCTTGGC TGCGGCTTGG CCGAATGCCA AGTTGACGGT TGTGGAGGAT GCGGGCCATG CCGCTAGTGA ACTGGGCATC ACCGATGCTC TGATTCGGGC AACGGATCGG TTTGCGGAGC GCTTGACCGG GCACCAGAAT GGTCGGGGAT AG
|
Protein sequence | MRELYPAIEP LVTHSIPVEA PHILHAEECG RLKGIPVVFL HGGPGAGCTP AHRRFFDPDR YRIILIDQRG AGRSTPHAHL EGNTTQHLIA DLERVRVHLN IERWLVFGGS WGSTLALAYA ATHPERVLGL ILRGIFLCRD EDVSWFYQRG ADRLFPDYWA DYLAPIPEDE RDDLVAAYHR RLTGSDELAR MQAAKAWSTW EGRTATLLTD PATVDFFADP HHALSIARIE NHYFMHGAFL REQPLLEQVD RLAGIEGEII HGRYDVVCPV DQAFSLAAAW PNAKLTVVED AGHAASELGI TDALIRATDR FAERLTGHQN GRG
|
| |