Gene Hneap_2074 details


Gene Information

Locus tag: Hneap_2074
Symbol:
ID: 8535233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 2219729
End bp: 2220700
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 59%
IMG OID: 646384452
Product: proline iminopeptidase
Protein accession: YP_003263939
Protein GI: 261856656
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
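
The coordinate and length fields above are related by simple arithmetic. The following Python sketch shows one way to cross-check them. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2219729, 2220700)
print(length, protein_length_aa(length))  # 972 323
```

Under these assumptions the computed values agree with the listed gene length (972 bp) and protein length (323 aa).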


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGAAC TGTATCCCGC GATTGAACCC TTGGTGACGC ACAGTATTCC CGTGGAGGCC 
CCACATATAC TTCATGCCGA GGAATGTGGA CGGCTCAAGG GTATTCCCGT GGTGTTCTTG
CATGGTGGCC CCGGCGCCGG GTGCACGCCT GCCCATCGCC GTTTTTTCGA TCCTGACCGG
TATCGCATTA TCCTGATCGA CCAGCGCGGT GCCGGTCGAT CCACGCCCCA CGCCCATTTG
GAAGGCAATA CAACACAACA TCTGATTGCT GATCTTGAGC GGGTGCGCGT TCATCTGAAT
ATCGAGCGAT GGCTTGTGTT TGGCGGTTCC TGGGGCTCGA CGCTGGCGTT GGCCTATGCG
GCCACTCATC CAGAGCGGGT ACTGGGACTG ATCTTGCGCG GGATATTTCT TTGCCGCGAT
GAGGATGTTT CCTGGTTCTA TCAGCGCGGA GCGGATCGCC TGTTTCCCGA TTATTGGGCC
GACTATCTTG CTCCCATTCC CGAAGACGAG CGAGACGATC TGGTGGCAGC GTATCACCGT
CGCCTGACGG GGAGCGACGA GTTGGCGCGG ATGCAGGCAG CCAAGGCCTG GTCGACCTGG
GAGGGGCGAA CCGCAACGCT CCTGACTGAT CCGGCAACGG TCGATTTCTT TGCCGATCCG
CATCATGCGC TCTCGATCGC TCGGATCGAA AATCATTACT TTATGCACGG CGCGTTCTTG
CGCGAGCAAC CCTTGCTGGA ACAGGTTGAT CGACTGGCGG GTATCGAAGG GGAAATCATT
CATGGACGGT ACGATGTGGT GTGTCCGGTG GATCAGGCGT TTTCCTTGGC TGCGGCTTGG
CCGAATGCCA AGTTGACGGT TGTGGAGGAT GCGGGCCATG CCGCTAGTGA ACTGGGCATC
ACCGATGCTC TGATTCGGGC AACGGATCGG TTTGCGGAGC GCTTGACCGG GCACCAGAAT
GGTCGGGGAT AG
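
The gene sequence is printed in groups of 10 bases, so it has to be joined before any analysis. The Python sketch below shows one possible way to strip the grouping and compute a GC percentage, which can then be compared with the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in dna)
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied as printed (grouped by 10 bases).
first_line = "ATGCGCGAAC TGTATCCCGC GATTGAACCC TTGGTGACGC ACAGTATTCC CGTGGAGGCC"
sequence = clean_sequence(first_line)
print(len(sequence), round(gc_percent(sequence), 1))
```

Passing the whole sequence block instead of one line gives the value for the full gene.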
 
Protein sequence
MRELYPAIEP LVTHSIPVEA PHILHAEECG RLKGIPVVFL HGGPGAGCTP AHRRFFDPDR 
YRIILIDQRG AGRSTPHAHL EGNTTQHLIA DLERVRVHLN IERWLVFGGS WGSTLALAYA
ATHPERVLGL ILRGIFLCRD EDVSWFYQRG ADRLFPDYWA DYLAPIPEDE RDDLVAAYHR
RLTGSDELAR MQAAKAWSTW EGRTATLLTD PATVDFFADP HHALSIARIE NHYFMHGAFL
REQPLLEQVD RLAGIEGEII HGRYDVVCPV DQAFSLAAAW PNAKLTVVED AGHAASELGI
TDALIRATDR FAERLTGHQN GRG
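
As a consistency check between the two sequences, the gene can be translated codon by codon. The Python sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). This sketch does not handle alternative start codons, which does not matter here because the gene starts with ATG. The function and variable names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGCGCGAAC TGTATCCCGC GATTGAACCC"))  # MRELYPAIEP
print(translate("GGTCGGGGAT AG"))                      # GRG
```

Both outputs match the corresponding ends of the listed protein sequence.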