Gene Hneap_1884 details

Gene Information

Locus tag: Hneap_1884
Symbol:
ID: 8535042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothiobacillus neapolitanus c2
Kingdom: Bacteria
Replicon accession: NC_013422
Strand:
Start bp: 2019590
End bp: 2020603
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 59%
IMG OID: 646384265
Product: metalloendopeptidase, glycoprotease family
Protein accession: YP_003263753
Protein GI: 261856470
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
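
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2019590
END_BP = 2020603
GENE_LENGTH_BP = 1014
PROTEIN_LENGTH_AA = 337


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Here 2020603 - 2019590 + 1 gives 1014 bp, and 1014 / 3 gives 338 codons, of which one is the stop codon, leaving 337 residues.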


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.635445
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGTAT TGGGCATCGA AACATCGTGC GATGAAACGG CCATCGCCAT TTATGACACG 
ACCCGCGGTC TGCTCGCGAA TCAGATTCAC TCGCAAACCG ATGTGCATGC ATGCTATGGC
GGTGTCGTAC CTGAGTTGGC TGCCCGCGAT CACGTACGCA AGTTGCCGTT ATTGTTCAGG
GCGGCACTGA TTGAGGCGAA TCTTCGCCGC GATCAGATCA ATGCAATCGG ATACACGGCC
GGTCCCGGTT TGCAGGGCGC ACTGATGACT GGTGCCGCCT TCGCCAAGGG GTTGGCTCGG
GCGCTTCAAT GCCCTGCGCT GGGTGTCCAT CATCTGGAAG GTCACGTGCT GGCGCCACTG
CTTGAGGAAG AACGCCCGCA ATTCCCGTTT CTGGCTGTGT TGGTGTCCGG TGGGCACACA
CAGTTGATTG CAGTCAAAGC GCTGGGAGAC TACGCGCTGC TCGGGGAAAG TATTGATGAT
GCCGTGGGCG AGGCTTTCGA TAAATCTGCC AAACTCATGG GTTTGGGTTA TCCCGGAGGG
GCGGCGCTTT CCCAGTTGGC GCAGCGCGGA CGTCGTGATG CCATCCGCTT CCCCCGACCG
ATGATCGATC GACCGGGATT GGATTTCAGT TTCAGTGGTC TGAAGACGGC GGTGGCATTG
GCCATCGCTG CGGGCAAAGA TCACGCCGAT ATCGCCGCTT CATTCGAACA GGCCGTCATC
GATACACTCG CAATCAAAAT CGGGCGGGCA CTGGAGCAGA CCGGTTACCG CCACGTGGTG
CTGGCTGGCG GGGTTGCGGC GAATCGTCCT TTGCGGTTGC GACTCAAAGA AATGATGGAT
GAGCGTGGCG GGCAGGTGTT CTACCCACCG CCCATACTGT GTACCGACAA TGCGGCCATG
ATCGCTTTGG TTGCCGCGCT TCGGTTGGAG CGGGGCGAGC GTGATGCAGC GGCGGGGTTC
GAGGTTCGTC CTCGCTGGCC ATTGGTTTCC TTGAGCCATT TGTCGTCGCG GTGA
 
Protein sequence
MRVLGIETSC DETAIAIYDT TRGLLANQIH SQTDVHACYG GVVPELAARD HVRKLPLLFR 
AALIEANLRR DQINAIGYTA GPGLQGALMT GAAFAKGLAR ALQCPALGVH HLEGHVLAPL
LEEERPQFPF LAVLVSGGHT QLIAVKALGD YALLGESIDD AVGEAFDKSA KLMGLGYPGG
AALSQLAQRG RRDAIRFPRP MIDRPGLDFS FSGLKTAVAL AIAAGKDHAD IAASFEQAVI
DTLAIKIGRA LEQTGYRHVV LAGGVAANRP LRLRLKEMMD ERGGQVFYPP PILCTDNAAM
IALVAALRLE RGERDAAAGF EVRPRWPLVS LSHLSSR
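
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that produced this record: it builds the codon table by hand, relying on the fact that translation table 11 uses the same amino-acid assignments as the standard code (the two differ in which start codons are allowed). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
first_blocks = "ATGCGGGTAT TGGGCATCGA AACATCGTGC"
print(translate(first_blocks))  # MRVLGIETSC
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the complete protein and the reported GC content to be compared against the record.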