Gene Hneap_1478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHneap_1478 
Symbol 
ID8534635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothiobacillus neapolitanus c2 
KingdomBacteria 
Replicon accessionNC_013422 
Strand
Start bp1595731 
End bp1598643 
Gene Length2913 bp 
Protein Length970 aa 
Translation table11 
GC content56% 
IMG OID646383868 
ProductPeptidase M16C associated domain protein 
Protein accessionYP_003263357 
Protein GI261856074 
COG category[R] General function prediction only 
COG ID[COG1026] Predicted Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAA GCATCCATCA CCCTGCGTTT ACCTTTGTTC GATCTCAGCC GATTGCGGCG 
TTGAATCTCA CGGTGGATGA ATACCGTCAC AATGCCACGG GCGCACGGCA TTATCATATG
GCGACCGATG ATCCGCAGAA TGTCTTTTTG GTGGGCTTGC GCACGGTGCC TGAAGATTCG
ACCGGGGTGG CGCATATTTT GGAGCACACC GTTTTGTGCG GCTCGGAGCG GTTCCCCGTG
CGTGATCCGT TTTTCATGAT GATCCGCCGT TCGCTCAATA CCTTTATGAA CGCGTTTACC
GCCAGCGATT GGACAGCGTA TCCGTTCGCT TCGGTCAATG TGAAAGATTT CTACAACCTG
CTGGATGTGT ATCTTGATGC GGTGTTTTTC TCACGCATCG ATGAACGCGA TTTCCGCCAG
GAAGGCCACC GCGTTGAATT CACCACGCCC GAAGATCCAT CCACACCGCT GACGTTCAAA
GGCGTGGTGT TCAATGAAAT GAAGGGCGCC ATGAGCAACC CGTCGTCTGT GCTCTGGCAG
ACGTTGACGA GCGCATTGTT CCCAACAACG ACCTATCACT ACAACTCCGG CGGCGAACCC
GTTGATATTC CTAACCTGAC CTACGCGCAA TTAAAAGCGT TTTACCAGCG GTTCTATCAT
CCATCGAACG CCGTATTCAT GACCTACGGC AATTTGCCTG TGAGCGATCT GCAAACGCAG
TTCGAAGAAA AGGCATTGAA GCGCTTTTCG CGCATCGATC CAAATTCGGC CGTGCCCATC
GAACAGGCGT TAACGTCCCC GCGCTTGATC GAAGAAAATT ATGCGCTCGA TGAACCCGAT
ACCGCCGAGA AAACCCATGT GGTTGTCGGC TGGTTGCTGG GTGAAAGTAC CGATCTGGAT
GCGGCACTGG AAGCCCAACT GCTCGAAGGC GTATTGCTTG AAAACAGCGC TTCTCCGCTC
TTGCGCGTGT TGGAAACCAC TGATTTGGGC GGTTCGCCTT CGCCCATACT GGGGCTGGAA
GACTCACAAC GGCAGATGGT GTTCGTGGCC GGTGTCGAGG GTAGCGAGCC GGATCGCGCC
GAGGCCGTGG AGAAACTGGT ACTCGATACC TTGGCCGAGA TCGCCGAAAA GGGCGTGCCT
GCGGATATGA TCGAGTCGGT GCTGCATCAG ATAGAACTCT CGCAGCGCGA AGTGACGGGC
GATGGCATGC CCTATGGCCT GCAACTGATC CTGCATGGTC TGCCTGCTGC GATTCACGAT
GGTGACCCGA TTGCGGTGCT CGATTTGGAG CCTGCGCTGG CGCGGTTGCG TAAGAAAGCC
GCCGATAACC AGTTCATCCC GAATCTGATC CGCACATTGC TGCTCGATAA CGCCCACCGC
GTGCGGGTTG TACTCAAGCC AGATACTGAG CTTTCTGCCG CCAAACAGGC CGCAGAGCTT
GCCCGTCTGG CCGCTATGCA GTCTGCCATG ACGGATGCCG AGAAACAGGC GGTTGTCGAA
CAAGCCAAGG CATTGGCCGA GCGGCAGGCC GAAGTCGATG ACATCAGTAT CCTGCCGACC
GTGACGCGCG AGGATATCCC CGAACACATC GATTTGCCAA CGCCAGAAAA GACCCTCCAT
CATCCGGCGA CATCAACCTG GTTTAATCGC AGCACCAATG GCTTGGTTTA CCTGCAAGCC
GCGCTGGATT TGCCGCAACT GACCCACGAT GAGCTCGATC TGTTGCCGAT TTACAGCGGC
GTATTGACCG AGCTGGGGGC AGGCGACCGT GATTATCTGC AAATGGCCGA GGCCGTTGCG
GCGCGCACGG GCGGGTTTTC TGCCCGCTCA TCGATCCGCC CGGATCTTAA CAATGCACAT
AACTTGAGTT CGTTCTTCCT GCTCGGCGGC AAGGCATTGG TTCGCCATAC CGATGAGCTT
GTTGAGCTGT TCCATCAGCA CCTCAACGCG GCACGCTTCG ATGAAACCAG CCGTATTCGG
GATCTGATCA GCCAGATACG TTTTCGCAGT GAACAAGGCA TCGCCGGCGC GGGTCATGTT
CATGCCATGA ACCTAGCCTC CAGCGGGATG TCGGCTCGCG CCAAATTGAC CCATGAATCG
GGTGGTGTGG CCGGTGTTCG GCGCATCAAG GCGATGGATG ACGCGCTCGA TGAAACCAAG
GCAATCAACG ATGTAGCCGA GCGATTGGCG CGACTGCACG ATAAGCTCAA GGGCGGGTTG
CGGCAATACA ATGTGATTGC CGAACAACGT CATTTCGATG CCATCCAACC GGTGTTGGAA
CGAGCCATGC AGCATGGCAA TGCGGTTGAG CACTTCCACT TGTCCAAAGT GCATCAGCCG
GTGCGTGAAG CCTGGATCGG CAACCTCGCG GTGAATTATT GCGCCAAGGC CCATGCCGCC
GTGCCGCCAA TGCATGAAGA TGCCGCCGCG CTGGCGGTTC TAGGCGGCTT TTTGCGCAAC
GGTTATCTGC ACCGCGCCAT TCGTGAGCAA GGCGGTGCGT ATGGCGGTGG CGCGGGTTAC
GATTCTGAAT CGGCAAGTTT CCGCTTTTTC TCTTACCGCG ATCCGCGCTT GACCGACACG
CTTAACGATT TTGATCGAGC CATTGATTGG CTGCTGGATA ATACACACGA TGGCCGCACG
GTCGATGAAG CGATTTTCGG TGTTATTTCC AGCATCGATA AGCCCGGTTC GCCCGCAGGC
GAGGCGAAAA AAGCGTTCCT TGATGGCCTG CATGGTCGTA CACTCGAGCA ACAGCGCCTG
ATGCGCGCCC GGATACTGGA TGTTACCGAA GCGGATCTCA AGCGCGTGGC CGAGACCTAT
CTCAACCCTC AAACGGCTTC TGTCGGCGTG CTTTCCGGCC CGACGAAAGA AGACGAGTTA
AAAGGGCTTG GTCTGCATAT TGAACGGATT TAA
 
Protein sequence
MSESIHHPAF TFVRSQPIAA LNLTVDEYRH NATGARHYHM ATDDPQNVFL VGLRTVPEDS 
TGVAHILEHT VLCGSERFPV RDPFFMMIRR SLNTFMNAFT ASDWTAYPFA SVNVKDFYNL
LDVYLDAVFF SRIDERDFRQ EGHRVEFTTP EDPSTPLTFK GVVFNEMKGA MSNPSSVLWQ
TLTSALFPTT TYHYNSGGEP VDIPNLTYAQ LKAFYQRFYH PSNAVFMTYG NLPVSDLQTQ
FEEKALKRFS RIDPNSAVPI EQALTSPRLI EENYALDEPD TAEKTHVVVG WLLGESTDLD
AALEAQLLEG VLLENSASPL LRVLETTDLG GSPSPILGLE DSQRQMVFVA GVEGSEPDRA
EAVEKLVLDT LAEIAEKGVP ADMIESVLHQ IELSQREVTG DGMPYGLQLI LHGLPAAIHD
GDPIAVLDLE PALARLRKKA ADNQFIPNLI RTLLLDNAHR VRVVLKPDTE LSAAKQAAEL
ARLAAMQSAM TDAEKQAVVE QAKALAERQA EVDDISILPT VTREDIPEHI DLPTPEKTLH
HPATSTWFNR STNGLVYLQA ALDLPQLTHD ELDLLPIYSG VLTELGAGDR DYLQMAEAVA
ARTGGFSARS SIRPDLNNAH NLSSFFLLGG KALVRHTDEL VELFHQHLNA ARFDETSRIR
DLISQIRFRS EQGIAGAGHV HAMNLASSGM SARAKLTHES GGVAGVRRIK AMDDALDETK
AINDVAERLA RLHDKLKGGL RQYNVIAEQR HFDAIQPVLE RAMQHGNAVE HFHLSKVHQP
VREAWIGNLA VNYCAKAHAA VPPMHEDAAA LAVLGGFLRN GYLHRAIREQ GGAYGGGAGY
DSESASFRFF SYRDPRLTDT LNDFDRAIDW LLDNTHDGRT VDEAIFGVIS SIDKPGSPAG
EAKKAFLDGL HGRTLEQQRL MRARILDVTE ADLKRVAETY LNPQTASVGV LSGPTKEDEL
KGLGLHIERI