Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1478 |
Symbol | |
ID | 8534635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1595731 |
End bp | 1598643 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646383868 |
Product | Peptidase M16C associated domain protein |
Protein accession | YP_003263357 |
Protein GI | 261856074 |
COG category | [R] General function prediction only |
COG ID | [COG1026] Predicted Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAA GCATCCATCA CCCTGCGTTT ACCTTTGTTC GATCTCAGCC GATTGCGGCG TTGAATCTCA CGGTGGATGA ATACCGTCAC AATGCCACGG GCGCACGGCA TTATCATATG GCGACCGATG ATCCGCAGAA TGTCTTTTTG GTGGGCTTGC GCACGGTGCC TGAAGATTCG ACCGGGGTGG CGCATATTTT GGAGCACACC GTTTTGTGCG GCTCGGAGCG GTTCCCCGTG CGTGATCCGT TTTTCATGAT GATCCGCCGT TCGCTCAATA CCTTTATGAA CGCGTTTACC GCCAGCGATT GGACAGCGTA TCCGTTCGCT TCGGTCAATG TGAAAGATTT CTACAACCTG CTGGATGTGT ATCTTGATGC GGTGTTTTTC TCACGCATCG ATGAACGCGA TTTCCGCCAG GAAGGCCACC GCGTTGAATT CACCACGCCC GAAGATCCAT CCACACCGCT GACGTTCAAA GGCGTGGTGT TCAATGAAAT GAAGGGCGCC ATGAGCAACC CGTCGTCTGT GCTCTGGCAG ACGTTGACGA GCGCATTGTT CCCAACAACG ACCTATCACT ACAACTCCGG CGGCGAACCC GTTGATATTC CTAACCTGAC CTACGCGCAA TTAAAAGCGT TTTACCAGCG GTTCTATCAT CCATCGAACG CCGTATTCAT GACCTACGGC AATTTGCCTG TGAGCGATCT GCAAACGCAG TTCGAAGAAA AGGCATTGAA GCGCTTTTCG CGCATCGATC CAAATTCGGC CGTGCCCATC GAACAGGCGT TAACGTCCCC GCGCTTGATC GAAGAAAATT ATGCGCTCGA TGAACCCGAT ACCGCCGAGA AAACCCATGT GGTTGTCGGC TGGTTGCTGG GTGAAAGTAC CGATCTGGAT GCGGCACTGG AAGCCCAACT GCTCGAAGGC GTATTGCTTG AAAACAGCGC TTCTCCGCTC TTGCGCGTGT TGGAAACCAC TGATTTGGGC GGTTCGCCTT CGCCCATACT GGGGCTGGAA GACTCACAAC GGCAGATGGT GTTCGTGGCC GGTGTCGAGG GTAGCGAGCC GGATCGCGCC GAGGCCGTGG AGAAACTGGT ACTCGATACC TTGGCCGAGA TCGCCGAAAA GGGCGTGCCT GCGGATATGA TCGAGTCGGT GCTGCATCAG ATAGAACTCT CGCAGCGCGA AGTGACGGGC GATGGCATGC CCTATGGCCT GCAACTGATC CTGCATGGTC TGCCTGCTGC GATTCACGAT GGTGACCCGA TTGCGGTGCT CGATTTGGAG CCTGCGCTGG CGCGGTTGCG TAAGAAAGCC GCCGATAACC AGTTCATCCC GAATCTGATC CGCACATTGC TGCTCGATAA CGCCCACCGC GTGCGGGTTG TACTCAAGCC AGATACTGAG CTTTCTGCCG CCAAACAGGC CGCAGAGCTT GCCCGTCTGG CCGCTATGCA GTCTGCCATG ACGGATGCCG AGAAACAGGC GGTTGTCGAA CAAGCCAAGG CATTGGCCGA GCGGCAGGCC GAAGTCGATG ACATCAGTAT CCTGCCGACC GTGACGCGCG AGGATATCCC CGAACACATC GATTTGCCAA CGCCAGAAAA GACCCTCCAT CATCCGGCGA CATCAACCTG GTTTAATCGC AGCACCAATG GCTTGGTTTA CCTGCAAGCC GCGCTGGATT TGCCGCAACT GACCCACGAT GAGCTCGATC TGTTGCCGAT TTACAGCGGC GTATTGACCG AGCTGGGGGC AGGCGACCGT GATTATCTGC AAATGGCCGA GGCCGTTGCG GCGCGCACGG GCGGGTTTTC TGCCCGCTCA TCGATCCGCC CGGATCTTAA CAATGCACAT AACTTGAGTT CGTTCTTCCT GCTCGGCGGC AAGGCATTGG TTCGCCATAC CGATGAGCTT GTTGAGCTGT TCCATCAGCA CCTCAACGCG GCACGCTTCG ATGAAACCAG CCGTATTCGG GATCTGATCA GCCAGATACG TTTTCGCAGT GAACAAGGCA TCGCCGGCGC GGGTCATGTT CATGCCATGA ACCTAGCCTC CAGCGGGATG TCGGCTCGCG CCAAATTGAC CCATGAATCG GGTGGTGTGG CCGGTGTTCG GCGCATCAAG GCGATGGATG ACGCGCTCGA TGAAACCAAG GCAATCAACG ATGTAGCCGA GCGATTGGCG CGACTGCACG ATAAGCTCAA GGGCGGGTTG CGGCAATACA ATGTGATTGC CGAACAACGT CATTTCGATG CCATCCAACC GGTGTTGGAA CGAGCCATGC AGCATGGCAA TGCGGTTGAG CACTTCCACT TGTCCAAAGT GCATCAGCCG GTGCGTGAAG CCTGGATCGG CAACCTCGCG GTGAATTATT GCGCCAAGGC CCATGCCGCC GTGCCGCCAA TGCATGAAGA TGCCGCCGCG CTGGCGGTTC TAGGCGGCTT TTTGCGCAAC GGTTATCTGC ACCGCGCCAT TCGTGAGCAA GGCGGTGCGT ATGGCGGTGG CGCGGGTTAC GATTCTGAAT CGGCAAGTTT CCGCTTTTTC TCTTACCGCG ATCCGCGCTT GACCGACACG CTTAACGATT TTGATCGAGC CATTGATTGG CTGCTGGATA ATACACACGA TGGCCGCACG GTCGATGAAG CGATTTTCGG TGTTATTTCC AGCATCGATA AGCCCGGTTC GCCCGCAGGC GAGGCGAAAA AAGCGTTCCT TGATGGCCTG CATGGTCGTA CACTCGAGCA ACAGCGCCTG ATGCGCGCCC GGATACTGGA TGTTACCGAA GCGGATCTCA AGCGCGTGGC CGAGACCTAT CTCAACCCTC AAACGGCTTC TGTCGGCGTG CTTTCCGGCC CGACGAAAGA AGACGAGTTA AAAGGGCTTG GTCTGCATAT TGAACGGATT TAA
|
Protein sequence | MSESIHHPAF TFVRSQPIAA LNLTVDEYRH NATGARHYHM ATDDPQNVFL VGLRTVPEDS TGVAHILEHT VLCGSERFPV RDPFFMMIRR SLNTFMNAFT ASDWTAYPFA SVNVKDFYNL LDVYLDAVFF SRIDERDFRQ EGHRVEFTTP EDPSTPLTFK GVVFNEMKGA MSNPSSVLWQ TLTSALFPTT TYHYNSGGEP VDIPNLTYAQ LKAFYQRFYH PSNAVFMTYG NLPVSDLQTQ FEEKALKRFS RIDPNSAVPI EQALTSPRLI EENYALDEPD TAEKTHVVVG WLLGESTDLD AALEAQLLEG VLLENSASPL LRVLETTDLG GSPSPILGLE DSQRQMVFVA GVEGSEPDRA EAVEKLVLDT LAEIAEKGVP ADMIESVLHQ IELSQREVTG DGMPYGLQLI LHGLPAAIHD GDPIAVLDLE PALARLRKKA ADNQFIPNLI RTLLLDNAHR VRVVLKPDTE LSAAKQAAEL ARLAAMQSAM TDAEKQAVVE QAKALAERQA EVDDISILPT VTREDIPEHI DLPTPEKTLH HPATSTWFNR STNGLVYLQA ALDLPQLTHD ELDLLPIYSG VLTELGAGDR DYLQMAEAVA ARTGGFSARS SIRPDLNNAH NLSSFFLLGG KALVRHTDEL VELFHQHLNA ARFDETSRIR DLISQIRFRS EQGIAGAGHV HAMNLASSGM SARAKLTHES GGVAGVRRIK AMDDALDETK AINDVAERLA RLHDKLKGGL RQYNVIAEQR HFDAIQPVLE RAMQHGNAVE HFHLSKVHQP VREAWIGNLA VNYCAKAHAA VPPMHEDAAA LAVLGGFLRN GYLHRAIREQ GGAYGGGAGY DSESASFRFF SYRDPRLTDT LNDFDRAIDW LLDNTHDGRT VDEAIFGVIS SIDKPGSPAG EAKKAFLDGL HGRTLEQQRL MRARILDVTE ADLKRVAETY LNPQTASVGV LSGPTKEDEL KGLGLHIERI
|
| |