Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1707 |
Symbol | |
ID | 8534865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 1834665 |
End bp | 1836878 |
Gene Length | 2214 bp |
Protein Length | 737 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646384091 |
Product | peptidase S8 and S53 subtilisin kexin sedolisin |
Protein accession | YP_003263579 |
Protein GI | 261856296 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTAC GCCACACTCT CTATACAAAA TGCGCACTTG GCCTGCTGTG TGCGCTCGCC CTCAACCTGC CCGCCCAGGC CGGCCTGCCC GTTTCAACAA CGAATATCGC ACTTCCGGCA CCCTCTGCCG CCAGCTCATA CAAGCCCGAC ACAGTGGTCC CCAACGAGGT TCTGGTTAAA TTCGGCAGCA CGCTATCGGC ACAGAACATG ACCCCCACGC TGAATAGCAT GGCGAGCAGC GTCAAGCAGA TCAACCGTTC GGGTTTGACC TTGGTCAAAC TCACGCCCGG CACCGAAACG ATTTCATCGG CGATGGCCAC GTTACGCGCC ATGCCGGGTG TGATTTCCGT CCAACCGAAC TTCATTTACC ACAGCACAGC CCTTCCCAAC GATCCCGAGA TTGGCCAACA GTGGGCGCTG AAAAATATAG GGCAGACTGT CTCGAACGCG ACCTATGCCA CCAGTAATCC GGGCACACCT GGGGATGACA TTGATGCAGA AAGCGCCTGG CAATACCAAT CCGACTGCTC GTCGGTCACG GTCGCCGTCG TCGATACAGG TATCAACTAC ACCCAGCAGG ATCTGGTCAA CAGCATGTGG AATGGTGGGA CTCAATACCC ACACCACGGC TATGATTTTG TCGATAACGA CAATGATCCC TATCCCACAA CCGGTGATGA ACTCCATGGC ACCCATGTGG CAGGCATTAT CGGTGCTGAG GGCAACAATG GGATTGAAGG CTCCGGCGTC TGCCAGAAAG CCAGCATCAT GTCCGTGCGC TCGCTGGACT CATCAGGCGG CAGCACAGCA AGTGTAGTTC AAGGCGTTTA CTTTGCCATT GATCATGGCG CTCGAATCAT CAATATGAGT CTGGGTGGCA GCGGCGGTTT CGACCAAGAC TTCTCCGACG CAATCAGCTA CGCCCAAAGC AAGGGGGCGC TGGTCGTTGT TGCTGCAGGC AATGGCGATG CAAATGGAAA TGGCGTTGAC GTCGATCAAA CCCCTTTTTA TCCCTGCGCT TTCCCGCAAG ATAACCTGAT CTGCGTCGCT GCGCTGGATC AATCGTTCCA GCTCGCCAGC TTTTCTAATT ACGGCGCCAC CAGCGTGGAT GTGGGTGCGC CGGGAACGAA TATTCTCAGC ACGTTTGCAG GACCGACTCT GACAACGGAT TTTTCCAGCG GTTGGACGGC CAGCCTGGGT ACCAGCACGG GCTGGGGTTA CGGGAAAACC ACATCCGGTA TCCCGATACT TGTTAATCCG GTTGATTACG GCAAGAGCAA TTATGCACCG AGTACAGATG ATCGTATCTG GACCGACTTT ACGTTTGCAC CCGGTACACA ACACGTTGCT TTAAATTATT ACCTGCAAGG GCGCATGGCT ACGGGTGATT ATCTCAATTC TGGCGTCGCC GTCGGAAGTA ATACGGATCC GTTTGGAAGC AGTGGCACCC AACTCCAACA CGAAACGGAC ACCCTCAGCT CCCCAGCGGC TGCGTACCCG ATTGATCAGT GTGCTGGCAA AACGTGTTCG ATCGGCTTCC AGTTGACCAG CACGCCCGTA AGTGCAGGCG ACACCGGCCC GCTGATCGCT TTTTTCGAGC TCAATACCGT TGCAACCAAC ACCCATGAGA TGGGTATTGA GAACGGCACA TCAATGGCGG CGCCCGTCGT CTCCGGCATT GCCGCCTTGT TGATGGCCTC CGACCCGGCC GCCAGCGATA TCGACGTCGT CCAGGCAATC AAAAACTCAG GCATACCGGT GCCTACCCTT TCGGGTGTCA CCACAACGGG CAAGGCTGTC AATGCCATGC GCGCGCTTGC CAATCTGCAC CTGAGTGTTA CTGGCCTTGC TGACCAAACG GGTACTGCCG GACAACCGCT TTCTGTCACA TTCTCCATCA GCGGCTTGAA CGCGCTAGCT GTTTCCGCGA GCAGTAGCAA TACTTCTGTA TTGGCCAACA CCGCAATTAC AGGTCAAAAC AGTTGCACAC AGACAGGGGG CTGTACGCTC CAGCTTCTCC CGGCAATGGG TGGTACTTCT ACGATCTATG TGACCGTAAG CGACACATTC GGCCAACAGT CGACGGGCAG TTTCCTGCTG ACAGTTCCCT CTTCCGGTGG TGGTGGCGGC GGCAGCATGA ACTGGACTTT CCTGCTGGTA CTGGCCGTCA TTCTGGCTGC CGGTCAATGG CGCAGGCGAG GGTTGGAATC ATGA
|
Protein sequence | MTLRHTLYTK CALGLLCALA LNLPAQAGLP VSTTNIALPA PSAASSYKPD TVVPNEVLVK FGSTLSAQNM TPTLNSMASS VKQINRSGLT LVKLTPGTET ISSAMATLRA MPGVISVQPN FIYHSTALPN DPEIGQQWAL KNIGQTVSNA TYATSNPGTP GDDIDAESAW QYQSDCSSVT VAVVDTGINY TQQDLVNSMW NGGTQYPHHG YDFVDNDNDP YPTTGDELHG THVAGIIGAE GNNGIEGSGV CQKASIMSVR SLDSSGGSTA SVVQGVYFAI DHGARIINMS LGGSGGFDQD FSDAISYAQS KGALVVVAAG NGDANGNGVD VDQTPFYPCA FPQDNLICVA ALDQSFQLAS FSNYGATSVD VGAPGTNILS TFAGPTLTTD FSSGWTASLG TSTGWGYGKT TSGIPILVNP VDYGKSNYAP STDDRIWTDF TFAPGTQHVA LNYYLQGRMA TGDYLNSGVA VGSNTDPFGS SGTQLQHETD TLSSPAAAYP IDQCAGKTCS IGFQLTSTPV SAGDTGPLIA FFELNTVATN THEMGIENGT SMAAPVVSGI AALLMASDPA ASDIDVVQAI KNSGIPVPTL SGVTTTGKAV NAMRALANLH LSVTGLADQT GTAGQPLSVT FSISGLNALA VSASSSNTSV LANTAITGQN SCTQTGGCTL QLLPAMGGTS TIYVTVSDTF GQQSTGSFLL TVPSSGGGGG GSMNWTFLLV LAVILAAGQW RRRGLES
|
| |