Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1392 |
Symbol | |
ID | 8534548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1498865 |
End bp | 1500346 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 646383783 |
Product | protease Do |
Protein accession | YP_003263273 |
Protein GI | 261855990 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAATCCT GGGTACCCAT TACGGTGAAC GCATTGTTAT TTTCTCTTGT GGCTTTTCGC CGATTGTTCT GGTCATCCCT TGTCTTGGTT GTGGCTGTCT CGCTCACGGC TTGTGGTGAC CAAAAGTCCG AGGCGCAAGT CACCGGTCTA CCCGACTTTT CTGCATTGGT GGCGGCCAAT AATGCCTCCG TGGTCAATGT GAGTGCCATC GTCCCGTTGG CGGCCATGCC GGATACCTCA AATCAAGGTA GGAATGACAG CGAACTCAAC CAGTTTTTCC GACAGTTTTT TGGCTTTAAT GGTCCGGCGC CAGGTGGATC GACGCCTCAG GAACCCACGC CGCCGGTCGA GCCTGAATCT AGTAGCGGTT CGGGTTTCGT CCTGAGTCAG GATGGTGAAA TCGTAACCAA CGAGCATGTG ATCGATGGCG CATCGCAAAT TTACGTTCGG CTGGCGGATG GTCGTGAACT GAAAGCAAAG GTTCTCGGCA GCGATAAGGC CGGTGACATT GCGTTGCTCA AGATTGATGC CAAAGGGCTC AAGCCCGTCA AAATCGGTAA TTCCGATCAA GTCAAACCCG GGCAATGGGC GGTGGCGATC GGCTCGCCTT TCGGGTTTGA TCATTCCGTC ACGGCTGGTG TCGTCAGCGC CAAGGGACGC TCGCTGCCGG GAGATGACAA TCAGCGATAT GTGCCGTATC TGCAGAGCGA TGTTGCGATC AACCCGGGTA GTTCAGGTGG CCCACTGTTC AACGTCAAGG GTGAGGTCAT TGGCATCAAT GCGCAGATTC TCACGGAATC AGGTACTTAC AATGGCCTGT CGTTCTCCAT CCCGATCAAT TACGCGTTGC AGGTGGTGGA ACAATTAAAG CAGCACGGTA CGGTTGATCG TGGCTTCCTG GGGGTTCAGA TTCAGTCCCT GAACCGCGAG ATGGCACAAG CCATGGGTCT TGACCGTGCC AAGGGTGCAT TGGTGACCGG GTTCGTTTCA GGGTCGCCTG CCGAGCAATC CGCGCTGCAA CCGGGCGATA TCATCATTGC TGCCAATGGC CACCCGATTA CGGAATCTGC CGACTTGCCG CAAACCATCG GTGTGCTCCC ACCGGGAAGC GACGTGCGCC TTGAGGTGCT GACGAAAGGT AAGACGCACA ACATATCAAT AAAGCTGGCT GCGTTGCCTC AACATGCGCC GAGGCAGGTT CAGTCGATGA AATCTCATGA TCTGATCGTT GAGGATTTCG GCCTGCTTTT GACGGACGAT GGCGGCACGA TTCGAGTCAA GGCCGTCGAG CCAAACAGCC CGGCCGCGAG ATCGGGCCTG GCGGCTCAGG ACATTCTGCT CACGCTCAAT CAGCGCGCGT TGAATTCCTT GGATGCCGCA CAAAATGCTT TCGAGTCAGC GCGTAAAGAC AGGCCCAATG CCGTTCTTAT GCGCCGTGGT GATCAACAGC ATTACATTGC CCTCTCGCTT CAAGCCGACT GA
|
Protein sequence | MKSWVPITVN ALLFSLVAFR RLFWSSLVLV VAVSLTACGD QKSEAQVTGL PDFSALVAAN NASVVNVSAI VPLAAMPDTS NQGRNDSELN QFFRQFFGFN GPAPGGSTPQ EPTPPVEPES SSGSGFVLSQ DGEIVTNEHV IDGASQIYVR LADGRELKAK VLGSDKAGDI ALLKIDAKGL KPVKIGNSDQ VKPGQWAVAI GSPFGFDHSV TAGVVSAKGR SLPGDDNQRY VPYLQSDVAI NPGSSGGPLF NVKGEVIGIN AQILTESGTY NGLSFSIPIN YALQVVEQLK QHGTVDRGFL GVQIQSLNRE MAQAMGLDRA KGALVTGFVS GSPAEQSALQ PGDIIIAANG HPITESADLP QTIGVLPPGS DVRLEVLTKG KTHNISIKLA ALPQHAPRQV QSMKSHDLIV EDFGLLLTDD GGTIRVKAVE PNSPAARSGL AAQDILLTLN QRALNSLDAA QNAFESARKD RPNAVLMRRG DQQHYIALSL QAD
|
| |