Gene Information |
Locus tag | Hneap_0040 |
Symbol | |
ID | 8533153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 46091 |
End bp | 47545 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 646382419 |
Product | protease Do |
Protein accession | YP_003261953 |
Protein GI | 261854670 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| |
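The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 46091
end_bp = 47545

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so the protein has one
# residue fewer than the CDS has codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1455, matches "Gene Length"
print(remainder)       # 0, the CDS is a whole number of codons
print(protein_length)  # 484, matches "Protein Length"
```

Under these assumptions the listed gene length (1455 bp) and protein length (484 aa) agree with the start and end coordinates.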
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAACCC AGATTCTTGT TGAAGCCATT CGTGCAGGCT TGCCGCGCCG TGGACGGCTT TTGATGGCCG CCGCTCTGAT TGCCACACCG TTGATGGCGA TGACACCGGC GATCAGTTTT GCCGACAACG GCGTCCCCGA TTATGTGCAG TTGGTAAAAC AGGCCAGCCC GTGGGTGGTC AATATCAGCA GCGTGAGCAA TCCCAAAACC CAAGAAGCTT TTAACAACGG CGAAATGCCA ACCTTTCCGC CAGGACCTGC GGGCGATATG TTCCGGCATT TTTTCCAAGA ACAAATGCCG CAAATGAAGC GTGAACCGAT TCGCTCTCTG GGTTCGGGTT TTATTATTTC CGCCGATGGC TACATTCTTA CCAACGCGCA TGTAGTCAAC GGCGCGGACA AAATCACGGT GCGATTGCCC GATCAGCAAA CCTACAAGGC CAAAGTGATC GGCAAAGACA AACGCACCGA CATCGCGTTG CTGAAAATCG ATGCGAAAAA TCTGCCTGTT GCCCCCATTG GCAACTCGGA TAATATCCAA GTGGGCGAAT GGGTTCTTGC CATTGGCGAG CCTTTCGGGC TCGATCACAC CGCAACGCAC GGCATCGTGT CTGCCCTGGG CCGCGATTTG CCCGATGAGA GCTACGTGCC CTTCATTCAA ACCGATGCGC CCGTCAATCC GGGCAACTCG GGCGGTCCAT TGATCAATGC TAACGGCAAA GTCATCGGCA TCAATTCGCA GATTTATACG AAATCTGGCG GGTTTATGGG GATCTCGTTT GCGATTCCGA TCAATGTTGC CATGAACGTG GTCGATCAGA TCAAGTCTAC CGGTCATGTG ACGCGAGGCT ATTTGGGCGT GTTGATCCAG CCGGTCACCT ACGATTTGGC GCAATCGTTC GGTCTGGATA CCACCAAAGG CGCACTGGTG GCTAAGGTGG AGCCCAACAC ACCGGCGGCC AAAGCAGGTC TAAAATCGGG CGATATCATT CTCAAGTTTA ACGGCAGCGA GATCAAACAC TCCGGCGAAT TGCCCATCAT GGTTGGCATG TCGCCGATTG GCAAACCGGC CACCCTCACC TTGATGCGCG ATGGCAAGCA GATGGAGCTT AATGTCACCA TCGAAAAGCT CGACAAGAAA GCACTCGAAG CTGAATCGGG CACCAGTGAA GCCATTGAAA AAATGGGTCT TCAGGTCACC GAACTCTCCC CAGACGAACT GCAGCAGCTG AACATCAAAT ACGGCATAAA AGTGAAAAGC GTCAAAAATG ACTCGCAATT CGCATCGGTT ATTGCCCCCG GCGACATCCT GCTGGAAGTC AATCGGATGC CGATGAAGTC TGCGACCGAT CTGAAAAAAG CGCTCGACAG TGCGCCGAAG GACCGACCGA TTGCCATCCG GCTGTTGCGG GATGGTCAAC CGTTATTCAT GGCGGTTCAA CTCGGTACTC AGTAA
|
Protein sequence | MRTQILVEAI RAGLPRRGRL LMAAALIATP LMAMTPAISF ADNGVPDYVQ LVKQASPWVV NISSVSNPKT QEAFNNGEMP TFPPGPAGDM FRHFFQEQMP QMKREPIRSL GSGFIISADG YILTNAHVVN GADKITVRLP DQQTYKAKVI GKDKRTDIAL LKIDAKNLPV APIGNSDNIQ VGEWVLAIGE PFGLDHTATH GIVSALGRDL PDESYVPFIQ TDAPVNPGNS GGPLINANGK VIGINSQIYT KSGGFMGISF AIPINVAMNV VDQIKSTGHV TRGYLGVLIQ PVTYDLAQSF GLDTTKGALV AKVEPNTPAA KAGLKSGDII LKFNGSEIKH SGELPIMVGM SPIGKPATLT LMRDGKQMEL NVTIEKLDKK ALEAESGTSE AIEKMGLQVT ELSPDELQQL NIKYGIKVKS VKNDSQFASV IAPGDILLEV NRMPMKSATD LKKALDSAPK DRPIAIRLLR DGQPLFMAVQ LGTQ
|
| |
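The relationship between the gene sequence and the protein sequence above can be spot-checked by translating the first and last blocks of the gene sequence. This is a minimal sketch, not the pipeline that produced the record: it assumes the gene sequence is already given in coding orientation (it starts with ATG even though the strand is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which start codons it permits. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA string codon by codon ('*' marks a stop)."""
    dna = clean(dna)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last two blocks of the gene sequence, copied from the record.
head = "ATGCGAACCC AGATTCTTGT TGAAGCCATT"
tail = "CTCGGTACTC AGTAA"

print(translate(head))  # MRTQILVEAI
print(translate(tail))  # LGTQ*
```

The two printed fragments match the first block (MRTQILVEAI) and the final residues (LGTQ) of the protein sequence, followed by the TAA stop codon. Passing the full gene sequence to `gc_fraction` would allow a comparison against the listed GC content.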