Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_0156 |
Symbol | |
ID | 8542535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 230939 |
End bp | 233515 |
Gene Length | 2577 bp |
Protein Length | 858 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646384952 |
Product | Trypsin-like protein serine protease typically periplasmic containing C-terminal PDZ domain-like protein |
Protein accession | YP_003264690 |
Protein GI | 262193481 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCGCG CGATGAATCT GCCCAAGCTC ACCGCCCAGC TCGAGCTGGC GGCGTCGTTC AGCGACCGTC AGGCGCTGGT CGAGGCCGCC GCCGCCGAGG CCTCGCCCGC CGAACTCAAA GAGATCGCCC GGCTGCTCAA TCACCGCGTG CTGGCCGTGC GCCTGGGCGC CATCGAGATC CTCGAGCGCG CCCGCTTCCG CCCGGTGCTG CCGATCCTGG CCGCGGCCAC CCGCCAGCGC CGCGGCGACG AGCGCGTGCT GGCCGCGCGC GCGGTGTCGC GCCTGGCCCA GCGCGAGGAC CGCGAGACCC TCGAACCGCT GGCGCGCGCG TGGCTCGGCG GCGCAGACAA GTATCTGCAC GGCCACGGCC AGGCGCTGCT CGCGGCCCTC GGTGTCGGCG GCGACGCCTC GCCCGCCAGC GCCTCGCCCG CCAGCGCCTC GCCCGGCGCG TCTGCCGATC GGCCGGGCGC CGCGGCCAGC GCCACCAACC ACGACCTCGC CGGCATCACC GCGCCCGATC GCGTGCAGCG CGCGCGCGCC CTGTCGCTGC TGCTGGCGCG ATCGGGCGCT GGCGCGGCGC TGGTGCAGGG CCTGCTCGCC AGCAAGCACG CGGGCGTGCG CGTCGACCTT TTGCAGGCGC TGGCCAGCCT CGGCGGCCAG GCGCTGGTGG CCGCGGCCCC GCACCTGCTC GAGCGCGGCG ATAGCGACGT CGTCGCCCTG ATCGGCCGCG CTCTGCTCGG CCACCTGGGC GAACTCGCCG AGGCGCCGCA CACGCGCTTG CGCACCGCGA TCGAGCGCGC CCAGCGGCGG CTGAGCGGCG ACGAGCTGGC GATCTCGGCG CTGCACGAGT GCCTGCTCGA GCTCGAGGTC GAGAGCGCGA TCGACACCTT GGCCGCCCAG GTCGACACCC TGGCCGTGGA CACGGTGCAG CGCATCGCGA CGCACATCGC CGCGCTGCCG CCCGAGCGCC GGCGGCCGCT GCTGCCCAAG CTGCTGGCGG CCTTCGAGTA TGCGCCCCGG CGCGCGCTCT TGTTCGCGGA CTCGCTGCAC GCGGCCTGGC CCGAGCTGCG GCCGCCGCGG CGGGCCGCGC TGCGCGAGAT CCTGAGCAGC GCCGCCGGCA GCGCCCTGCC GCGCGGGCTG CCCGACGCCT CGCTGCGCGC CATCGGCCAG CTCTACGCGC TCGCCCTCGA GCCGGGCGCG CGACCGCCCG AGGCCGTGCT CCTGGCCCTC GACCGCAGCG ACGACGCCGC GGTGCTGAGC ACGGCCGTCG CCATCCACGA GACGCTGGCC ACCGAGGCCG CGGCCCGGCG CCTGGCCGCG TACCTCGACG AGCCCGACGA GGCCGTGCGC GCGGCCGCCC ATCAGGCGCT GCTGCGCCTG TCCGAGGACG CCGAGGCGCC CTACCGCGTG CGCTTCCGCG ACGACGGCAA AGCCGAGATC GCGCCCGACT ACCGCACGCC CGAGGGCGAA GCGCTGCGCG CCGAGCACGG CAGCCTGCGC AGCGACAGCG GCGAGGCCTA TCTGCTCGAC GCCCGCGGCC GCCCGGTGGC CGAGCGCGAC ACCGAATGGC GCGGCTGCCG CTGCTGCGAG CGGCCGCGCG TGCTGGTGCG CGAGGGCGAC GAGCGGCCCA CCTGTCCGGT GACCGGCGAG GCCCATCTGC TCGAGGACGG CCGCACCGTG CTGGAGCGCG ATCATCCCCT GGGCGGCTGC GACGAGTGCG AGTCGGTGGC GCCGCTGCTG CGCCACGGCA ACGAGGTCCG CTGCGAGACC TGCGAGACCA CCCACGTGCA GCGCCGCGGC CGCTATCGCG CGCAGCGGCG ACCGCGCGAG CACGCGGAGA TCGCCGAGCC GCTGCCGCAG GATCAGCCGC TGGCGCCGCA GCCGCAGACC CTGCCGACGC CGCCGAGCGC GAGCGATCTC GAGCTGGTCG AGCCGGCCAT CGCCCGCGCC ATGGCGGCCA ACGTGTTCGT GCTGGGCAGC GGCCTGGAGC AGAGCTGGAC CGGCTCGGGC GTGATCGTCG CCCGCGACGG CAACGAGCTG GCCATCCTCA CCAATCGCCA CGTGGTCGAG GACGTCGACG CTGGCGGGCG CTCGCATGTC GCCTCGGTGC GCGTCTACAC CATCTCGGGC GAGCTCAAGC GCGCGCGCGT GGTCTGGCGG GCTGAGCGCG GCATGGACCT GGCGCTGCTC AGCATCCGCA TCGACGAGCC CGAGCGCGTC ACCGTCACCG AGCTGCAAGC GGGCCCCTGC GTGGTCGGCG AGCGGCTGTT CTCGATCGGC AATCCGCTGG GGCTGAGCTG GAGCTACGCC TCGGGCACGC TGGCCGCGTT CCGCACGTTC ACTTCGAAGG CCGGCCTCGA GGTGCGCTTC GTGCACTCAC ACGTGAACAT GTCGCACGGC TCGAGCGGTG GCGGGCTGTA TCACGAGAAG GGACACCTGG TCGGTATCAA CTCATTCATC GGCGGCACCG CGGTCGGCCC GGGCGGCGTG CAGAGCTTCT CGATCGCCAT GCCCTCGGCG CTCGATGCCC TGCGCCGCGA AGGCGTGCGC TTCGCGGGCA AGCCGCTGGT GCCCTGA
|
Protein sequence | MIRAMNLPKL TAQLELAASF SDRQALVEAA AAEASPAELK EIARLLNHRV LAVRLGAIEI LERARFRPVL PILAAATRQR RGDERVLAAR AVSRLAQRED RETLEPLARA WLGGADKYLH GHGQALLAAL GVGGDASPAS ASPASASPGA SADRPGAAAS ATNHDLAGIT APDRVQRARA LSLLLARSGA GAALVQGLLA SKHAGVRVDL LQALASLGGQ ALVAAAPHLL ERGDSDVVAL IGRALLGHLG ELAEAPHTRL RTAIERAQRR LSGDELAISA LHECLLELEV ESAIDTLAAQ VDTLAVDTVQ RIATHIAALP PERRRPLLPK LLAAFEYAPR RALLFADSLH AAWPELRPPR RAALREILSS AAGSALPRGL PDASLRAIGQ LYALALEPGA RPPEAVLLAL DRSDDAAVLS TAVAIHETLA TEAAARRLAA YLDEPDEAVR AAAHQALLRL SEDAEAPYRV RFRDDGKAEI APDYRTPEGE ALRAEHGSLR SDSGEAYLLD ARGRPVAERD TEWRGCRCCE RPRVLVREGD ERPTCPVTGE AHLLEDGRTV LERDHPLGGC DECESVAPLL RHGNEVRCET CETTHVQRRG RYRAQRRPRE HAEIAEPLPQ DQPLAPQPQT LPTPPSASDL ELVEPAIARA MAANVFVLGS GLEQSWTGSG VIVARDGNEL AILTNRHVVE DVDAGGRSHV ASVRVYTISG ELKRARVVWR AERGMDLALL SIRIDEPERV TVTELQAGPC VVGERLFSIG NPLGLSWSYA SGTLAAFRTF TSKAGLEVRF VHSHVNMSHG SSGGGLYHEK GHLVGINSFI GGTAVGPGGV QSFSIAMPSA LDALRREGVR FAGKPLVP
|
| |