Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2620 |
Symbol | |
ID | 5734498 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3361935 |
End bp | 3363203 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279760 |
Product | peptidase M16 domain-containing protein |
Protein accession | YP_001545386 |
Protein GI | 159899139 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0136175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCCTG TAAAAGTTGT CTTACCAAAT GGCCTGCGAA TTTATACCGA TGAAATGCCC CATACCCATT CAGTTTCGAT GGGTATTTTT ACCCAAGTTG GCTCGCGCTA TGAAAATGCT CGCCTGACGG GAATTTCACA TTTTTTGGAG CATATGTTTT TTAAGGGTAC TGCCAAATAC CCCACTGCCA AAGACCTTAG CGAGGCAATT GAGGGCATTG GTGGCTATAT CAACGCTACT ACCTCGTATG ATACAACCTG TTATTATTGT AAAGTTGCCA ATATTCATAC CGAACGCGGC ATCGATGTGT TAACTGATAT GCTCAACGCT GCCCTATTCG ACCCTAAAGA AATTGAAAAA GAACGCGGCG TGATTCAAGA AGAAATTAAA ATGTCGCTCG ATGTACCCGC TCAATGGGTG CATCAATTGC TCGACGAATT AATGTGGGGC GATCAGCCAC TTGGCCGTGA TATCGCTGGC ACGCTCGAAA GTGTTGGAGC CTTTAGCCGC GAAGATTTGT TGAATTACCG CGATCAGCAT TATGTTGCAG GTAATACGGT CATTTCGTTG GCTGGCAACT TTAATAGCAC CGAAATTGTT GATCGTCTGA CGAGCTTATT TAGCCATTAT CGGGTGCTTG ACGTGCCCAA ACCAATTACC ACCAATAGTT TTGGCACAGC TCCAGTTGTG CATCTTTTAA ATAAACCAAC CGAACAAACC AATTTTGTGT TGGGCCTCAA ATCGTTTGGC TATGGCGATA GCGATCGCTG GGCGCTCAGC GTGCTCGATA GCATCCTTGG TGGCGGTATG TCTTCGCGCT TGTTCCAAGA AATTCGCGAA GAACGCGGCT TGGCCTATAG CGTCGGCTCC TACACCGCCG AATACGATGA CGCTGGCAAA TGGATTGTGT ATGGCGGGGT TGAAGTCAGC AAGGCAGTCG ATGCAATTGC CGCAATTATC GAAGAACTGC GCAAATTGCG CGATCATGGG GTGACTGCCG CCGAGTTACA CCGCATCAAG GAGCAAGTTA AGGGCGGAAT GCTGCTTGGG CTGGAAGATA CTTGGTCGGT GGCCAATCGC AATGCTCGCC ACGAACTGCG CTACGGCGAG GTGATTCCGG TTGAGCAAAT TGTGGCTTGG ATCGAAGCGG TCACGCTCGA AGATATTCAG CGCGTGGCTC AACGCCTAAT TCGCCCAGAT AACTTATACT TAGCAATCAT CGGCCCGCAT GCCGAGGCTG CTGAATTTGA ACAAGCTATC ACGTTATAG
|
Protein sequence | MAPVKVVLPN GLRIYTDEMP HTHSVSMGIF TQVGSRYENA RLTGISHFLE HMFFKGTAKY PTAKDLSEAI EGIGGYINAT TSYDTTCYYC KVANIHTERG IDVLTDMLNA ALFDPKEIEK ERGVIQEEIK MSLDVPAQWV HQLLDELMWG DQPLGRDIAG TLESVGAFSR EDLLNYRDQH YVAGNTVISL AGNFNSTEIV DRLTSLFSHY RVLDVPKPIT TNSFGTAPVV HLLNKPTEQT NFVLGLKSFG YGDSDRWALS VLDSILGGGM SSRLFQEIRE ERGLAYSVGS YTAEYDDAGK WIVYGGVEVS KAVDAIAAII EELRKLRDHG VTAAELHRIK EQVKGGMLLG LEDTWSVANR NARHELRYGE VIPVEQIVAW IEAVTLEDIQ RVAQRLIRPD NLYLAIIGPH AEAAEFEQAI TL
|
| |