Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_0845 |
Symbol | |
ID | 5709372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 888487 |
End bp | 889743 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641275348 |
Product | peptidase M50 |
Protein accession | YP_001540670 |
Protein GI | 159041418 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.450162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATAT CACCATTAGT GTGGTTCGTG GTTGCTTGGT TCGTCTTCAT TGGGTTAGTT AAGCTTTTAC TGGGGAATAG GGTTAGGATT TACTACTACA TTGCTTTAAT GGCTAGGAGC AGTGGTGTTG AGAGGGTTCT TAGGCCCCTG GCTGATTTAA TAGAGGGTGT TCCTGCAATA GTCATATTCG TGATTGTTGT GGTATTCTTC GCATTAGCCA TGGTTTACGC GGTACCAGTC CTATTGCCTA TGCCTATTCA GGTTATAGGT GAATTAGTGG GCTTTGTACC ATCATTCATT AGAATACTGG GTATTAACAT AGCCGCAACT GTAAGCGTCC TAACGAGCCT ACACACTGTA TCAGGTCAAC AGGCGGCAAC TCAATTGGTG CAGAGTAGGT TCACACCAGC CACACCGTTA ATACCTGGTG TCACAATTAG CGTTAACGTG TTCATAATAG TGTTAATAGC CATAGGCATA AGTATACTTG TTCACGAAGT CTCCCACGGC ATAGTTGCAC TTAGGTATGG GGGTAGGATA AAGTCAGGTG GCGTATTCCT ATCACTCTTC ATACTCTACG GTGGTTTCGT TGAAGTTGAT GAGGTTGACC TTAGGAAGAG GGCTGGGTTA AGGGGTGTGT TAGCCATGTT ATCCGCAGGG GTTTTCGCCA ACATGATCCT ATCAATAATT GCCATAGGCC TAATGTACCT AGCCCTAATA CCGGCGCTTC AACCATACTT ATCAGGAATA GTCATAACGA GTATTGTTAA GGATTCACCA GCCTTCTACG CCAACATACC GGCCAATAGC CTACTCCTCG CAGTAAATGG GAAGCCTATA ATATCATCAC TGACTTTACT TTACGTGCTT GAGGGCCTTA AGCCAGGTAG TCAAGTCACC TTAACGATAC TTCACTCAGG CATAATACAC ACCTACACGA TAGTCACCTC CAGTAACCCA AGTGACCCAA GCCTGCCCTT CATAGGCATT ACCATTGGTG ACAGGTTATT TTACCAATTC ATATACTGGC TATGGACAAT AAACGTGGTT ATAATACTAC TTAACACCAT GCCCGCGTGG CCTCTTGATG GTGGCCAATT CCTATACCAC ATACTCTTAA GCATACCTGG ATTTAAGGAG GATTGGGCTA ATTGGATTAT GTACATCATT AGCGCTGCCT TATGGGTCCT CTTCGTATTC ACACTACTAG TAAGCTTATC ATCAGGTCTC TGGAGGATTG CGGTGACCCC ACCGTGA
|
Protein sequence | MNISPLVWFV VAWFVFIGLV KLLLGNRVRI YYYIALMARS SGVERVLRPL ADLIEGVPAI VIFVIVVVFF ALAMVYAVPV LLPMPIQVIG ELVGFVPSFI RILGINIAAT VSVLTSLHTV SGQQAATQLV QSRFTPATPL IPGVTISVNV FIIVLIAIGI SILVHEVSHG IVALRYGGRI KSGGVFLSLF ILYGGFVEVD EVDLRKRAGL RGVLAMLSAG VFANMILSII AIGLMYLALI PALQPYLSGI VITSIVKDSP AFYANIPANS LLLAVNGKPI ISSLTLLYVL EGLKPGSQVT LTILHSGIIH TYTIVTSSNP SDPSLPFIGI TIGDRLFYQF IYWLWTINVV IILLNTMPAW PLDGGQFLYH ILLSIPGFKE DWANWIMYII SAALWVLFVF TLLVSLSSGL WRIAVTPP
|
| |