Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_0002 |
Symbol | |
ID | 5709976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 9055 |
End bp | 11400 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641274505 |
Product | protease-like protein |
Protein accession | YP_001539846 |
Protein GI | 159040594 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTATA GATTCAGTAA AGGGGGTAAA ATAGCTGCAA TACTAGTGGC TGTAACAGTG GCTATAGTCA TCTCACTATA CGTGGTTAAC GCTCAATCAC CTGATCAACC AAACCCATAC TACCTGGGCT CAGAGGTGTT TTGCATAGAA TATGGAATTA TTGCACCTAA TGGGACCTTT ATACCGTTAC CCTACCCAAC AAGCCTAATA ACGTTCCTAT ACCTAAACAA CGCCACAGGC TTAGGTAACG TGCTTTATAG TGAGTACTAT AATCCCTCAA GCCCACTATA CCACAGGTTC ATATCAGCGG CTGAATTCGA TGAATGGTAC TCAGCGCCAG CAAGCGTCTA CGGTAACTTA ACGGCAATAT ACAGTTACTA TAACTTAACC ACGGAGGTTA AGTCAGCACC AATGTATGCT GCATTAGGTG TGCAGAGTAA TGCTTACAAC GCCTGCATAT CAATAATTAA CGCGACGATA GAGTACTTCA TTTACAATAA TGCTTCGTTT TCATGGGTTA GTTGGGTGTT GGTTACTGAG ACTAATCCAC AGGGCTTCTT CCTTGAAATA ACACCCGGTG AGTTTCAGCA GTTGCTTAAG ACCGTTGAAT CTATTAATAA TAACACTGGT GTAATCCAAC TACCGGTAAC TTATAAGGGT CAACCAGTCT TAATGAAGTA TGCCGTCTAC GTGAGGGGTA GCCACGGTTA CAGTGTTGGC CACGCCTTAG CCCTATACCT GGCTAGGGAG TTTCAGGTGC AGCCTAGCTA CAGTATGATT AAGCCCAGTG GCGTTGTCTC AAAGTCACCG CTGGTGATTA ACGGTAAGCC TGTTGCTGTT CAATTGGAGG CTCAAAGCGC CTTAGCCAAC TCAGCGTTAA ATAAGCCCAG TGAATTCATA CTGCAATTCC CAATAGAAGT ATACCTACCC CAGGGTATTG AGCTACTTTA CAATGCCACA CCACTGTACC CGCTTTGGTT CATTGGTGAT TACAACGGTT ACAACGGTTC ATCAGTTACC GTGGGTATAG TTGACGCCTT CGGTGACGCT GAGAGTCATT TAGTTAACGG ATTCTGCGGC TACGCCTTAT CACCGTACAA CGATATAATA GTTAGTGATG TTAATGCATT CTCATCACTC TTCGACCTAC CTCCAGCGAG TATTACAGTA ATATACCCGG CTGGTGAACC ATTCATCACT CCGTTTAACA GCGTTGATGC TTGCGGTTGG TCCTTTGAAT CAGTCCTCGA TAATGAGTGG GTTCACGCAA TAGCCCCAGG GGCCAGGATA GTCTTCGGGG TGTCCCCTGA TGCTGGGGAT GACTTATACG TTACCATTGA GTACATGGTT AATGAGAGCC TAGTTAACTT CATTAGCCTA AGCTGGGGTT TATCAGAGGA CTACCTTGAC CCATACTATG CCTTAGCCTA CGATCAAATA TTCATGCAGG CAGCGGCACA GGGCATTGGT GTATTCGCCT CCTCCGGTGA CTCTGGTGCC TACGAGTTCT ACCCATTCGT CTCAGCCTTC CACCCATCCA TTGACCCATG GGTCACTGGG GTTGGTGGAA CAACAAGCTA CCTGTTCCCA GGTGGATCAA GGTTCATTAC CGCGTGGAGC TTCTACAGCT TCGGCCTACC TCCATGGGAC TTAATATATT GGGGGAGTGG CGGTGGTTAC TCAATATTCT TCGATATGCC GCTCTACCAG TACCAGTACA TATTCAACCT AATAGGTGAG GGTAATTTCT ATGAGCAAAC CCAGTTCCAG CCATTAATAT GGGGTCTATT GCTTGGTCAA TTCTTCGTTA ATGAACCCTA CGTACCCACA CTCAACATTA ACCCATACAC GCCCCTCTAC AGGACCTTTG AGTGGATGCT TTATCCAAGC CTATACGTAC CCATTGGCGC TAAGGGTTAC CCAATAGTCT CAGCTGACGC TAATCCATAT ACAGGTGTGT TGATAGTGAT TGATGGTGAA CTTAACCCAT TCATATGGGG TGGCACTAGC CTGGCGTCAC CATTAACCAT GGGTATGGTT GCCCTATGGC AGGACTACTT GAATAAAGCC GGCATACCTT ACCAAGTAGG CTTAGCCGCA GTGCCATTAA GCCAAATATG GGCCACTGAG GCTGGTTCAA GCTTCTGCAA CGCCTACTAC CCAACATCAG TCTACGGCAC AAACACCCAC GGTGTCTTCT ACCCATCAAT ATATGGTCAA AACGGCGCCA CGGCTGTGAA TGGTTGGGTT ATTAAGAATC CATGCATCTG GAACCCTGTC AATGGTTTTG GTTCACTAGA CGTGGGTAAC CTGGTGTACT ACGGTACGCA ACTGCTTGAC AAGTAA
|
Protein sequence | MDYRFSKGGK IAAILVAVTV AIVISLYVVN AQSPDQPNPY YLGSEVFCIE YGIIAPNGTF IPLPYPTSLI TFLYLNNATG LGNVLYSEYY NPSSPLYHRF ISAAEFDEWY SAPASVYGNL TAIYSYYNLT TEVKSAPMYA ALGVQSNAYN ACISIINATI EYFIYNNASF SWVSWVLVTE TNPQGFFLEI TPGEFQQLLK TVESINNNTG VIQLPVTYKG QPVLMKYAVY VRGSHGYSVG HALALYLARE FQVQPSYSMI KPSGVVSKSP LVINGKPVAV QLEAQSALAN SALNKPSEFI LQFPIEVYLP QGIELLYNAT PLYPLWFIGD YNGYNGSSVT VGIVDAFGDA ESHLVNGFCG YALSPYNDII VSDVNAFSSL FDLPPASITV IYPAGEPFIT PFNSVDACGW SFESVLDNEW VHAIAPGARI VFGVSPDAGD DLYVTIEYMV NESLVNFISL SWGLSEDYLD PYYALAYDQI FMQAAAQGIG VFASSGDSGA YEFYPFVSAF HPSIDPWVTG VGGTTSYLFP GGSRFITAWS FYSFGLPPWD LIYWGSGGGY SIFFDMPLYQ YQYIFNLIGE GNFYEQTQFQ PLIWGLLLGQ FFVNEPYVPT LNINPYTPLY RTFEWMLYPS LYVPIGAKGY PIVSADANPY TGVLIVIDGE LNPFIWGGTS LASPLTMGMV ALWQDYLNKA GIPYQVGLAA VPLSQIWATE AGSSFCNAYY PTSVYGTNTH GVFYPSIYGQ NGATAVNGWV IKNPCIWNPV NGFGSLDVGN LVYYGTQLLD K
|
| |