Gene Cmaq_0002 details

Gene Information

Locus tag: Cmaq_0002
Symbol:
ID: 5709976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 9055
End bp: 11400
Gene Length: 2346 bp
Protein Length: 781 aa
Translation table: 11
GC content: 47%
IMG OID: 641274505
Product: protease-like protein
Protein accession: YP_001539846
Protein GI: 159040594
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG4934] Predicted protease
TIGRFAM ID:
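
The coordinate and length fields above agree with each other, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(9055, 11400)
print(length, protein_length(length))  # 2346 781
```

With the values listed for Cmaq_0002, the span 9055 to 11400 gives 2346 bp, and 2346 / 3 = 782 codons, one of which is the stop codon, leaving 781 residues.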


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTATA GATTCAGTAA AGGGGGTAAA ATAGCTGCAA TACTAGTGGC TGTAACAGTG 
GCTATAGTCA TCTCACTATA CGTGGTTAAC GCTCAATCAC CTGATCAACC AAACCCATAC
TACCTGGGCT CAGAGGTGTT TTGCATAGAA TATGGAATTA TTGCACCTAA TGGGACCTTT
ATACCGTTAC CCTACCCAAC AAGCCTAATA ACGTTCCTAT ACCTAAACAA CGCCACAGGC
TTAGGTAACG TGCTTTATAG TGAGTACTAT AATCCCTCAA GCCCACTATA CCACAGGTTC
ATATCAGCGG CTGAATTCGA TGAATGGTAC TCAGCGCCAG CAAGCGTCTA CGGTAACTTA
ACGGCAATAT ACAGTTACTA TAACTTAACC ACGGAGGTTA AGTCAGCACC AATGTATGCT
GCATTAGGTG TGCAGAGTAA TGCTTACAAC GCCTGCATAT CAATAATTAA CGCGACGATA
GAGTACTTCA TTTACAATAA TGCTTCGTTT TCATGGGTTA GTTGGGTGTT GGTTACTGAG
ACTAATCCAC AGGGCTTCTT CCTTGAAATA ACACCCGGTG AGTTTCAGCA GTTGCTTAAG
ACCGTTGAAT CTATTAATAA TAACACTGGT GTAATCCAAC TACCGGTAAC TTATAAGGGT
CAACCAGTCT TAATGAAGTA TGCCGTCTAC GTGAGGGGTA GCCACGGTTA CAGTGTTGGC
CACGCCTTAG CCCTATACCT GGCTAGGGAG TTTCAGGTGC AGCCTAGCTA CAGTATGATT
AAGCCCAGTG GCGTTGTCTC AAAGTCACCG CTGGTGATTA ACGGTAAGCC TGTTGCTGTT
CAATTGGAGG CTCAAAGCGC CTTAGCCAAC TCAGCGTTAA ATAAGCCCAG TGAATTCATA
CTGCAATTCC CAATAGAAGT ATACCTACCC CAGGGTATTG AGCTACTTTA CAATGCCACA
CCACTGTACC CGCTTTGGTT CATTGGTGAT TACAACGGTT ACAACGGTTC ATCAGTTACC
GTGGGTATAG TTGACGCCTT CGGTGACGCT GAGAGTCATT TAGTTAACGG ATTCTGCGGC
TACGCCTTAT CACCGTACAA CGATATAATA GTTAGTGATG TTAATGCATT CTCATCACTC
TTCGACCTAC CTCCAGCGAG TATTACAGTA ATATACCCGG CTGGTGAACC ATTCATCACT
CCGTTTAACA GCGTTGATGC TTGCGGTTGG TCCTTTGAAT CAGTCCTCGA TAATGAGTGG
GTTCACGCAA TAGCCCCAGG GGCCAGGATA GTCTTCGGGG TGTCCCCTGA TGCTGGGGAT
GACTTATACG TTACCATTGA GTACATGGTT AATGAGAGCC TAGTTAACTT CATTAGCCTA
AGCTGGGGTT TATCAGAGGA CTACCTTGAC CCATACTATG CCTTAGCCTA CGATCAAATA
TTCATGCAGG CAGCGGCACA GGGCATTGGT GTATTCGCCT CCTCCGGTGA CTCTGGTGCC
TACGAGTTCT ACCCATTCGT CTCAGCCTTC CACCCATCCA TTGACCCATG GGTCACTGGG
GTTGGTGGAA CAACAAGCTA CCTGTTCCCA GGTGGATCAA GGTTCATTAC CGCGTGGAGC
TTCTACAGCT TCGGCCTACC TCCATGGGAC TTAATATATT GGGGGAGTGG CGGTGGTTAC
TCAATATTCT TCGATATGCC GCTCTACCAG TACCAGTACA TATTCAACCT AATAGGTGAG
GGTAATTTCT ATGAGCAAAC CCAGTTCCAG CCATTAATAT GGGGTCTATT GCTTGGTCAA
TTCTTCGTTA ATGAACCCTA CGTACCCACA CTCAACATTA ACCCATACAC GCCCCTCTAC
AGGACCTTTG AGTGGATGCT TTATCCAAGC CTATACGTAC CCATTGGCGC TAAGGGTTAC
CCAATAGTCT CAGCTGACGC TAATCCATAT ACAGGTGTGT TGATAGTGAT TGATGGTGAA
CTTAACCCAT TCATATGGGG TGGCACTAGC CTGGCGTCAC CATTAACCAT GGGTATGGTT
GCCCTATGGC AGGACTACTT GAATAAAGCC GGCATACCTT ACCAAGTAGG CTTAGCCGCA
GTGCCATTAA GCCAAATATG GGCCACTGAG GCTGGTTCAA GCTTCTGCAA CGCCTACTAC
CCAACATCAG TCTACGGCAC AAACACCCAC GGTGTCTTCT ACCCATCAAT ATATGGTCAA
AACGGCGCCA CGGCTGTGAA TGGTTGGGTT ATTAAGAATC CATGCATCTG GAACCCTGTC
AATGGTTTTG GTTCACTAGA CGTGGGTAAC CTGGTGTACT ACGGTACGCA ACTGCTTGAC
AAGTAA
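
The gene sequence is displayed in space-separated blocks of 10 bases, 60 bases per line. A minimal sketch of how such a block-formatted sequence could be cleaned and its GC content computed is shown here. The helper names are hypothetical, and GC content is taken to mean the percentage of G and C bases over the full sequence length, which is an assumption about how the page derives its figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed on the page.
first_line = "ATGGATTATA GATTCAGTAA AGGGGGTAAA ATAGCTGCAA TACTAGTGGC TGTAACAGTG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Passing the whole gene sequence through `clean_sequence` and then `gc_percent` would give the figure to compare against the GC content field.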
 
Protein sequence
MDYRFSKGGK IAAILVAVTV AIVISLYVVN AQSPDQPNPY YLGSEVFCIE YGIIAPNGTF 
IPLPYPTSLI TFLYLNNATG LGNVLYSEYY NPSSPLYHRF ISAAEFDEWY SAPASVYGNL
TAIYSYYNLT TEVKSAPMYA ALGVQSNAYN ACISIINATI EYFIYNNASF SWVSWVLVTE
TNPQGFFLEI TPGEFQQLLK TVESINNNTG VIQLPVTYKG QPVLMKYAVY VRGSHGYSVG
HALALYLARE FQVQPSYSMI KPSGVVSKSP LVINGKPVAV QLEAQSALAN SALNKPSEFI
LQFPIEVYLP QGIELLYNAT PLYPLWFIGD YNGYNGSSVT VGIVDAFGDA ESHLVNGFCG
YALSPYNDII VSDVNAFSSL FDLPPASITV IYPAGEPFIT PFNSVDACGW SFESVLDNEW
VHAIAPGARI VFGVSPDAGD DLYVTIEYMV NESLVNFISL SWGLSEDYLD PYYALAYDQI
FMQAAAQGIG VFASSGDSGA YEFYPFVSAF HPSIDPWVTG VGGTTSYLFP GGSRFITAWS
FYSFGLPPWD LIYWGSGGGY SIFFDMPLYQ YQYIFNLIGE GNFYEQTQFQ PLIWGLLLGQ
FFVNEPYVPT LNINPYTPLY RTFEWMLYPS LYVPIGAKGY PIVSADANPY TGVLIVIDGE
LNPFIWGGTS LASPLTMGMV ALWQDYLNKA GIPYQVGLAA VPLSQIWATE AGSSFCNAYY
PTSVYGTNTH GVFYPSIYGQ NGATAVNGWV IKNPCIWNPV NGFGSLDVGN LVYYGTQLLD
K
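
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the standard codon assignments, which translation table 11 shares for elongation codons. It does not model the alternative start codons that table 11 allows, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, and the input is assumed to contain only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGGATTATA GATTCAGTAA AGGGGGTAAA"))  # MDYRFSKGGK
# Final 15 bases, ending in the TAA stop codon.
print(translate("CTGCTTGACAAGTAA"))                   # LLDK
```

The two outputs match the first ten and last four residues of the protein sequence listed above.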