Gene Cmaq_0963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0963 
Symbol 
ID5708565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1010222 
End bp1012144 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content43% 
IMG OID641275464 
Productpeptidase S16 lon domain-containing protein 
Protein accessionYP_001540785 
Protein GI159041533 
COG category[R] General function prediction only 
COG ID[COG1750] Archaeal serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGATTA GTAGGCTACT TCCAATATTG GCGTTAGCCA TAGCGGTATC GGTGCTGGCT 
TATTCATGGA GCATTGGATC AATATCCACA AATAGTATAA CTATCCATGC GTTAGCCGTC
TCAGGTTCAA ACCAGGGTGC TGTAATAAAC ATAACGATAA CTGCGGTTAA GGGTTTACCA
AGTTATCTAG GTGGTAATGT TTACGTATCC GCTATGCCAT TACCCATAGG GGAGGGTGGT
ACCTTCATTT CATCAAGTCA AATAGCCGCC TTTGTGGCTA CAACAATTGC TGGTCAATCG
TTAACAAGCT ACAACTTCCT TATTAACGTT AACTCCAGTA CAATAGAGAT AGGTGGTCCA
TCAGCCAGTG GTTACATGAC CGTTGGCATG TACTCCCTCA TAACTAACAG TAGCCTAAAT
CCAAGTGTGG TTATGACTGG TATGATTATG CCTGATGGAA CAATAGGCCC AGTGGGTGGG
ATACCTGATA AGATTAGGGC TGCAGCCCAA TTAGGTTACT CAACGGTACT GATACCCTAC
GGCCAGCAGA ACTACGTATC CTCAAGTGGC CAGGTAATTA ACCTTATAGA GTTAGGTAAG
CAGCTTGGGG TTAATGTTAT CCCAGTGGCC ACGGTGTACC AGGCGATACA GTACTTCACA
GGCCGTAGCT TTAACCTATC CCTTCAAGTA ACACCCCAGG TGAGTGGTAA TATTTCAGCC
ATTAGTGCCT ACCTCTATAA GACGCTTTAC GTTAATTATG GTAATGAATC AGCATTAGGT
AGTCAGGCTG ATTATGAGGC TGCAGTGAGC AATGCCAATA ACGGCGACTA CTATACTGCA
GCAAGCTTAC TCTACGATGC CCTAATAAAC TACTACACTA ACTTATTCAG GAACGCGACG
TTGGCTTACG CTAAGGCTCT TGTGGCGAAT GTAACTGCTC AATTGAATCA AATGACTAAT
GAAATTAATA GCATTCAACC CACTACAGCC AACTTGGACA TAATGGTGGG AATCTATGAT
AGGATATACA CTGCTCAATC ATTGTTGAAT ACCACTATTA GTGATATTCA GGCTGATAAT
ACAGCTAACA TACCCGGTGA TTTAGCTCAA TTATACGTTA GGGTTATTAC GCTGAAGTAT
TGGTTTAATG TGCTTAACGC CATTAATGGC GGTAACCCAA TACCCATCTC TTATTTAAGT
AAGTTAAGTG GATTATACAC ATCATACGCC TATACAACGG TAACTTACTT ATACTCACTA
GCCAGCGCTG AGGGGTTGAC TACAAGCTTA GGGGTCATTA ATACGATTAA CCAGCTCATT
AATCAAACTA ATGAGGCACA GTCCTACTAC CAGAATGGCC TTTACCTGGA GGCTTTAGCT
GTGAGCCTTG ATACAATAGC CTCAGCTAGC GCCATAATTC ACACAATGTT CCTGATTGGG
GGGAGTAATG CATCATTATA CTTACTCAAC GTTGTTAGGA ACGTTGCAAC GTATAATGAG
GCCCTAGTAT CCCAGTGTAA TGGTTTACCA GTTCTATCCA GTGCCTATGT GCAATTCGGC
AATTATTGGT TCAGTCAATA TAATAGTAGT CTTGCTGGTA ACCCATCGTC AACGTCAAGT
CAAAGTTACT TTGAGGAGTC TTTATCGCTA TACGAGGAGT CAATAGCGTA TTCATTATTC
CTGAGGCAAC TTTACTATGA GTTAGGAGCA TGCCCAATAG CCAGCTACAC CATTAACTAC
ACGGCACTTC AGTACGTGCC AGCACCAAGT GCTCCACATG TAACTCAAGG CCAATCCACG
GGCGCATCAT TTTCAATAAT TAACCTGGCT GGAGTCAACG TAACGTACCT AGCCATTGGG
GTAGTGTTAG TGGCAATGAG CATAGTGGTT CTAGTGCTTA AGGTTGTTAA AGTAAGTAGA
TAA
 
Protein sequence
MRISRLLPIL ALAIAVSVLA YSWSIGSIST NSITIHALAV SGSNQGAVIN ITITAVKGLP 
SYLGGNVYVS AMPLPIGEGG TFISSSQIAA FVATTIAGQS LTSYNFLINV NSSTIEIGGP
SASGYMTVGM YSLITNSSLN PSVVMTGMIM PDGTIGPVGG IPDKIRAAAQ LGYSTVLIPY
GQQNYVSSSG QVINLIELGK QLGVNVIPVA TVYQAIQYFT GRSFNLSLQV TPQVSGNISA
ISAYLYKTLY VNYGNESALG SQADYEAAVS NANNGDYYTA ASLLYDALIN YYTNLFRNAT
LAYAKALVAN VTAQLNQMTN EINSIQPTTA NLDIMVGIYD RIYTAQSLLN TTISDIQADN
TANIPGDLAQ LYVRVITLKY WFNVLNAING GNPIPISYLS KLSGLYTSYA YTTVTYLYSL
ASAEGLTTSL GVINTINQLI NQTNEAQSYY QNGLYLEALA VSLDTIASAS AIIHTMFLIG
GSNASLYLLN VVRNVATYNE ALVSQCNGLP VLSSAYVQFG NYWFSQYNSS LAGNPSSTSS
QSYFEESLSL YEESIAYSLF LRQLYYELGA CPIASYTINY TALQYVPAPS APHVTQGQST
GASFSIINLA GVNVTYLAIG VVLVAMSIVV LVLKVVKVSR