Gene Cmaq_1694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1694 
Symbol 
ID5709112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1772568 
End bp1773794 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content37% 
IMG OID641276202 
Productpeptidase M48 Ste24p 
Protein accessionYP_001541507 
Protein GI159042255 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGCGT ACAGTAAGCT ACTTGACTCA TTAATGAGGC TCGGCTCAAT TAAGGTATCT 
GACATCATGA GAAACCTAAG TACCGGTAAT AATAATGATT TACTATATGT TATCTACTCG
CTAAACTCTG ATATCAGCAT TGAGGCGTAC TTCACTAGAT TCTACTTAGC CATTACTAAT
GGATCCCTTA GAATACGTGG CGACTTGAAG AAGGCTGATG AATTCAGTAA GATTATGCTA
AGGAATACTG TTATTGGTAA TGGTAGGAAG CTTGTTATGT TATTTAAAGA TAATGGTGAA
GAGTACGTGA AAATACCCAC TAGACCTACC TCATCAATTA CCATGAACCC GGCAGTATCA
TTCATTGTTT CATCAATACT AACCTTAATT ATATTCCTTC TACTTACGAA GTACGGTATA
TTACTAACCC TAGCCGTGGT TATTGCGCAA GTCTTATTAA CTAACATAGC CTACACCTAT
GTATCCTTCC TCCGCATGAT TAAGTTAAGG GTTAATGGTT CAAACATAAT TAAGGTAGTG
GTAACGTTAC CTATTGATGT GCCTGAGGAT ACGCTTGCCA GATTAGTATC ATATGCTTCA
TCAATTAAGA GCATTAGCAA GAGTCAATTA ACATTGTTAA TATCAGGGCT AAGGGCTATA
GGTGGTTCAA TGATAACCAG TATTAATGTT GAGAGAATAT CAATGCCATT AATTAAAGGC
ATTAATGTTT ACTTAGTCCC ATCACCTGAA TGTAATGCAG TATCCCTAAA CCTCATTAAT
AAGGTAATAT TGGTTAGCAC TAAATTAGTG GCATGCCTTA ATGAAGATGA GTTAAGGGCC
GTGATTCACC ATGAGTTAGG TCATATAATT AATAAAGACA CCTATAAGGC ATTGGTGGCA
TCAGTAGTCT ACTCCCTGGT CTCAGCTGTA ATGCTGCTAT ACGTTATACC AAGGATTGGG
TTAACCCTAG TAACAGTATC CGCTTACGCA TTAATAGCAT TACTGGCTAT AGTCATCTCA
CTTACATTAA GTAGGATTAA TGAAACTAAG GCTGACTTAT ACGCATTAAG CAAGGGTTAT
AAGGAATCAT TAGCCACTGC TTTAGTTAAG GTAACTTACC CATCAATACA TTCACCATTA
ATTAAACAGG TTTTCCTAAG TCACCCAACT ACGTTAAGTA GAGTTAATGC AATCTTAAAG
GCATCTAAGA GACTTAATGG CAAGTGA
 
Protein sequence
MRAYSKLLDS LMRLGSIKVS DIMRNLSTGN NNDLLYVIYS LNSDISIEAY FTRFYLAITN 
GSLRIRGDLK KADEFSKIML RNTVIGNGRK LVMLFKDNGE EYVKIPTRPT SSITMNPAVS
FIVSSILTLI IFLLLTKYGI LLTLAVVIAQ VLLTNIAYTY VSFLRMIKLR VNGSNIIKVV
VTLPIDVPED TLARLVSYAS SIKSISKSQL TLLISGLRAI GGSMITSINV ERISMPLIKG
INVYLVPSPE CNAVSLNLIN KVILVSTKLV ACLNEDELRA VIHHELGHII NKDTYKALVA
SVVYSLVSAV MLLYVIPRIG LTLVTVSAYA LIALLAIVIS LTLSRINETK ADLYALSKGY
KESLATALVK VTYPSIHSPL IKQVFLSHPT TLSRVNAILK ASKRLNGK