Gene Cmaq_1879 details


Gene Information

Locus tag: Cmaq_1879
Symbol:
ID: 5709154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1953998
End bp: 1956910
Gene Length: 2913 bp
Protein Length: 970 aa
Translation table: 11
GC content: 44%
IMG OID: 641276387
Product: DEAD/DEAH box helicase domain-containing protein
Protein accession: YP_001541686
Protein GI: 159042434
COG category: [R] General function prediction only
COG ID: [COG1201] Lhr-like helicases
TIGRFAM ID:
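
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the stop codon. The variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1953998
end_bp = 1956910
gene_length_bp = 2913
protein_length_aa = 970

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS is read in codons of three bases.
codons, remainder = divmod(span, 3)


def record_is_consistent() -> bool:
    """True if the span matches the gene length and codons - 1 (stop) matches the protein length."""
    return (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa
    )


print(span, codons, record_is_consistent())
```

Under these assumptions the span equals the stated gene length, and the 971 codons correspond to 970 amino acids plus one stop codon.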


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.149039
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0231063
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCAGTAG TAATAATGTC TATTGATGTA TTAAGCATGC TGCATCCCAT GGTTAAGGAA 
TTAGTGGTTA AGAGGGGCTT CAGGGAATTA ACCGAGCCTC AGCTTAAGGC AATCCCAGTC
ATATTAAGCG GTAAAAACAC CCTACTAATA GCCCCCACTG GTAGCGGTAA GACTGAGGCT
GCCCTACTAC CTGTCTTATC AATGTACCTA AATATGCCTC AGGATAAACC GAGGGGAATC
TACATACTAT ACATAACCCC CCTCAGGGCC CTTAATAGGG ATTTACTAAG GAGGATTGAG
TGGTGGGCTA GTGGGGTTGG GTTAACTGTG GCTGTTAGGC ATGGGGACAC TGATAAGAAT
GAGAGGGCTA GGCAGAGTAG GAACCCTCCT CAATTACTCA TAACCACCCC GGAGACCCTT
CAAATACTCT TCATTGGTAG TAGGCTTAGG GAGCACATAT CTAAGGTTAA ATGGGTTATA
GTGGATGAGG TTCATGAGTT AGTTGAGGAT AAGAGGGGGA GTCAATTATC AATCGGCCTA
CAGAAGCTTA AGGCCTTGGC GGGTAGATTC CAGGTAATTG GATTATCGGC ATCAGTGGGT
TCACCCAGTG AAGTGGCTAA ATTCCTTGTG GGTAATGATG AGGAGTGTGA AGTCATTAAC
CTATCCTTCA CTAGGAATTA CCAATTCGAT GTAATTAACC CAGCAGTATC ATTAAGTAAC
GCGGACATTA ATGAACTTAA GGCTAGGTAC ATGATTGTGA ATAACATTAA TGCTGATGAA
TTATTCCATA ACGATGCCTT AGCCAGGTTG CTGTACGTCA TGAATCTAAT TGATGATGGT
GGTAAGTCAT CAATAGTTTT CGTTAACACC AGGTCAATGG CTGAGTTACT CACCTCAAGA
ATACTCATGA TTAACCCAAC CTACCCAGTA TCAATACACC ACAGTTCATT ATCCAGGGTT
AATAGGCTTA ACACAGAGGA GATGCTTAGG GGTGGTAGGC TTAAGGCTGT TGTAGCCACA
TCAAGCCTAG AGCTAGGTAT TGATATTGGG CACGTTGACC AGGTTATTCA ATACGTTTCA
CCGCATCAAG TCACCAGGCT TGTTCAGAGG GTTGGGAGGA GTGGGCATAG GCTTGATAAG
GTACCTAAGG GCATTATAAT AACTGAGGAT ATTAACGATA CCTTGGAGGC AATAGCCATA
GTGAATAGGG CTAGGGCCGG CTGGCTTGAA CCAACCAAGT CCCCTGAGAA GCCTTACGAC
GTCCTCTTTA ATCAAATAGT ATCCCTGATA CTCATGAAAC CCAGGTGGAC TGTTGATGAG
ATACTCAACA TAGTTAAGGG CGCCTACCCC TACAGGAACT TAACTAGGGA TGAGTTGATT
GCAGTTCTCA GATTCATGAG TGAGGGCCTT AGGCCTAGGT TAGTCTACTT CATTGAGGAG
GAGGGTGTTG TGTTAAAGCC GAGGAGCCCT AGGTCCAGGA TTGAGCTTAG GGAATACTTC
TTCAACAACT TATCAATGAT ACCTGAGGAG AAGCAGTACC TGGTTACTAG AGTTGATACA
GGTGAGGTGG TTGGTGTGCT TGATGAGGCA TTCATGGCTG AGTATGGGGA ACCAGGAATA
AAATTCGTAT TCAGGGGTAA CGTATGGATA CTTAGGTCAA TTAAGGATGA TGTAATTTAC
GTTGAACCAG CAAAGGACCC AGTGGGTGCT ATACCATCAT GGATTGGGGA AGAGATACCC
GTCCCCTTTG AGATTGCGCA AGACGTCGGT AGATTGAAGA GAATAATAAC AGACGAGTTA
AGGAACGGCG CCAGTGAGGA TGCGTTAATT AGTAGGTTGA CCAAGGAGTT GGGTGTATCT
AGGCGCACGG TTAAGTACAT TGTGTCAACT ATTAAGGAGC AGTTGAAGTA CGGCTTCGTG
CCAACTGATG ACTCAATAAT CATTGAGAAG GTTGGTGAAT TAGTCGTCGT ACACGCCAGC
TTCGGTACAT TAGTGAATAG GACATTATCA AGATTATTAG CCAGCTACAT GACGCATGCC
CTTGACTTAC CGGTTAGGGT TCAACAGGAC CCATACGCAA TAATACTTCA ACTACCTAGT
AAGGTTGGGC CAGGCATCAT AATTGAGTCC CTGAGGGCGT TGGCGAACTT AAGCATGAGT
GAGTTCATCC AGTATATTAA GAGGATTGCC CTTGAGACTG GTTTATTTAA GAGGAAGGTG
GTTCACGTAG CTAGGAGGTT CAACGTAATT AAGAGTGATA AGTCTGTCTC AGACATTAGC
TTAACCAACC TGATACAAGC CCTTGAGGGT ACACCGGTTT ACGTTGAGGC GCTTAGGGAG
TTCATAACAA GTGATCTTGA TGTTGATGGA GCCGTAAGGG TGCTTAAGTC CATACTGGAT
GGAACCATTA AGGTGAGTGT AATTGAGGGG AGTGAATTCA GTCCAATGGC TAGGGAAATC
CTAAGTAAGG CTAGCAGTAG GCTTGAGGTA ATTGCCCCGG AGAGGTTGGA TAAGTTAATA
ATTGAGAGTG TGAAGGCTAG GTTGCTTAAT GAACCATTAA CCCTGGCATG CCTTGAATGC
GGTAACGTCT ACATTGGGAG TGTTAAGGAT TTGATTAAGG ACTTGAAGTG CCCTAAGTGT
GGTTCAGTTA AGTTAACTGC ATCAAAGATG GAGCCTGATA AGGTTGCTGC AATAATTAGG
AGGGGTAGTG GTGATGATTA TGATAGATTA GTGAAGGCCA GTGAATTATT GACTAAGTAT
GGTTGGAGGG GGTTAATGGC TATAGCGAGT AGGATTAGTC TACGCAGGGC TGAGGAATTC
CTATCATCAG ATGGGTCGGA GGACTTCAAT GAATTCATAA TTAAATTATA CAACGCCGAG
AAGGAGGAGG TTAAGCAGAG GTTCTTCACT TAA
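
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation such as checking the GC content field. The following is a minimal sketch using only the Python standard library. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "TTGTCAGTAG TAATAATGTC TATTGATGTA TTAAGCATGC TGCATCCCAT GGTTAAGGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence block to `clean_sequence` and then to `gc_percent` gives a value that can be compared against the GC content field.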
 
Protein sequence
MSVVIMSIDV LSMLHPMVKE LVVKRGFREL TEPQLKAIPV ILSGKNTLLI APTGSGKTEA 
ALLPVLSMYL NMPQDKPRGI YILYITPLRA LNRDLLRRIE WWASGVGLTV AVRHGDTDKN
ERARQSRNPP QLLITTPETL QILFIGSRLR EHISKVKWVI VDEVHELVED KRGSQLSIGL
QKLKALAGRF QVIGLSASVG SPSEVAKFLV GNDEECEVIN LSFTRNYQFD VINPAVSLSN
ADINELKARY MIVNNINADE LFHNDALARL LYVMNLIDDG GKSSIVFVNT RSMAELLTSR
ILMINPTYPV SIHHSSLSRV NRLNTEEMLR GGRLKAVVAT SSLELGIDIG HVDQVIQYVS
PHQVTRLVQR VGRSGHRLDK VPKGIIITED INDTLEAIAI VNRARAGWLE PTKSPEKPYD
VLFNQIVSLI LMKPRWTVDE ILNIVKGAYP YRNLTRDELI AVLRFMSEGL RPRLVYFIEE
EGVVLKPRSP RSRIELREYF FNNLSMIPEE KQYLVTRVDT GEVVGVLDEA FMAEYGEPGI
KFVFRGNVWI LRSIKDDVIY VEPAKDPVGA IPSWIGEEIP VPFEIAQDVG RLKRIITDEL
RNGASEDALI SRLTKELGVS RRTVKYIVST IKEQLKYGFV PTDDSIIIEK VGELVVVHAS
FGTLVNRTLS RLLASYMTHA LDLPVRVQQD PYAIILQLPS KVGPGIIIES LRALANLSMS
EFIQYIKRIA LETGLFKRKV VHVARRFNVI KSDKSVSDIS LTNLIQALEG TPVYVEALRE
FITSDLDVDG AVRVLKSILD GTIKVSVIEG SEFSPMAREI LSKASSRLEV IAPERLDKLI
IESVKARLLN EPLTLACLEC GNVYIGSVKD LIKDLKCPKC GSVKLTASKM EPDKVAAIIR
RGSGDDYDRL VKASELLTKY GWRGLMAIAS RISLRRAEEF LSSDGSEDFN EFIIKLYNAE
KEEVKQRFFT
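
The protein sequence follows from the gene sequence under translation table 11. The gene starts with TTG, which normally encodes leucine but is read as methionine when it serves as the initiation codon, and it ends with the stop codon TAA. The following is a minimal sketch of that translation in plain Python. The `translate` function is illustrative, and `START_CODONS` is an assumed subset of the initiation codons that table 11 allows.

```python
from itertools import product

# Standard amino-acid assignment, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, reading an initiation codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
head = "TTGTCAGTAG TAATAATGTC TATTGATGTA TTAAGCATGC TGCATCCCAT GGTTAAGGAA"
print(translate(head))  # MSVVIMSIDVLSMLHPMVKE
```

The output matches the first 20 residues of the protein sequence. Applying `translate` to the whole gene sequence is a way to check the remaining residues.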