Gene Information |
Locus tag | Cmaq_0318 |
Symbol | |
ID | 5710426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 355222 |
End bp | 357492 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641274822 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_001540156 |
Protein GI | 159040904 |
COG category | [R] General function prediction only |
COG ID | [COG1204] Superfamily II helicase |
TIGRFAM ID | |
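The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 355222
end_bp = 357492
gene_length_bp = 2271
protein_length_aa = 756

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
coding_codons = codons - 1

print(span == gene_length_bp, remainder == 0, coding_codons == protein_length_aa)
# prints: True True True
```

Under these assumptions the span is 2271 bp, which is 757 codons, leaving 756 coding codons once the stop is set aside.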
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATATTG AGGAGCTACC GCTACCAAGC TTGCTTAGGG ATTTTCTAAT AAGTAAGAGG GGCATTAGGA CACTGTACCC GCCCCAGGAG GAGGCTATTA GGGCTGGTTT ACTGAATGGT GAAAACATAC TCATGGTTTC AGCCACAGCC TCAGGTAAGA CGCTTCTAGC TGAAGTAGCT GCTGTAAATA ATGTACTGGT TAATGATAAG AAGTCACTGG TGGCTGTTCC CCTGAAGGCA TTAGCCTTCG AGAAGCTTAA TGACTTCAAC ACGTACAGTG AATTAGGCAT TAGGGTGGCT GCCTCAACAG GTGACTATAA TAGTGAGGAT AAGTGGTTAG GCTCTTATGA TGTAATAATA ACTACGTACG AGAAGCTTGA TAGCCTACTC AGGCTTAAAC CGAGTTGGAT ATGGAATGTT GGGCAATTAA TAATTGATGA AATACACTTC ATTAATGATG ATGAAAGGGG ACCAATAATA GAATCCATTG TGGCTAAGTT AAGGATGCTT AACCTTAACC CGCAGATCAT TGGATTAAGC GCAACAATAG GCAACCCGGA GGAGTTGGCT AATTGGCTTA ACGCTAAGCT GGTTAAATCA GATTGGAGAC CAGTTAGCCT AAGGGAGGGG GTTTACCATA AGGGTGTAGT AACGTACGTT AATGACGGTG AAAAGAGGAT CAGCGGGCAG GGTGATTCAC TAATCAACCT AACAGTGGAC ACGTTGAATG ATGGTGGGCA GGTACTGGTG TTCTCATCAT CAAGACAAGG CGCAGTGAGG ATTGCTAGGA AACTGGCTGA GTATATATGT TCATCCCCAG TCAGGTACAT TGATCCTGGT GAGGCAGGTA AATTAGCTGA GGAGGTTAGG GAGACTTCAT CATCAAGGAT ACTGGCTGAG GAATTAACAG GCTTAATAAA ATGCGGCGTA TCCTTCCACC ACGCGGGCCT TGAGTTAGAG GTTAGGAGGG TTATTGAGGA GGGGTTTAGG AGGGGTGTAT TAAGGGTATT AGCCTCCACA ACGACCCTAG CGGCTGGGGT TAATTTACCT GCACGTAGGG TTATTGTGAA TGAGTATAGG CGTTATGAAC CAGGCTACGG CTTCATTGAA ATACCTGTAA TGGAGTATAA GCAGATGGCT GGTAGGGCAG GTAGACCAGG CCTTGACCCC TATGGTGAAG CAATAATAAT TGTTTCCAGT AAGGATGAGG TGGATTACGT CATTGATAAG TACATTAAGT CCCCGCCGGA GTACGTTAAG TCAAACTTCA TGAACCCTAC ATCACTTAAA TTCCACACAC TATCAGCGGT GGCAAGCCAG TACGCTGAAA CCATTGATGA ATTAGTTAAA TTCACTTCAA ACACCTTCGC TGGGTTTCAG GGTAAATTAT CAGCAATGAT TCAAGCTAAT TCAGTGAGGA GGATGATCAG TAGGATCATT GATGAACTGG TTGATTACGG CTTCATAATA AGGAATGGGG ATAAACTAGA GGCCACTGAG GTTGGTGCAG TTGTTAACAG GATGTACCTT GACCCAGATA CTGCACACGT ATTCATAATG GGTTTGAGAA ACCTCAACAG TGACGCTGAC TTAAACGCCT ACTCACTGAT GCTTGTGGTT AAGTCACCTA AGATACCTAA GGTTAAGGTG AGGAGGAATG AGCTTGATGA ATTAGCTCAA CAAGCAGCCT CAATGTGGTC CTCAATACCC CTTAAACCCA GTGATGTGGA TGAGTTAGTT AATTACCCTG AGGATTACGA GGACTTCCTA TCAGAGTTTA AAACAGCCAT GGCGCTCCTT 
GAGTGGATTA ATGAGAGTAA TGAGGATCAA ATTATGAAGA CTTACGACGT CCAACCTGGG GATTTAAGGG TGCTTTCAGA CCAAGCCGAG TGGTTAATAG GTGCACTTCA GGAATTAGCC AGGACACTGG GGTTAAGTGG GAATGTTGTG AATGGGTTGA GGGCGCTTAG GTATAGGGTG AAGTACGGTG TTAATGATGA ACTCCTGGAA CTGGTCGTTA ACCTTGAGGG AGTGGGTAGG GTTAGGGCAA GGGCCCTCTA TGCCGCAGGC TATAGGTCAA TTGAGGACTT AGCTAAGGCA AATGTAAGTG ACTTAACCAG GATACGTGGA ATAGGGGATA AGATTGCTGG TTCAATAATT GAACAGGCTC ATCAATTAGT TAAGGATGGC AGGGTAATTA AGTTTAATGA ATCCACGGTT AAGGGAAAGA CTAGGCGTGG TGGAGGTGGC TTACTTGACC ACATGTATTG A
|
Protein sequence | MNIEELPLPS LLRDFLISKR GIRTLYPPQE EAIRAGLLNG ENILMVSATA SGKTLLAEVA AVNNVLVNDK KSLVAVPLKA LAFEKLNDFN TYSELGIRVA ASTGDYNSED KWLGSYDVII TTYEKLDSLL RLKPSWIWNV GQLIIDEIHF INDDERGPII ESIVAKLRML NLNPQIIGLS ATIGNPEELA NWLNAKLVKS DWRPVSLREG VYHKGVVTYV NDGEKRISGQ GDSLINLTVD TLNDGGQVLV FSSSRQGAVR IARKLAEYIC SSPVRYIDPG EAGKLAEEVR ETSSSRILAE ELTGLIKCGV SFHHAGLELE VRRVIEEGFR RGVLRVLAST TTLAAGVNLP ARRVIVNEYR RYEPGYGFIE IPVMEYKQMA GRAGRPGLDP YGEAIIIVSS KDEVDYVIDK YIKSPPEYVK SNFMNPTSLK FHTLSAVASQ YAETIDELVK FTSNTFAGFQ GKLSAMIQAN SVRRMISRII DELVDYGFII RNGDKLEATE VGAVVNRMYL DPDTAHVFIM GLRNLNSDAD LNAYSLMLVV KSPKIPKVKV RRNELDELAQ QAASMWSSIP LKPSDVDELV NYPEDYEDFL SEFKTAMALL EWINESNEDQ IMKTYDVQPG DLRVLSDQAE WLIGALQELA RTLGLSGNVV NGLRALRYRV KYGVNDELLE LVVNLEGVGR VRARALYAAG YRSIEDLAKA NVSDLTRIRG IGDKIAGSII EQAHQLVKDG RVIKFNESTV KGKTRRGGGG LLDHMY
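The gene sequence begins with GTG while the protein begins with M. This is consistent with translation table 11, where GTG can act as an initiation codon and is read as methionine at the first position. The following is a minimal sketch of that relationship, not the database's own pipeline: the `translate` helper is hypothetical, only GTG and TTG are treated as alternative starts here, and the sense-codon assignments are those of the standard genetic code, which table 11 shares.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these two alternative initiators are handled in this sketch.
ALT_STARTS = {"GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in this record.
first = translate("GTGAATATTG AGGAGCTACC GCTACCAAGC")
last = translate("CACATGTATTGA")
print(first, last)
```

The two fragments correspond to the opening `MNIEELPLPS` and the closing `HMY` of the protein sequence listed above, with the final TGA acting as the stop codon.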