Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_1062 |
Symbol | |
ID | 5710134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 1110687 |
End bp | 1114382 |
Gene Length | 3696 bp |
Protein Length | 1231 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641275562 |
Product | hypothetical protein |
Protein accession | YP_001540881 |
Protein GI | 159041629 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000818904 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00000512763 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTGCTC AACCTCAAAT AAGCATTGAT GATGTTTTAA ACTGTTTTCT ACAAGGCATT AGGGAGGCTA ATAATGAGGA AGAGTTGAGG ATCAGGATAT CCAGGTGCAT AGAGGATATG ATCCTCGAGC CACTCGGCAT AACACAGTAC GGTGGTCGCT ATGAGTACAC CCTAGTCTCT GGGGCTAGGG TGGATGCCCT CTACGGACAC GTAATCATAG AATATAAAGC CCCAGGCAGA CTCTCAACAA ACACCGACAT AACCAAGGCC AAGGACCAAG TCATAAACTA CATTAAAACC GAGGCCGCCT CAAAAGCTGA GTGGGATAGG TATCTGGGTA TTATAATCAG TGATAGAATA GCCTTCGTGA GGTACTACAA ACCGCAGGAT ACCTGGATCC TAAGGGGCCC GTATAACATT ACCAGGGAGA GCATTATAAA GATTATTGAG GCCCTGAGGG GGCTAAGGAG GAAGGCGCTC AGTGTGGATA ACCTCTTGAG GGACTTCGGC TCCCAGTCGG AACTTGCTGG GAGAGTTATT AGGGCGTTGT ACAGGAGGCT CATTGAGACT GGGAACCCGA GGACTAGGGT GTTGTTTGAG GATTGGATGA GGCTCTTCAG ACAAGCAACA GGTTATAGGC CTGAGGAGCT TGAGGAATTA CCCACGTTGG CTCAGGAGTA CGGGTTAGCC GATAATGTGA ACTTCGATGC ATTAATATTC GCAATACAAA CATACTACGC CTTCATACTA AAGCTCCTCG CCGCGGAGGT TGTTTACCTA TACGGCGGTG GTAGGTTTTA CATATCCTAC ATAGCCGATC TCGATGACGC ATACACAAGG GGCAGTGTCA ACGCCCTAAG GGATGAGCTT AGGGAGTTGG AAAGCGGTGG CATTTTTAGG CACTTCGGCT ATGAGAATTT CCTAGAGGGT GATTACTTCT CCTGGTACCT GGAGGAGCTT GATGGGGAGC TTGCTGGGGC GCTGGCTGAG GTGATCAGGA GACTCTCGGA CTACGAGCCG GCAACGCCTC AATTAGAGCC CGAGTACGCT AGGGACCTAC TCAAGAGACT TTACCAGGAA TTAATGCCCC GCGACATAAG GCATAATCTT GGCGAGTACT ACACGCCTGA CTGGCTCGCT GACTTCCTAC TCGATGAGGT GGGGCTTAGC CTAGGGAACC TCATGGAGAT GGGTAAGGAG GACTCACTGA AGCCACTGCA GCAACTGAGG GTTCTGGACC CGGCATGTGG CTCGGGCACC TTCCTAGTGA GGTACATAGC GAGGCTTAGG GCTTATGCCA GGGAGTACTT CCTGGAGGAT GTACTCGTGG ACTACGTACT ACAGAACGTG GTTGGCTATG ACCTAAACCC ACTGGCGGTA CTCACCGCCA GAACAAACTA CCTACTCATG ATTGCGGACT TGCCAAAGAA GGGAACAATA GAGATACCGA TTTACATGGC TGACTCATTG ATGGTTGAGA GGAGATCCAC ACTAATGGGT GATGTTTATG CGCTGAGGAC AACGGCTGGG GAGTTCAGGA TACCGGCAAA CATCATCGAG AAGAACCTAC TGCCGGATAT ACTCCACGAG GTGACTGACG CATTGAGGAA CAGGTACAAG CCGGAGGAGT TCAAAGCAAG ACTACAGTAC AGATTCAAGG AGTTGGGAGA AAACGAGCTC AGCGTCCTGT TGGATTTCTA CAGCACACTC CTAAAGCTGG AGGAGGAGGG TAAGGACGAT GTGTGGGTAT CCATAATAAG GAACGCCTTC GCACCAATAC TGAGGGGCAA GTTCGACTAC ATTGTGGGTA ACCCACCATG GGTTAATTGG GAAAACCTGC CGGAGGACTT CAGGGAGTTG TCCAATGACT TGTGGCAGCA CTATGGGCTG GCCGAGATTA GGGGGAAGAT GGGGTTGGGT AAGGTTAAGA GGGACTTGGC GATGCTCTTC ATGGCCAGGT GCTTCGACCT ATACCTAAAA CCCGGCGGTA AGCACGCATT CCTAATGCCA TTCACAGTAT TCAAGACACA GGCTGGTTCA GGGTTCAGGA GGTTCCTAGC CACAAAAACC AGGGTGCACG TTGTACATGA CATGGTAACT CTGTACCCAT TCGAGGGGGC CGTGAACAGG GTCTCGGCAA TAGTTGTTGA GAAACCCGAG ACGGGCAACG CTACGGGGAA CAAGGGTGGG GTTAGGCACA TCATATGGGT TAACAGAACA GGGAAGCCCA TACCCACCGA TGCGCACTTA GAGGATGTCC TCAAAGTTAC GGAGAGGTTC GAGGCCATAA TGGCACCCGT AGTGGAGAGG GACCCGTCAA GCCCCTGGAT GCAGGTAACA AGTAAAGTAC TACCCTACAT TAGGAAGATA ACCTCCGGGA CCTCACCATA CGAGGCACAT GCAGGCGTTT ATACAGCTCT AAACCAGGTA TACTTCATAC AGATCAAGCA GAGACTCCCA GACGGTAAGC TACTAATCAT CAACCCGCCG GAGCCTGGGC AGAAGAAGAG GGTTAAGCAG GTCGAGGTGA CTATAGAGCC TGACCTGGTC TACCCGCTCA TTAGGGGTAG GGACGTCAAG AGGTGGTACG TAGACTACAA GGAGAGGTAT ATCATTCTTC CTACAGATGA GCAGGGAAAT ATGATTAAGC ATTATGAAAT GAAGATAAAG TACCCCAATA CGTATAGATA CTTCTATGAG TTCTTCAGGG ATCTAGTGAA TAGGGGTGGT GAGCCCTACA AAAGTAAGTT AGAACCTTAT AGGAGGTTAC CACTTGAGGT GGCGGAAAAA AGCGCACCGC CGTTTTACTG GGTGTTCAAC GTAGAGCCTA GCCTAGTTCC CTACAAGGTA GTGTGGAGTA GAATTGCCGG GGGCATTAGT GGTAAGGCTG TTAGTTTTTC ATGTGCTGTA GTAGAACCCA TAGAGATAAA GGATCTAGGT AAAAAGCCTG TGATACCTGA TGATGGAACT ATACTAATAG ATCTCAAAGA TCCGAAGGAG GCCTACTATA TTGCTGGCGT CCTTAATTCA ATTATCGTAA GGACAACAAT AGCAGCATAT ACATACGAGT TGAGACAGGA AACGCATATA GTCGATTTTA TAAGAATTCC TCGTTATGAT CCTAATAATG AATTGCATGG GAGGATTTCC GAGCTCTCCA GGAGGGCTCA CGAGATTGCC AGGTGTATCC ACGCCAGTGC CAAGCCCGAG TACTGTGGGT TCATTAGGGA CCCCGAGGGT GGGCTCAGGA AGATTGAGGA TGAGCTTGAT AGGGCCGTGG CTCAGCTCTA CGGCATACCC GAAGATGCCC TGGAGGAGTT CAGGAAGTTA CTGGGTGTGT TGTCTGGCGA GAGCGTTCCA GAAGAGGAAG AGAGTGTTGT GGGGGTTGTT AGGCCCTCGG TGGAGTTTCT CAAGGTCAAT GTTGTTGCTG GGCAGACTGA TTACCTGGAG GTTAATATTG TGACTGCGGG GCTTTGTGAT AAGGCCGTAC TCATTCTTAA GTGGCCCTGG GGCACGCAGA CGCTGAACCT TGATGATGGT AGGCATAGGA TTGAGGTTAA GGTCCCTGAG GGGGTTTATG AGGTTGCGTA TAGTTTTAAG TGCTCTGGTT ATGAGTATGA TGGTAGTGTT AAGGTAACGG CATCCTCCAA GGTGCCGGAG GGGCCTAGGA GGTCCAGGAC TCTTAGACTT GGGTGA
|
Protein sequence | MAAQPQISID DVLNCFLQGI REANNEEELR IRISRCIEDM ILEPLGITQY GGRYEYTLVS GARVDALYGH VIIEYKAPGR LSTNTDITKA KDQVINYIKT EAASKAEWDR YLGIIISDRI AFVRYYKPQD TWILRGPYNI TRESIIKIIE ALRGLRRKAL SVDNLLRDFG SQSELAGRVI RALYRRLIET GNPRTRVLFE DWMRLFRQAT GYRPEELEEL PTLAQEYGLA DNVNFDALIF AIQTYYAFIL KLLAAEVVYL YGGGRFYISY IADLDDAYTR GSVNALRDEL RELESGGIFR HFGYENFLEG DYFSWYLEEL DGELAGALAE VIRRLSDYEP ATPQLEPEYA RDLLKRLYQE LMPRDIRHNL GEYYTPDWLA DFLLDEVGLS LGNLMEMGKE DSLKPLQQLR VLDPACGSGT FLVRYIARLR AYAREYFLED VLVDYVLQNV VGYDLNPLAV LTARTNYLLM IADLPKKGTI EIPIYMADSL MVERRSTLMG DVYALRTTAG EFRIPANIIE KNLLPDILHE VTDALRNRYK PEEFKARLQY RFKELGENEL SVLLDFYSTL LKLEEEGKDD VWVSIIRNAF APILRGKFDY IVGNPPWVNW ENLPEDFREL SNDLWQHYGL AEIRGKMGLG KVKRDLAMLF MARCFDLYLK PGGKHAFLMP FTVFKTQAGS GFRRFLATKT RVHVVHDMVT LYPFEGAVNR VSAIVVEKPE TGNATGNKGG VRHIIWVNRT GKPIPTDAHL EDVLKVTERF EAIMAPVVER DPSSPWMQVT SKVLPYIRKI TSGTSPYEAH AGVYTALNQV YFIQIKQRLP DGKLLIINPP EPGQKKRVKQ VEVTIEPDLV YPLIRGRDVK RWYVDYKERY IILPTDEQGN MIKHYEMKIK YPNTYRYFYE FFRDLVNRGG EPYKSKLEPY RRLPLEVAEK SAPPFYWVFN VEPSLVPYKV VWSRIAGGIS GKAVSFSCAV VEPIEIKDLG KKPVIPDDGT ILIDLKDPKE AYYIAGVLNS IIVRTTIAAY TYELRQETHI VDFIRIPRYD PNNELHGRIS ELSRRAHEIA RCIHASAKPE YCGFIRDPEG GLRKIEDELD RAVAQLYGIP EDALEEFRKL LGVLSGESVP EEEESVVGVV RPSVEFLKVN VVAGQTDYLE VNIVTAGLCD KAVLILKWPW GTQTLNLDDG RHRIEVKVPE GVYEVAYSFK CSGYEYDGSV KVTASSKVPE GPRRSRTLRL G
|
| |