Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_0322 |
Symbol | |
ID | 5709456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 359208 |
End bp | 361193 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641274826 |
Product | ribosomal protein S12/S23 |
Protein accession | YP_001540160 |
Protein GI | 159040908 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1372] Intein/homing endonuclease |
TIGRFAM ID | [TIGR00982] ribosomal protein S23 (S12) [TIGR01443] intein C-terminal splicing region [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.497912 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGGTA AAAAGAGCCC ATTAGGACTA TATGCCGCTA GGAAGCTTAG AAGAAAGAGA CAGAAATTCA GGTGGAGTGA TATTCAGTAT AAGAAGAGAG CGTTAGGCTT TATTAAGAAG TATGACCCAC TGGAAGGAGC CCCAATGGCA AGAGGAATAG TACTGGAGAA GGTAGGCGTA GAGGCAAGAA AACCCAACGC AGCAGTGAGG AAGTGCGTAA CCCCAGATAC ATTAATTAAT TTAACACCCA GGGTAGCAAC TAGGATAATT AACCTCGCGG GTAATTGGCA TGAAGTTTCA ATAATCCACT TCAATAAGGG TTCTAAGGTA ATTGAACCAA CTAGGCTCAT TGACTTCTTC CACATAGAGC CCGAGGAGTT TAGAGAGGAT GGTGTGTATG AGTTGAGGAC ACTCTACGGT AGGAGAATAA TAGCGAGTGG TGATCACCCA ATATACACTG GGAGAGGCAT ATTACCCCTT AAGGAGGTTA AGCCAGGTGA TTACGTAGCC GTATACCCAA GGGAGCCCAT TGAGGCTTAC ACGCTTAATG ATGATAGAGT AATACTAACT GAGGATGACA TAAGAAGGGA GGCTCCACCA AACGCCAAGG TTGATCAAAT TATTAATAGG CTTAAGGAGT TAGGACTGAT ACCACTTAAG TATAGTAATA GTAATATTTA TAGGATTGCA CGATTAGTCG GCCATCTATT TGGGGATGGG TCACTAAGCT ACGTTAAGAG TGGCAATGGT TATGAGGGTA AAGTGGCATT CAGTGGAAAT CCCAGTGACC TTGAGGACAT TATAAATGAT TTAATTGAAT TAGGATTTAA ACCATCAAAG ATTAGGGAGT ACCACGGCGA AAGCATTGTT ACCTGGAGTA ATGGATTAAT GAATGTAATT AGCGGTAAGT CAAATGTTGC CTTCGTCACC TCAATAACGT TATTCACGCT ACTTAAGGCA TTGGGGATTC CCGTTGGTGA TAAGGCACTG CAGGCATATA GAGTGCCTGA GTGGGTAATG GAGGCTCCAT TGGACGTTAA GGCGGAGTTT TTAGCCGCGT ACTTCGGTAG TGAGTTAGAG AGACCGCGGG TTGAGAATAA CGGTAGAACA TTCCAACCAC CAACACTGGT AATACACAAG GCTGAGAACC TGGCTGGGAA TGGTATTGAA TTCCTAAACG ATATTAGGAA ACTACTTAGT GAATTCGGTA TAGAGACAAC ACCCGTGAGC GTTAGCAGTG GCGTCATTAG GAAGAATGGC ATTAGGACCG TTAGGTTGAG GTTTTCAATA ATTAGTAACA CTAATAACTT ACTAAGGTTC TTTGGAAGGA TAGGCTATGC GTACAATAAG GAACATGACT CACTGGCGGT ATTAGCGTAT GAGTATCTAC GTAGAAAGGT TTTGCTTTGG CAGCATTACG CTGATGCTTA TGAATTAACG AGGAAATTAA TGAGCGAAGG CTACGACGGA CCCAGTATAG TTAAGGCCCT TAGGAGCGCC GGGTATAGGG TTAGTAGGGC CACGGTGTAT AAGTGGTTAA AGGGTGTTAA GAATACTAAG TACATAGGTA GAACAGCCAG AATACCATCA TTCAGTGATT GGATTATTAA GGCCACACTA GGGCTGGGGA ACTCGGGGCT GGTTTGGGAT GTGGTCACTG AAGTTAAGCC CATTGAGTGG AATGATAGGC TTTGGGATGT GACCACTGAA AGCTCATACC ATAACTTCAT TGCAAACGGC CTGGTAACCG GCAACTGTGT CAGGGTCCAG CTTACGAAGA ATGGGAAGGT GGTTACGGCA TTCGTGCCGT GGGATGGTGG TTTAAACCTC ATTAATGAGC ATGATGAGGT GATCATAGAG AGGATCGGTG GCCCAGAGGG TAGGGCTTAC GGTGACCTAC CTGGCGTGAG GTTTAAGGTA ACTAAAGTTA ATGGAGTCAG TCTTAAAGCC ATACTACTGG GTAAGAAACA GAAACCAGTT AGATAA
|
Protein sequence | MSGKKSPLGL YAARKLRRKR QKFRWSDIQY KKRALGFIKK YDPLEGAPMA RGIVLEKVGV EARKPNAAVR KCVTPDTLIN LTPRVATRII NLAGNWHEVS IIHFNKGSKV IEPTRLIDFF HIEPEEFRED GVYELRTLYG RRIIASGDHP IYTGRGILPL KEVKPGDYVA VYPREPIEAY TLNDDRVILT EDDIRREAPP NAKVDQIINR LKELGLIPLK YSNSNIYRIA RLVGHLFGDG SLSYVKSGNG YEGKVAFSGN PSDLEDIIND LIELGFKPSK IREYHGESIV TWSNGLMNVI SGKSNVAFVT SITLFTLLKA LGIPVGDKAL QAYRVPEWVM EAPLDVKAEF LAAYFGSELE RPRVENNGRT FQPPTLVIHK AENLAGNGIE FLNDIRKLLS EFGIETTPVS VSSGVIRKNG IRTVRLRFSI ISNTNNLLRF FGRIGYAYNK EHDSLAVLAY EYLRRKVLLW QHYADAYELT RKLMSEGYDG PSIVKALRSA GYRVSRATVY KWLKGVKNTK YIGRTARIPS FSDWIIKATL GLGNSGLVWD VVTEVKPIEW NDRLWDVTTE SSYHNFIANG LVTGNCVRVQ LTKNGKVVTA FVPWDGGLNL INEHDEVIIE RIGGPEGRAY GDLPGVRFKV TKVNGVSLKA ILLGKKQKPV R
|
| |