Gene Information |
Locus tag | Tery_2421 |
Symbol | |
ID | 4244837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3735023 |
End bp | 3736231 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638107511 |
Product | restriction modification system DNA specificity subunit |
Protein accession | YP_722111 |
Protein GI | 113476050 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.318068 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATGGC AGCGTGTTTT TGTTGAAGAT GTAGCTAAAA TTGTAACTAA GGGAACTACT CCTACTTCTA TAGGTTTTAG CTTTTCTAAA GAAGGTATCC CTTTTCTACG AGTCAATAAT ATCCAAGATG GTAAAATCAA TCTTGGTGAT GTTTTATTTA TTGACTCAAA AACGGATCAA GCTCTTGCGC GTTCTCGAAT TTTAAAAAAA GATGTAATAA TTTCAATTGC TGGTACAATT GGAAAAACCG CAGTTATTCC TACTAATGCT CCAGCAATGA ACTGCAACCA GGCACTTGCA ATAATAAGGC TTCACAATAA TGTAGACCCC TACTATTTTA ACCATTGGCT GAATACAGGA GATGCGTTTC GACAAATTAC AGGTTCAAAA GTGACAGCAA CTATTTCTAA CCTAAGTCTT GGTTGTATCA AAAAGCTCAA AATCCCCCTC CCCCCAATAG AAGAACAGCG CCGAATAGCT GCAATACTCG ACCAAGCTGA TGCTATCAGA CGAAAGAGAC AACAAGCGAT CGCTCTAACA GATGAATTAT TGCGTTCTAC ATTCCTGGAG ATGTTCGGCG ACCCTGTTAT TAATCCGAAG GGGTGGGAGG TAAAAAAATT AGAGGAAGTT GCATTAAAAC GCAAGGGGGC TATAAAATGC GGACCTTTTG GTAGCCAACT ACTTATAAGC GAGTTTGTCA AAGATGGTAT TCCAGTATAC GGAATAGACA ATGTTCAAAA AAATGAGTTT GTTTGGGCCA AACCCAAGTA TATTACTACT GAAAAGTACG AGCAATTAAA AAGCTTTTCT ATCCAGGATG AGGACGTTCT GATTTCAAGA ACTGGAACAG TTGGAAGAAC TTGTGTCGCA CCACCTGATA TCCCTAGAAG TATCCTTGGA CCTAACTTGC TGAAAGTTTC CTTAAATACT AACAAAATGC TTCCTAAATA TTTGTCCTAT GCTTTAAATC ACTCTAATCC CCTGATTGAA GAGATAAAAA GAATGTCACC AGGTGCTACA GTTGCAGTTT TTAACACAAC AAACCTTAAA GCTTTGAGGT TAACAATTCC CCATATAAAC CTGCAATCCC AGTTTGTCAA CTTTACTGAA AATGTTGAAT TGACAAAGCA AAAAGAGTCT AACTACCTCA CAGAATCCAA CAACCTATTT AACTCCCTGT TACAACGCGC ATTCAAAGGC CAACTATAA
|
Protein sequence | MKWQRVFVED VAKIVTKGTT PTSIGFSFSK EGIPFLRVNN IQDGKINLGD VLFIDSKTDQ ALARSRILKK DVIISIAGTI GKTAVIPTNA PAMNCNQALA IIRLHNNVDP YYFNHWLNTG DAFRQITGSK VTATISNLSL GCIKKLKIPL PPIEEQRRIA AILDQADAIR RKRQQAIALT DELLRSTFLE MFGDPVINPK GWEVKKLEEV ALKRKGAIKC GPFGSQLLIS EFVKDGIPVY GIDNVQKNEF VWAKPKYITT EKYEQLKSFS IQDEDVLISR TGTVGRTCVA PPDIPRSILG PNLLKVSLNT NKMLPKYLSY ALNHSNPLIE EIKRMSPGAT VAVFNTTNLK ALRLTIPHIN LQSQFVNFTE NVELTKQKES NYLTESNNLF NSLLQRAFKG QL
|
| |
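The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the terminal stop codon (the gene sequence above ends in TAA). The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon
    that is not counted in the protein length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Tery_2421).
length_bp = gene_length(3735023, 3736231)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed Gene Length (1209 bp) and Protein Length (402 aa).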