Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2422 |
Symbol | |
ID | 4244838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3736248 |
End bp | 3739607 |
Gene Length | 3360 bp |
Protein Length | 1119 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638107512 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_722112 |
Protein GI | 113476051 |
COG category | [V] Defense mechanisms |
COG ID | [COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.567699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.274661 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCAA ACTTTGACTT TCTCAAACCC CACTACCCCA AACTACACCA AACCGCCACC CAGGCAGAAA CATTAATCAA CAACGCCCCC CGGGCCAGTT GCTTCTACAC CCGCTACACC TTAGAACAAG CGGTTATCTG GCTCTACGAA AATAATCCTT ACCTAGAGCT CCCCAAGACC TACAACTTAG GGGCATTAAT TCACGAGCAA ACCTTTAAAG ACAACCTCCA AAACAACCTT TTCCCCAAAC TCCGCATCAT TCACAAAATA GGCAACGCAG CAGCCCACGA CCCCAAACCC ATCACCTCCA AAGATGCTCG TTACCTCATC CAAGAACTCT ATCACTTCCT CTACTGGCTC ACCCGCTACT ACAGCCCCGA CGGCAAAAAC CTCCCTAACC TCAAATTCAA CCCCGACCTC ATTCCCACAA GCACAACTAA GTCAGACCTC ACCCTCGATA AACTCCAACA ACTCGAAACC CAACTTTCTA CCGCCCAAGA AATGCAACGC ATCGCCGAAC AAAAAGAAAA ACAAACCACC GCAGAACTAG AAACAGCAAA AGCCGAACTC GCCCAACTCA AACAACAAAA CCAGACCAAA AGCGATCGCC ACGACTACAA CGAAGCCGAT ACCCGCACCT ACTTACTTGA TATTTTACTC AAAGAAGCTG GTTGGCCGAT ACATCTGCCA GAGTCTACAG AATACCAAGT CACAGGAATG CCTAACGAAA CAGGAAAAGG TAAAATTGAC TATGTATTTT GGGGAGATGA CGGCAACCCT TTAGCAATAG TAGAAGCAAA ACGCACTCGC AAGAGTCCCC AAACAGGACA ACATCAAGCC AAACTATACG CCGACTGTCT GCAAGCAGAA TTTAACCACC GCCCCGTCAT TTTCTACAGC AACGGTTATG AGCATTGGAT ATGGGATGAT TTTAACTATC CACCCCGCCC CATTCAAGGC TTTCTCAAAA AAGACGAACT AGAGCGTTTA ATATTCCGCC GGAGTCATCG TCAACCCCTC CATACCCTGG ATGTCAACCC CGACATCGCC GGACGTTGTT ATCAAAAAGA AGCTATCTGT TGCATCAAAG AAATATTTGA CAACAAACAG CGTAAAGCAT TATTAGTCAT GGCAACAGGA ACAGGTAAAA CTCGGACAGC CGTCAGTATT GTAGAATTAT TGGAACGAGC CAACTGGGTG AAACGAGTCT TATTTTTAGC GGACCGCAAC GTCCTCTTAA CTCAAGCCCA AGGTGTCTTT AACAGTAACC TTAACAGTAT TACCACTGCA AACCTAACCG AAAAAAAACA AGACGCAGAA AACGCCACCA TTGTCTTTTC CACCTACCCC ACCATCAGCA ACCGCATCAA CGCAACTGAT GGAGATAAAC GCTTATTCAG TCCTGGATAT TTTGATTTAG TTATAGTAGA TGAAGCACAC CGTTCCATCT ATCGCAAATA TCGGCAAATA TTTCAATATT TCGATGCTCT ACTTTTAGGT TTAACAGCTA CACCCCGCAA TGAAGTAGAC CGGGACACTT ACAGTATTTT TGACTTAGAG CCTGGGGTTC CCACCTTTGC TTATGAATTA GACAGCGCCA TCAAAGACGG TTATCTCGTT CCTCCCAGTG GTGTCGAAGT TCCCTTTAAG TTTATGCGGG CAGGTATTAG ATATTCAGAA CTGCAACCAG AGGAAAAGAC AGCCTATGAG GAACGGTTTG CTGATGCAGA AACTGGGGAA GTGCCGGACA AAATTAATGC TACAGCTCTT AATACTTGGT TATTTAATAT TAGTACCGTA GACCAAGCGC TACAGTTATT AATGGAGCGG GGGCTAAAAG TAGAAGGAGG CGATCGCCTC GGTAAAACCA TCATTTTTGC TCGCAACCAC AAACATGCTG AGTTTATTTC TGAACGGTTT AATGCTAATT ATCCCCACTA CAAAGGTCAG TTTGCTCAAA TTATTGATAG CCAAAGTTCC TACTCTCAAA GTTTGCTGGA TGACTTTTCA AATGCAGACA AACAGCCAAT TATTGCGGTG TCTGTAGATA TGTTAGATAC AGGGGTAGAC GTGCGGGAAG TAGTTAACTT AGTATTTTTC AAACCAGTCT ATTCTCGGAT AAAATTCAAT CAAATGATTG GACGCGGTAC TCGTTTATGC CCTAACTTAT TTGCTCCGGG TGATGATAAA ACAGAATTTT TAGTATTCGA CCTGTGCAGT AACTTCAGTT ATTTTGAGCA ACAGATAGAC GAAAAAAATG TAAAAATTCC TGACAGTCTC ACAACAAAAT TAGTCAAATC ACGCCTTGAA CTGAGTCAAT TAGTACCCAC TGGCGAGTTA AAAAATAACC TCTTAGATGA GTTGCATCAA TATATTAACT CAATGCCCAA AGATAACTTT TTAGTGCGCC CCCATTTGCA GCAAGTAGAA GAGTTTTCCC AGAGAGAGCG ATGGAATGAG TTAGGAGAAA GCGATCAAGA AATAATAGGG GAGTCATTAG CAAGTTTACC CAATGGTTTA CCAAAAGAGA GCCATTTAAA TAAACGGTTT GACCTAATAT GTGTCAAATT ACAGCTAGCA TTGTTAAAAC AGTCAACAGA CTTTATCAGA CTCCGGGATA ATATTCGAGA TATTTGCTCT CAACTCGCCC AAAAGTCTAA CATTCCCATG ATAGCAGCAA AACTACAAGT TATTGAAGAA GTACAAGCAG AGGAATGGTG GCGAGATATT ACAGTAGAAA TGGTTTCAAT TTTACAGCAA GACTTACGAG AATTAGTAAA ATTTATAGAT AGCCAAGAAC AAAAAATTAT GTATGTAAAC TTTGCCGATG AAATGGGAGA AGTGCAAAAT GTCAACGTTC CCACCCAAAC ATCAGGGTTT AGTCCTCTCC AGTATAAGAA AAAAGTAGAG GCATATATTC GGAGTAATGA GAACAATGTT GCTGTTGCTA AATTAAAACG TAATATCCCC TTAACCGACG CTGACTTAAC AGCATTAGAA GAAATGTTAT TTAATAGTGA AGTTATAGAA GATAGGGAAA TATTTGCCGA AGTTTACGGT CAAAATATTA GTTTAAAACT ATTTATTAGA AAGTTAGTCG GATTAGATAG AAATGCAGCC AAAGAATTAT TTAGTAAATA TTTAAACAGC AAATTTAACA CCAGTCAAAT TAGATTTGTT GAGAATATTA TCGACTATTT AACTCAAAAT GGAGTGATGT CCCCAGAATT ATTATATGAG CCACCTTTTA CAGATTTACA CACGGAAGGA TTAGATGGTA TCTTTGCAGA TAAAGAGGCA GATAACATTA TTGAGATATT GACAGAGATT AATGAAAGTG TTGATTATGA CGTAGCTTAG
|
Protein sequence | MTSNFDFLKP HYPKLHQTAT QAETLINNAP RASCFYTRYT LEQAVIWLYE NNPYLELPKT YNLGALIHEQ TFKDNLQNNL FPKLRIIHKI GNAAAHDPKP ITSKDARYLI QELYHFLYWL TRYYSPDGKN LPNLKFNPDL IPTSTTKSDL TLDKLQQLET QLSTAQEMQR IAEQKEKQTT AELETAKAEL AQLKQQNQTK SDRHDYNEAD TRTYLLDILL KEAGWPIHLP ESTEYQVTGM PNETGKGKID YVFWGDDGNP LAIVEAKRTR KSPQTGQHQA KLYADCLQAE FNHRPVIFYS NGYEHWIWDD FNYPPRPIQG FLKKDELERL IFRRSHRQPL HTLDVNPDIA GRCYQKEAIC CIKEIFDNKQ RKALLVMATG TGKTRTAVSI VELLERANWV KRVLFLADRN VLLTQAQGVF NSNLNSITTA NLTEKKQDAE NATIVFSTYP TISNRINATD GDKRLFSPGY FDLVIVDEAH RSIYRKYRQI FQYFDALLLG LTATPRNEVD RDTYSIFDLE PGVPTFAYEL DSAIKDGYLV PPSGVEVPFK FMRAGIRYSE LQPEEKTAYE ERFADAETGE VPDKINATAL NTWLFNISTV DQALQLLMER GLKVEGGDRL GKTIIFARNH KHAEFISERF NANYPHYKGQ FAQIIDSQSS YSQSLLDDFS NADKQPIIAV SVDMLDTGVD VREVVNLVFF KPVYSRIKFN QMIGRGTRLC PNLFAPGDDK TEFLVFDLCS NFSYFEQQID EKNVKIPDSL TTKLVKSRLE LSQLVPTGEL KNNLLDELHQ YINSMPKDNF LVRPHLQQVE EFSQRERWNE LGESDQEIIG ESLASLPNGL PKESHLNKRF DLICVKLQLA LLKQSTDFIR LRDNIRDICS QLAQKSNIPM IAAKLQVIEE VQAEEWWRDI TVEMVSILQQ DLRELVKFID SQEQKIMYVN FADEMGEVQN VNVPTQTSGF SPLQYKKKVE AYIRSNENNV AVAKLKRNIP LTDADLTALE EMLFNSEVIE DREIFAEVYG QNISLKLFIR KLVGLDRNAA KELFSKYLNS KFNTSQIRFV ENIIDYLTQN GVMSPELLYE PPFTDLHTEG LDGIFADKEA DNIIEILTEI NESVDYDVA
|
| |