Gene Tery_2422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2422 
Symbol 
ID4244838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3736248 
End bp3739607 
Gene Length3360 bp 
Protein Length1119 aa 
Translation table11 
GC content41% 
IMG OID638107512 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_722112 
Protein GI113476051 
COG category[V] Defense mechanisms 
COG ID[COG4096] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.567699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.274661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCAA ACTTTGACTT TCTCAAACCC CACTACCCCA AACTACACCA AACCGCCACC 
CAGGCAGAAA CATTAATCAA CAACGCCCCC CGGGCCAGTT GCTTCTACAC CCGCTACACC
TTAGAACAAG CGGTTATCTG GCTCTACGAA AATAATCCTT ACCTAGAGCT CCCCAAGACC
TACAACTTAG GGGCATTAAT TCACGAGCAA ACCTTTAAAG ACAACCTCCA AAACAACCTT
TTCCCCAAAC TCCGCATCAT TCACAAAATA GGCAACGCAG CAGCCCACGA CCCCAAACCC
ATCACCTCCA AAGATGCTCG TTACCTCATC CAAGAACTCT ATCACTTCCT CTACTGGCTC
ACCCGCTACT ACAGCCCCGA CGGCAAAAAC CTCCCTAACC TCAAATTCAA CCCCGACCTC
ATTCCCACAA GCACAACTAA GTCAGACCTC ACCCTCGATA AACTCCAACA ACTCGAAACC
CAACTTTCTA CCGCCCAAGA AATGCAACGC ATCGCCGAAC AAAAAGAAAA ACAAACCACC
GCAGAACTAG AAACAGCAAA AGCCGAACTC GCCCAACTCA AACAACAAAA CCAGACCAAA
AGCGATCGCC ACGACTACAA CGAAGCCGAT ACCCGCACCT ACTTACTTGA TATTTTACTC
AAAGAAGCTG GTTGGCCGAT ACATCTGCCA GAGTCTACAG AATACCAAGT CACAGGAATG
CCTAACGAAA CAGGAAAAGG TAAAATTGAC TATGTATTTT GGGGAGATGA CGGCAACCCT
TTAGCAATAG TAGAAGCAAA ACGCACTCGC AAGAGTCCCC AAACAGGACA ACATCAAGCC
AAACTATACG CCGACTGTCT GCAAGCAGAA TTTAACCACC GCCCCGTCAT TTTCTACAGC
AACGGTTATG AGCATTGGAT ATGGGATGAT TTTAACTATC CACCCCGCCC CATTCAAGGC
TTTCTCAAAA AAGACGAACT AGAGCGTTTA ATATTCCGCC GGAGTCATCG TCAACCCCTC
CATACCCTGG ATGTCAACCC CGACATCGCC GGACGTTGTT ATCAAAAAGA AGCTATCTGT
TGCATCAAAG AAATATTTGA CAACAAACAG CGTAAAGCAT TATTAGTCAT GGCAACAGGA
ACAGGTAAAA CTCGGACAGC CGTCAGTATT GTAGAATTAT TGGAACGAGC CAACTGGGTG
AAACGAGTCT TATTTTTAGC GGACCGCAAC GTCCTCTTAA CTCAAGCCCA AGGTGTCTTT
AACAGTAACC TTAACAGTAT TACCACTGCA AACCTAACCG AAAAAAAACA AGACGCAGAA
AACGCCACCA TTGTCTTTTC CACCTACCCC ACCATCAGCA ACCGCATCAA CGCAACTGAT
GGAGATAAAC GCTTATTCAG TCCTGGATAT TTTGATTTAG TTATAGTAGA TGAAGCACAC
CGTTCCATCT ATCGCAAATA TCGGCAAATA TTTCAATATT TCGATGCTCT ACTTTTAGGT
TTAACAGCTA CACCCCGCAA TGAAGTAGAC CGGGACACTT ACAGTATTTT TGACTTAGAG
CCTGGGGTTC CCACCTTTGC TTATGAATTA GACAGCGCCA TCAAAGACGG TTATCTCGTT
CCTCCCAGTG GTGTCGAAGT TCCCTTTAAG TTTATGCGGG CAGGTATTAG ATATTCAGAA
CTGCAACCAG AGGAAAAGAC AGCCTATGAG GAACGGTTTG CTGATGCAGA AACTGGGGAA
GTGCCGGACA AAATTAATGC TACAGCTCTT AATACTTGGT TATTTAATAT TAGTACCGTA
GACCAAGCGC TACAGTTATT AATGGAGCGG GGGCTAAAAG TAGAAGGAGG CGATCGCCTC
GGTAAAACCA TCATTTTTGC TCGCAACCAC AAACATGCTG AGTTTATTTC TGAACGGTTT
AATGCTAATT ATCCCCACTA CAAAGGTCAG TTTGCTCAAA TTATTGATAG CCAAAGTTCC
TACTCTCAAA GTTTGCTGGA TGACTTTTCA AATGCAGACA AACAGCCAAT TATTGCGGTG
TCTGTAGATA TGTTAGATAC AGGGGTAGAC GTGCGGGAAG TAGTTAACTT AGTATTTTTC
AAACCAGTCT ATTCTCGGAT AAAATTCAAT CAAATGATTG GACGCGGTAC TCGTTTATGC
CCTAACTTAT TTGCTCCGGG TGATGATAAA ACAGAATTTT TAGTATTCGA CCTGTGCAGT
AACTTCAGTT ATTTTGAGCA ACAGATAGAC GAAAAAAATG TAAAAATTCC TGACAGTCTC
ACAACAAAAT TAGTCAAATC ACGCCTTGAA CTGAGTCAAT TAGTACCCAC TGGCGAGTTA
AAAAATAACC TCTTAGATGA GTTGCATCAA TATATTAACT CAATGCCCAA AGATAACTTT
TTAGTGCGCC CCCATTTGCA GCAAGTAGAA GAGTTTTCCC AGAGAGAGCG ATGGAATGAG
TTAGGAGAAA GCGATCAAGA AATAATAGGG GAGTCATTAG CAAGTTTACC CAATGGTTTA
CCAAAAGAGA GCCATTTAAA TAAACGGTTT GACCTAATAT GTGTCAAATT ACAGCTAGCA
TTGTTAAAAC AGTCAACAGA CTTTATCAGA CTCCGGGATA ATATTCGAGA TATTTGCTCT
CAACTCGCCC AAAAGTCTAA CATTCCCATG ATAGCAGCAA AACTACAAGT TATTGAAGAA
GTACAAGCAG AGGAATGGTG GCGAGATATT ACAGTAGAAA TGGTTTCAAT TTTACAGCAA
GACTTACGAG AATTAGTAAA ATTTATAGAT AGCCAAGAAC AAAAAATTAT GTATGTAAAC
TTTGCCGATG AAATGGGAGA AGTGCAAAAT GTCAACGTTC CCACCCAAAC ATCAGGGTTT
AGTCCTCTCC AGTATAAGAA AAAAGTAGAG GCATATATTC GGAGTAATGA GAACAATGTT
GCTGTTGCTA AATTAAAACG TAATATCCCC TTAACCGACG CTGACTTAAC AGCATTAGAA
GAAATGTTAT TTAATAGTGA AGTTATAGAA GATAGGGAAA TATTTGCCGA AGTTTACGGT
CAAAATATTA GTTTAAAACT ATTTATTAGA AAGTTAGTCG GATTAGATAG AAATGCAGCC
AAAGAATTAT TTAGTAAATA TTTAAACAGC AAATTTAACA CCAGTCAAAT TAGATTTGTT
GAGAATATTA TCGACTATTT AACTCAAAAT GGAGTGATGT CCCCAGAATT ATTATATGAG
CCACCTTTTA CAGATTTACA CACGGAAGGA TTAGATGGTA TCTTTGCAGA TAAAGAGGCA
GATAACATTA TTGAGATATT GACAGAGATT AATGAAAGTG TTGATTATGA CGTAGCTTAG
 
Protein sequence
MTSNFDFLKP HYPKLHQTAT QAETLINNAP RASCFYTRYT LEQAVIWLYE NNPYLELPKT 
YNLGALIHEQ TFKDNLQNNL FPKLRIIHKI GNAAAHDPKP ITSKDARYLI QELYHFLYWL
TRYYSPDGKN LPNLKFNPDL IPTSTTKSDL TLDKLQQLET QLSTAQEMQR IAEQKEKQTT
AELETAKAEL AQLKQQNQTK SDRHDYNEAD TRTYLLDILL KEAGWPIHLP ESTEYQVTGM
PNETGKGKID YVFWGDDGNP LAIVEAKRTR KSPQTGQHQA KLYADCLQAE FNHRPVIFYS
NGYEHWIWDD FNYPPRPIQG FLKKDELERL IFRRSHRQPL HTLDVNPDIA GRCYQKEAIC
CIKEIFDNKQ RKALLVMATG TGKTRTAVSI VELLERANWV KRVLFLADRN VLLTQAQGVF
NSNLNSITTA NLTEKKQDAE NATIVFSTYP TISNRINATD GDKRLFSPGY FDLVIVDEAH
RSIYRKYRQI FQYFDALLLG LTATPRNEVD RDTYSIFDLE PGVPTFAYEL DSAIKDGYLV
PPSGVEVPFK FMRAGIRYSE LQPEEKTAYE ERFADAETGE VPDKINATAL NTWLFNISTV
DQALQLLMER GLKVEGGDRL GKTIIFARNH KHAEFISERF NANYPHYKGQ FAQIIDSQSS
YSQSLLDDFS NADKQPIIAV SVDMLDTGVD VREVVNLVFF KPVYSRIKFN QMIGRGTRLC
PNLFAPGDDK TEFLVFDLCS NFSYFEQQID EKNVKIPDSL TTKLVKSRLE LSQLVPTGEL
KNNLLDELHQ YINSMPKDNF LVRPHLQQVE EFSQRERWNE LGESDQEIIG ESLASLPNGL
PKESHLNKRF DLICVKLQLA LLKQSTDFIR LRDNIRDICS QLAQKSNIPM IAAKLQVIEE
VQAEEWWRDI TVEMVSILQQ DLRELVKFID SQEQKIMYVN FADEMGEVQN VNVPTQTSGF
SPLQYKKKVE AYIRSNENNV AVAKLKRNIP LTDADLTALE EMLFNSEVIE DREIFAEVYG
QNISLKLFIR KLVGLDRNAA KELFSKYLNS KFNTSQIRFV ENIIDYLTQN GVMSPELLYE
PPFTDLHTEG LDGIFADKEA DNIIEILTEI NESVDYDVA