Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4236 |
Symbol | |
ID | 4245888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6539956 |
End bp | 6542760 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638109131 |
Product | hypothetical protein |
Protein accession | YP_723709 |
Protein GI | 113477648 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1743] Adenine-specific DNA methylase containing a Zn-ribbon |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.4305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATT CCCAACGTCC GACTGTTTTT ATTGAAAAAA TTATGCCTGT GAAATTATTA AATCAACAGG TTTATTATGA ACATGGAGGT AATCCTTTTA AGGGGTTGCA TCGCTGGTAT TCTCGTAAAC CTTTATCGTT TTCAAGAGCT TCGGTTTTAG CTTCTTTATT GCCTGATGAT ATTTCTGTAG AAGAGTTTGA GTATTTATTA GGGTTGAAAA AGCGGAGCAA TTCTATTCAA AAAAGTGAAC AGTATAAAGA TGATACTAGG CTTTATAAAA TTCCTCCTGA TGAAACAAGA ATTAAGCAGG TACATGACTA CTGTGAGAAA ACCTGGGGGA CGAGAACACC TACTATTTTG GATGCGTTCG GTGGTGGTGG GAGTATTCCT TTTGAAGCAG CGCGGTATGG GTTGAATGTT TTGGCATCAG ATTTGAATCC GGTGGCAGTG GTGACAATGA AAGCAGCGAT GGAATATCCT TTGAAATTCG GACCTGACTT GCAGCAGGAT ATTGATAAGT GGGTGCAGTG GGTAGAGGAT GAGGCAGAGA AACGGTTAGC TGAGTTTTTC CCGTCGTTGC CAGGGGAAAC GGTGCAGAAT TATTTGTGGG CCCATACGGT GGTTTGTCCG AGTTGTCAGT CGGTTGTGCC TTTGAGTCCG AATTGGTGGT TATACAAACG CCCGGAAAAG CAGAATTTAC ATAAATGGTG TGCGGTGAAA CCTATTCCTA ATCTTGAAGG GAAGCGGGTT GATTTTGAGT TGATAAAAGG AAGTAAGGGA AAGGGTACGA CTATTAAGAC TGATGAGGGT GAGTTCGACC CGAATGATTA CAACACTATT AGTAGGGGTG TGGGTAAATG TCCGAATTGT GGTAGCGTGA TTGAGGATGA TGCAATTAAA TCTCAGGCAC GGTCAGGAAA GCTTGGGCAT CAAATGTATG CAGTGGCATT TAAAAAAGGT AAGGGAAGTT TAGAGTTTAG ATTACCTCAA AATGTTGACT TTGATGGATT GGGTAAAACT GATTATTATT TGAATAGTAG TTTTGAGGAA TTTCAATTAA GTGGTTTATT ACCAGAAATC GAAATTAACT CTGGTGAGAA AACAGACGAA CTTATTAGAT ATGGAATTAA CCAATGGTCA AAATTATTTA ATCCCCGTCA ACTTCTAACC CTTGTCACTT ATGTCGAAAT TATTAACGAT GTTAAGTTAC AATTACAAGC AGAATATGAA CCCGATAAAG TAGAGGCGAT CGCTACTTAT TTGACTTTGC TCCTAGAGCG GTGCATTGAT AAAAATTCAC GTTTATCTTG TTGGGATTCT TCGGTTGCAG TTGCTCAAAA GGCTTCTGTA CAACACTCAC TCAACTTGAT GTGGAATTAC CCAGAATTTA GCGGTAATGG AAAATTATGG AATTGGTGTT CCGATGTTAC TTCAAATTAC CAAAAGCTTT GTGCATTATT CAACTCTAAA CCCCTACCTA TTGACACCCA ACAACACAAC AAAACAATTC AAATAGACTC CGCATCAGCA GACACCCTCT ACCACATCAG CGATAACTCA GTAGATGCCA TCATCACCGA CCCGCCCTAC TACGCCACCA TTCAATACGC CGAACTATCG GACTTTTTCT ACGTCTGGCA GCGCCGAGTC TTAGGCGATA TCTTCCCCGA CCTCTACTTA ACCGAACTCA CCGACAAAGA CAGAGAAGCA GTTGCCAACC CCTCCCGCTT CCGCAACATG GGAACATCCC CCGATGAACT TGCAAACCAA GACTACGAAG CAAAAATTGC ACTCGCCTTT GCCGAACATT ACCGAGTCTT GCGCGACGAC GGCGTAATGA CGGTACAATT TAACCACAAA GAGTCCGGCG CGTGGGATGT CTTAGCAAAA TCTCTCATTG ACGCTGGTTT TGAAATCACC GCATCTTGGG CAGTCAGTAC TGAAAACCCC CAAAACCTCC ATCAAGCTAA GAAAAATTCT GTTTCCAGCA CCGTCTTACT TGTCTGTCGC AAACGTGACC CCAACGCCCC TCAAGCATGG TGGGATGACC TGCAACCAGA AGTTGCCAAC CAAGTAGAGG AACGCGCCCC CGACTTTGAA AAAAATGACA TCACGGGAAT TGACCTATAT CTCAGCGCAT TCGGCCCAGC ATTAAACGTA TTCAGTCGTT CCTATCCCAT ATTAGACAAC AGTGGAGTAG AAGTCCGCCC CGAAGTCGCC TTTGCTGAAG CTAGAAAAGC GATCGCTAAC TACCGCTTCC AGAAACTTGT ACAAACAGAC ACAGCAGGCT TTGATATTTT GACCCAATGG TATTTATTAG CTTGGGATGC TTTCAGTGCC AGGGAGTTCC CCTTTGATGA AGCCAGACAA CTTGCCCTAG CCATAGGAGG TTTCAACGTC AACGACCTGG TTAAAGTTCA CAAATTATTA GACTCAACGA GTGGCACTTG CAAATTATTA ACACCCCGAC AACGACTGAA AAAACGAGCA TTTTCGGTCA CTCCACAAGA TTTTTCTAGT CAATATTTAG TAGATGACAT TCATGCTATT ATTGCTATTT ATCAAGAAGA GGAAAATGTA GAAGTAGTCC GTCGGTTTAT GGAAAAAACA GGATTATTAA GCAATGAAAT GTTTATGCAA ACTATTGAAG TAGCATTAAA AGTAATTCCC GATAAAATAG AAGAGGAACA AACCTTGATG AATTTGTGTT TAATGATGGA TGAAATTAAA GATAATGTCA GTACCCAAGG GAAACAATTA GAATTATTTG AACAGCAGTT AAGCTTAGAT TTTGGAGATG TTTAA
|
Protein sequence | MNNSQRPTVF IEKIMPVKLL NQQVYYEHGG NPFKGLHRWY SRKPLSFSRA SVLASLLPDD ISVEEFEYLL GLKKRSNSIQ KSEQYKDDTR LYKIPPDETR IKQVHDYCEK TWGTRTPTIL DAFGGGGSIP FEAARYGLNV LASDLNPVAV VTMKAAMEYP LKFGPDLQQD IDKWVQWVED EAEKRLAEFF PSLPGETVQN YLWAHTVVCP SCQSVVPLSP NWWLYKRPEK QNLHKWCAVK PIPNLEGKRV DFELIKGSKG KGTTIKTDEG EFDPNDYNTI SRGVGKCPNC GSVIEDDAIK SQARSGKLGH QMYAVAFKKG KGSLEFRLPQ NVDFDGLGKT DYYLNSSFEE FQLSGLLPEI EINSGEKTDE LIRYGINQWS KLFNPRQLLT LVTYVEIIND VKLQLQAEYE PDKVEAIATY LTLLLERCID KNSRLSCWDS SVAVAQKASV QHSLNLMWNY PEFSGNGKLW NWCSDVTSNY QKLCALFNSK PLPIDTQQHN KTIQIDSASA DTLYHISDNS VDAIITDPPY YATIQYAELS DFFYVWQRRV LGDIFPDLYL TELTDKDREA VANPSRFRNM GTSPDELANQ DYEAKIALAF AEHYRVLRDD GVMTVQFNHK ESGAWDVLAK SLIDAGFEIT ASWAVSTENP QNLHQAKKNS VSSTVLLVCR KRDPNAPQAW WDDLQPEVAN QVEERAPDFE KNDITGIDLY LSAFGPALNV FSRSYPILDN SGVEVRPEVA FAEARKAIAN YRFQKLVQTD TAGFDILTQW YLLAWDAFSA REFPFDEARQ LALAIGGFNV NDLVKVHKLL DSTSGTCKLL TPRQRLKKRA FSVTPQDFSS QYLVDDIHAI IAIYQEEENV EVVRRFMEKT GLLSNEMFMQ TIEVALKVIP DKIEEEQTLM NLCLMMDEIK DNVSTQGKQL ELFEQQLSLD FGDV
|
| |