Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2268 |
Symbol | |
ID | 4243361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3529656 |
End bp | 3534458 |
Gene Length | 4803 bp |
Protein Length | 1600 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638107369 |
Product | glycosyl transferase family protein |
Protein accession | YP_721969 |
Protein GI | 113475908 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.59897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCTA CTTCTTTCGC TCAGGCAAAT CAACTATTTA AGGAAGGAAA AATAGAGGAA GCGATCGCTT TTTACAAAAA GGCCACCACT AAAAGTAACC AATTCTATTG TTCTCACCAT AATTTAGGAG AGGCATTGAT CAAAGCAGGA CGCATCAAAG AAGCTGCTGC TGCTTTTCGT GAGGCATTGG CCATCAACCC TAATTCTGCT TGGTCTCTTT ATAAGTTAGG GGCAATGTTA AACAAGTTAG GCCAGTATAA GGAAGCTGTG GGTTATTTGC GGCAAGCGGT GGAAAAGAAA ACAAATGTGC CGGAGTTTTA TTTAAGTTTA GGGAGGGCAT TGGTGCATTT GGGACAATGG TCGGACGCGG AGGAATCTCT CTATCAGGTG GTTGACTTCT GTGTTGACTC TCTCCCTCTA GAGGAGGAGG CAAAGCAGGT ATATGGAACT TCCTTAATGA CATTCTATGT ATCAGAGGCT TATTTTTACT TGGGAGACAT AAAGTTTGGG CAACAGCAAT GGTCTGAAGC GGTGAAATTT TATCGTCAGA GTTGGGAAAC TAATTATGGT AAATTGGAAT GCTGTATGGG TTGGGCTGCA AGTTTGGGTA AGTTGGGACG ATGGTCGGAA GTTGTGGAAT GTTATCGTCA GAGCGCACAT CTGTTTGATG AGTCTGAGGA GTTTTTGTTT GGGTTTGGTC AAGCTTTGTC GCAGTTGGGT CGATGGGAGG AGGCGGTTGT TAAGTATCGG TTAGTAGTGG AAATTAATCC GAAGTCGGCT GAGTTTCGGC ATCATTTGGG GTATGCTTTG ATGCAGTTGA AGTATTGGGT CCAGGCAGAG ATTGAGTTAC GGAAGGCAGC GGAGTTGCAT CCAGCATCTG CCGTGGTTTG GCAGCACTTA GGGAATGTTT TGAGGGAGTT GGGGAAGAGG GAGGAGGCTG TTAAGGTTTA TCAGCGGGGG CAACAGATTG ATACTGAGTC TGTAAGTCCT AAACAGATTG CAGATAAGCA GTTGCTAGAA AGGGTGCGGA AATATTTAAA GGCGTCAAAG TCCAAAAAAA AGAACGTGGT TGTTTATACA GCCATTTGCA ATAACTATGA TGTATTAAAA ATACCAGAGT TTTTATGCCC TGACTGGGAT TATGTTTGCT TTACGGATAG AGCGCAATAT CCAGGGGAGC ATTGTTGGGA AATTCGTCAT TTCGATTACA TACATGAAGA TAGCACTCGC ACTGCTCGAT ATGTAAAAAC ACATCCTCAT ATTTATTTCA ATAACTATGA GTATAGTATC TGGATAGATG CGCACATTCT GGTTAAATCC AATTTTCTGG AGGAGTTTCT GAATAGTTTT ATCAAAAATC AGCAATTGTT TGCAGCTATC CCTCACCCAT ATAGGAATTG TACTTATCAA GAAGCTAATA TATGTAGCCA GCAAGAGAAA GACGACAAAG ATACAATTGA AGAGCAAACT ACTCACTATC AACAGGAAGG TCTGCCCTAT GAATTAGGAT TAATAGAGAC AGGTGTTATG ATTCGTAAAC ATAACGACAA TTGTATTAGG AACTTGCACA ATTTATGGTG GGAAGAAATA GAAAAATATA GTAAAAGAGA CCAGCTTTCC GTAATGTTTG CTCTATGGAA AACTGACTTA AAGTGGATTC CATTGATGAA GAAAGGGCGA TCAACAAGAA ACCACCCAGG ATTTGGCTTC TTTCAGCATG GGTTAAGATC TACCACAAGA GATATCCCCT ATCAGATACC CTCTTTCCTG CCTAAAGAAT TTGCCTTTGA ACAAACTCCA TTTTGGCACT CAAAAACTAA GCCTTACGAT CGAAATTCCT TAAAAAGCAT AAGTGCTTTT CCTATTGATA TTGTCATTTG TGTACACAAT GCCCTTGAGC ATGTTAAAAA TTGTATTAAT TCTGTGCTTG TCAATCTTTT GCCAAGCCAT AAGATTATTA TTGTGGATGA CGGTTCTGAT CATGACACCT GGGACTACCT AAAACAGATT GCTGCACAAA ACCATAGCAT TACTTTAGTT CGACATGAGA TAGCTAAGGG TTACACAAAA TCTGCAAATT TAGGAATGCA GATTGCTGAT GCGGAGTTTG TTATTCTTTT AAATAGTGAT ACTATAGTAA GCCCAGACTG GGCTTTGAAA CTTTTGCAAA CTGCAACGAG TAGTAATTTG ATAGGTGTAG TAGGTCCCAT GAGTAATGCA GCATCATGGC AATCATTACC TTTTGTCAAA GATCCAAAGA ACGGTGGAAT GATAATCAAT GAACTTCCAG AGGGCAAGAC TGTAGCTGAC ATGGATAAGT TCTGCGAGCA ATACGGTTGG TTTGGGCATT TCCCCCGGGT TCCTTTAATT AATGGTTTTT GTTATGGTAT AAAGCGCGAA GTTATTGATG CTATTGGCTA CTTTGATGAA GAAGCTTTTC CCAAAGGATA TGGGGAGGAA GATGATTATT CTATGCGCGC CTTGAATGCA GGTTTTACCC ATGCATTGGC AACCCATTGC TATGTTTTTC ATGCCAAATC AAAGAGTTTT GGTGCGGAAA CTAGAAGTGT CCTTGCTGAA GCAGGTGGGA ATGCACTAAG AGCCAGACAT GGAGAGAAAC GAATCAAAAG AGCTGTGATT TCTCTAAATT TTCATCCATT TCTTAAACGT TTACGGGCTT ATGCTGGGGA GTATTATGGC GTTGAAGTTC TAGAAAAGGG ACTTACTAAA CTTTCATCTG TGCCGAAAAA CACAAATTTG CCAGAAAGCT CCGATCTACC GAAAAGCACA AATTTGCCAG AAAGCTCTGG ACCGTCAGAT TCTCCCCAAA AATTAGCCAA CTACGATTTA AGTCAGCTCA CAAAAATAAA TTCGGTAATT TTTCCTAAAG AGCTACTGCT GGTAAAAATT TCATTACAGG AGCTACAGGA GAACAGCGTT TTTACTGAAA ACATGAGAGG TAAGCCCTTA GAAGAGCCTC CAGGAAAAGT GTTATGGATT ATCCCTAACT GTCGAAATAT ATTAGCGGGA GGAATTAGAA CTGTATTTAT GGTAGCCGAG GAATTTTCAC GGTCCTGGAA TAGTAAAAAC GTTTTTTTAA TTCAAATGAC TTCTCCGACA GAGGTATTTT TAGACAAATC TACAATAAAA AAATTCTTCC CTGACCTAAA TTTTGAATTA ATTGTTTTTA ATTTTGATGA TAACCCAAAA GAAATTCCCA GAACAGATAT AGCTTTTTGC ACAGCTTGGC ATACAGCTTA TCTACTTGCT CGATACAATA ATTGCCAAGC CAAATTTTAC TTTATGCAAG ATGATGAGTC GTTATTCTAT CCAGCTGGTA GCGTGAGTGG AACGATTGAT ATGACTTACC GCTTTGGTTT TCATTGCATT GCCAATTCTT TAGGAATAGC GGAAAAGTAC AAGCAATATG GAGATAATGT AACATACTTC ACTCCGGGAA TAGATCGGAG TATTTATTAT CCGAAACCTT ATAGTAAAAA CAAACTACCG TGGCAAGTTG TATTTTATGG CAGACCGAAA AATAAAAGAA ATGCTTTTGT TCTGGGGATT GAGGCTTTAA AGCTTGTCAA GTTACATTTC CAGGACAATG TAAGAATCGT CTCTGTTGGT AGCGATTGGA ATCCCAAGGA TTATGGCCTC GAAAGCATTA TTGAGAACTT AGGTGTGCTT GACAGCTTGG AAAAAGTAGC AGAGCTTTAT GGAAATAGTC ATGTGGGCTT AGTCTTTATG CTGACCCCAC ATCCATCCTA CCAACCTCTT GAATATATGG CTTCAGGCTG TGCTACAGTT AGTAATTATA ACTTAGGAAC TTCATGGCTT TTTCAGCACG AACAAAATTC ACTGTTATCG TCTCTCTTAC CCGATGATGT TGCTGGAAAC ATTATTCGTT TATTAGAAAA TAACCAACTA AGGGAGCAAA TTATCAAAGG AGGGCTAGAA ACAACAAAAA AAATGGATTG GAAATTTGCA TTTGAGAGAA TTAAAGAATT TGTTATTAAC CCCGATTCTG ATGATACAAA AAATTATAGA ATATCCAATG CGGAAAAAAT ATCTATTAAC CCAAGAGATG AATTCATAAA AATATTGCGG GAAGGAAATT TTGACTATCT TGACTTTGGT TGTTCAAAAG GAAATTCCCT AAACTGGAGC AAACGCCTTT TTGGTGGTAA ACAGGGTTTG GGAATTGATA TCGATCCCAA AAAGATAGCT CAAGCGAAAG CTGCTGGTCA TAATGCTGTG ATATTTGATA TAAACAATAT TCCTACAAAA AAACTGGTAA GATTTACAGT TCTCTCCCAC TTACTTCAGA ATTTGCCAAG CGAAAACGAT GTTAAAGCTT TTGTACGAAA AGCTTGCCAA GTATCTACTG ATTTTGTTTT TATCAAGCAA CCCTATTTTG ATGCGGATGG TTATTTATTC CAAAATGGAC TCAAATTGTT TTTCTCTGAT TGGACTGGAC ATCCGAATCA GATGACTACT CTATCTTTGT TTAAGTTAAT GAAGGAGTTA AAAGATGAAG GGCTTTTGCA GAAGTTTTCA ATTCATGGTA AAAAACCAAT TTTGTCCTCT GATGACAACC ATGTACAGTC GATTAATGCT CCCATAGATC AGCATCACTT CGATTCATCA AAACATCCAC CTAAGATTCA GGGGTTTAAA TTTGAGTTTC CAGTTTTTTA TGAAACGGTT GTGATGATCT CAATATCAGG TGTTAGCCAT TATGAATACT TTAAAAAATT TCCAACAGAT GCAACCTTTT TTGAAAGCTG GTCTGTTGAT TAA
|
Protein sequence | MTATSFAQAN QLFKEGKIEE AIAFYKKATT KSNQFYCSHH NLGEALIKAG RIKEAAAAFR EALAINPNSA WSLYKLGAML NKLGQYKEAV GYLRQAVEKK TNVPEFYLSL GRALVHLGQW SDAEESLYQV VDFCVDSLPL EEEAKQVYGT SLMTFYVSEA YFYLGDIKFG QQQWSEAVKF YRQSWETNYG KLECCMGWAA SLGKLGRWSE VVECYRQSAH LFDESEEFLF GFGQALSQLG RWEEAVVKYR LVVEINPKSA EFRHHLGYAL MQLKYWVQAE IELRKAAELH PASAVVWQHL GNVLRELGKR EEAVKVYQRG QQIDTESVSP KQIADKQLLE RVRKYLKASK SKKKNVVVYT AICNNYDVLK IPEFLCPDWD YVCFTDRAQY PGEHCWEIRH FDYIHEDSTR TARYVKTHPH IYFNNYEYSI WIDAHILVKS NFLEEFLNSF IKNQQLFAAI PHPYRNCTYQ EANICSQQEK DDKDTIEEQT THYQQEGLPY ELGLIETGVM IRKHNDNCIR NLHNLWWEEI EKYSKRDQLS VMFALWKTDL KWIPLMKKGR STRNHPGFGF FQHGLRSTTR DIPYQIPSFL PKEFAFEQTP FWHSKTKPYD RNSLKSISAF PIDIVICVHN ALEHVKNCIN SVLVNLLPSH KIIIVDDGSD HDTWDYLKQI AAQNHSITLV RHEIAKGYTK SANLGMQIAD AEFVILLNSD TIVSPDWALK LLQTATSSNL IGVVGPMSNA ASWQSLPFVK DPKNGGMIIN ELPEGKTVAD MDKFCEQYGW FGHFPRVPLI NGFCYGIKRE VIDAIGYFDE EAFPKGYGEE DDYSMRALNA GFTHALATHC YVFHAKSKSF GAETRSVLAE AGGNALRARH GEKRIKRAVI SLNFHPFLKR LRAYAGEYYG VEVLEKGLTK LSSVPKNTNL PESSDLPKST NLPESSGPSD SPQKLANYDL SQLTKINSVI FPKELLLVKI SLQELQENSV FTENMRGKPL EEPPGKVLWI IPNCRNILAG GIRTVFMVAE EFSRSWNSKN VFLIQMTSPT EVFLDKSTIK KFFPDLNFEL IVFNFDDNPK EIPRTDIAFC TAWHTAYLLA RYNNCQAKFY FMQDDESLFY PAGSVSGTID MTYRFGFHCI ANSLGIAEKY KQYGDNVTYF TPGIDRSIYY PKPYSKNKLP WQVVFYGRPK NKRNAFVLGI EALKLVKLHF QDNVRIVSVG SDWNPKDYGL ESIIENLGVL DSLEKVAELY GNSHVGLVFM LTPHPSYQPL EYMASGCATV SNYNLGTSWL FQHEQNSLLS SLLPDDVAGN IIRLLENNQL REQIIKGGLE TTKKMDWKFA FERIKEFVIN PDSDDTKNYR ISNAEKISIN PRDEFIKILR EGNFDYLDFG CSKGNSLNWS KRLFGGKQGL GIDIDPKKIA QAKAAGHNAV IFDINNIPTK KLVRFTVLSH LLQNLPSEND VKAFVRKACQ VSTDFVFIKQ PYFDADGYLF QNGLKLFFSD WTGHPNQMTT LSLFKLMKEL KDEGLLQKFS IHGKKPILSS DDNHVQSINA PIDQHHFDSS KHPPKIQGFK FEFPVFYETV VMISISGVSH YEYFKKFPTD ATFFESWSVD
|
| |