Gene Information |
Locus tag | Tery_3238 |
Symbol | |
ID | 4243659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4955862 |
End bp | 4957973 |
Gene Length | 2112 bp |
Protein Length | 703 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638108235 |
Product | glycosyl transferase family protein |
Protein accession | YP_722826 |
Protein GI | 113476765 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG0438] Glycosyltransferase; [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
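The start, end, gene length, and protein length fields above are tied together by simple arithmetic. The following is a minimal sketch of that consistency check, assuming 1-based inclusive coordinates (the listed values fit that convention) and a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Tery_3238.
length = gene_length_bp(4955862, 4957973)
print(length, protein_length_aa(length))  # 2112 703
```

Both results agree with the Gene Length (2112 bp) and Protein Length (703 aa) fields in the record.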
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTAG AATTAAAAGA TTGGGAACTA TGGGTAAATG AAAAACTAGC CCAAAAACCA AAAAATACAC CAAGCTTACA GGGAAGTGTC AAGCCAGTTT TATTCATTCT CCATGATATT ACTTCAGCGG GGGCACAGCT ATTTTTAGTT AGGCTTTTAG AATGGATAGG TCAACATGAA CCTACTATGT CTAAAGAAGT GCTGATCGAT ATTTCACGCT CGGATATTGG TAATTATGGA GAGCAAGGAA ACTTTGTTTT AGAGCGGGTA AAAAAATGTG GCGAAGTCCA TTTTATTGAT TCAAATAGTG GACTTCCTGA AAATATTGCT GCTATTCGCT CTGGTTTTTA CTCTTTAATT TATGTTAATA CCTGTGTTTT AGGAGCGTTG CTAGATTCTA TCAGAAAAAT TCCTAGTCCT ATTATTGTTC ATGTTCACGA ATTAAAATTT TGGATTAGTA ATAGGCTAGG TATGGATAAC TTTAATCGTT TATTGAAGTA TGATCCTCGT TGGATTGCTT GTTCTAATGC CGTTCAAGAA AGTTTAGTTA ATTACTGTCT TGTTTCTCCT AAAAAAGTTG ATGTTATTCA TGGGTTTATT CCTTCTGAGA AACTACTTAT TTCTCAAGTA AAAACTCCCC AACAAATGAG GAAAGAGTTA AGTATTTCAG AAGATACTTT TGTTATTGCT TGTTGTGGAA CTTTAGATTG GCGAAAAGGA GGGGATTTAA TAGTTCCTTT GCTAGTTATT TTGAAAAAAA AATTATCCCC AGAAAAAAAG TTTGTTTGTA TCTGGGTTGG TAACTGGGAC AGCCAACTTT CTCAGCTAGA AATTGAATAT ACAGTCGAAA AGGCTGAGTT GGAAAACAAT ATTATTTTTA CAGGGTATCA GAAATCTCCT CTGAATTATA TGAGTTGTGC CGATGTTTTC TTACTTCTTT CTCGGGAAGA TCCTTTTCCT TTGGTAATGA TGGAAGCAGG GGTTTGTAAA CTTCCAGTGG TAGGGTTTGA TGGTTCTGGT GGAGCAACTG AGTTTGTTGA GTCAGAGGCA GGGTTACTTG CTCCTTACTT AAATTTGGAA GTAATGGCGG AAAAAATAGC TATACTTTAC AATAATACCA GTCTTAGGAA AGAAATGGGA GAAAATGCTT ATCGAAAGGT TAATGAACTT TATAACGAAA CTGTTTCTGC ACCGAAAATA TTACAATTAA TTCAATCTTT AGTTCATAAG TCGAGTGAAA CGACTGTAAC TTTTAAGGAT TTTGTACCTA GAGTTTCAGT CATAGTTCCT AACTATAATC ATGCTCCCTA TCTCCGTAAA CGTTTAGATT CAGTTTATAG TCAGACTTAT ACTGATTTTG AAGTAATTTT ACTAGATGAC TGCTCTAGCG ATAATAGTAG AGACATTTTG GCTTCTTATA AGGAAGAAAA ACCTAATACT ATTTTTGTTC CTAACGAATC TAATAGTGGC TCAGTTTTCC GTCAGTGGGA AAAAGGGGTA TCTTTGGCTC GTGGTGAGTA TATTTGGATT GCTGAATCTG ATGATTTTGC CTCTCCTGAT TTTCTGAAGC AGTTAGTAAC TGTTATGGAC GAGCATCCAG AAGTTGGATT AGCATATTCC CAATCTTGGT TAGTTGATAG TCAAGATGTT GTTTCTGGAG ATGCTAGTTG TTGGACTAAT GATTTAGATA GTCAGCGCTG GTCTCAAAGT TTTATTAATG ATGGGAGGGA TGAGATAGTT AGATTTCTGA TTTACAAAAA TACGATTCCT AATGCTAGTG CGGTTTTAAT TCGTAGAAGT 
GCATTGAAAA AATGTGGCGG TGTTATTGAG AAATCTTTTA GATTGTGTGG AGATTGGCTA CAATGGATGA AAATTTTGTC TTGCTCTGAT GTTGGATTTG TGTCAGAATG TTTAAATTAT TGGCGGCAAA ATACCTCTAA TGCTAGAGTT AAGAGCGCGG GTACTTTAGA ATGGACCGAG GGAGAGCAAG TTTTGAGTTA TATTTGTGGT TTGTTAAATT TGCCTGAAGA ACAAAAAGAC AAAATTCTTC TGTCTTTTAT CCGCAGATGT TGGCAGTGGC AAAGAGAATT TATTGAAAAG ACTTATAATT AG
|
Protein sequence | MNLELKDWEL WVNEKLAQKP KNTPSLQGSV KPVLFILHDI TSAGAQLFLV RLLEWIGQHE PTMSKEVLID ISRSDIGNYG EQGNFVLERV KKCGEVHFID SNSGLPENIA AIRSGFYSLI YVNTCVLGAL LDSIRKIPSP IIVHVHELKF WISNRLGMDN FNRLLKYDPR WIACSNAVQE SLVNYCLVSP KKVDVIHGFI PSEKLLISQV KTPQQMRKEL SISEDTFVIA CCGTLDWRKG GDLIVPLLVI LKKKLSPEKK FVCIWVGNWD SQLSQLEIEY TVEKAELENN IIFTGYQKSP LNYMSCADVF LLLSREDPFP LVMMEAGVCK LPVVGFDGSG GATEFVESEA GLLAPYLNLE VMAEKIAILY NNTSLRKEMG ENAYRKVNEL YNETVSAPKI LQLIQSLVHK SSETTVTFKD FVPRVSVIVP NYNHAPYLRK RLDSVYSQTY TDFEVILLDD CSSDNSRDIL ASYKEEKPNT IFVPNESNSG SVFRQWEKGV SLARGEYIWI AESDDFASPD FLKQLVTVMD EHPEVGLAYS QSWLVDSQDV VSGDASCWTN DLDSQRWSQS FINDGRDEIV RFLIYKNTIP NASAVLIRRS ALKKCGGVIE KSFRLCGDWL QWMKILSCSD VGFVSECLNY WRQNTSNARV KSAGTLEWTE GEQVLSYICG LLNLPEEQKD KILLSFIRRC WQWQREFIEK TYN
|
| |
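The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which NCBI translation table 11 shares with table 1. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only short fragments copied from the start and end of the gene sequence are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Tables 1 and 11 share these assignments; they differ in start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


head = "ATGAACTTAG AATTAAAAGA T"  # first 21 bases of the gene sequence
tail = "ACTTATAATT AG"            # last 12 bases of the gene sequence
print(translate(head))  # MNLELKD
print(translate(tail))  # TYN
```

The two fragments translate to the first seven and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.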