Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0146 |
Symbol | |
ID | 4242431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 214536 |
End bp | 215744 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638105495 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_720114 |
Protein GI | 113474053 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0724066 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCT TAATGCTTTC CTCTACTTTT CCTTATCCTC CTAGCAAAGG CGGTACACAA GTTAGAACAT TTAATCTACT GAAATTTTTG AGCAAGAAAC ACTCTGTTAC TTTGCTAACA CAAAGCTCTC CAGATGTCAA TAATTTAGAA GTAGATTCAT TGAGATCTTA TGTAGAAGAA TTGGCCATAT TTCCCCGCCC TCAAGAGCCT AAAAAAGGAA ATTTTGCAAA ACTAAGACGT TTTAGTAATT TTATAATAGA AGGTACTCCA CCCAGCGTTA TCTCTATATA TTCGCCAGAA ATGCAAAAGT GGGTAGATGA GTTTGTGGGA GCAGGTAGAT GTGAGGCCAT TACTTGCGAA CATTCCGTTA ATGAAATTTA TATTCGTCAT CAATGGCAGG AGAAAGTCAA AACCCTAGTT AATGTCCACA GTTCAGTTTA TGGCACTTGT CGGCAACAGT TGGAAACTGG CATAGCAGAG AAGCCTTTGC GAGATAGACT GAATCTTCCC CTACTCCTGC GTTATGAAAA ACGTTATTGC TCAAAATTTT CCTTTATAAT TGCCACAACT GAAGAAGATG GGAAACAACT AAAGGAATTT AGCCCCAATA GTCAAATTTC AGTTATTCCG AATGGGGTAG ATCTTAGTCA TTTTCCTTGT CGTAGTAGCG ACCCAGGAGG AAAGCAATTA ATTTTTGTGG GAGCTATAGA TAATTTTCCT AATATTGATG CAGTTCGGTT TCTGACTCTA GAGATATTGC CTAGGGTACA AGAACGTTAT CCTGATACAA CCTTAGCATT GGTAGGGGCA AGACCAGTAC GAGAAGTACA GGAATATTCG ACTCGCCCAG GGGTAAAAGT TACGGGCCGC GTACCTTCAA TAGCAGAATA TTTACATCAG GCTACAATTT GTGTAATTCC TATGCGCACT GGTTTTGGAA TTAAAAATAA AACCTTAGAG GCAATGGCAG CAGGTACACC TGTTGTGGCC AGCAATCGTG GTTTGGAAGG ATTAGCAGTT GATGGTGCAG GTAAACCACT AAGAGCATTA AGGGCAAATT ATGTAGAAGA ATATGTAGAA GCCATTGGTC GGTTATTTGA GGATAGGCAG TTAAGACAAA CGTTGTCAGA AAACGGGCGA TCACTGGTAG AAACCGAGTA TACTTGGAAA ATTCAAGGTC AGCGGTATGA AAGGGTATTA CTAGGATAA
|
Protein sequence | MKILMLSSTF PYPPSKGGTQ VRTFNLLKFL SKKHSVTLLT QSSPDVNNLE VDSLRSYVEE LAIFPRPQEP KKGNFAKLRR FSNFIIEGTP PSVISIYSPE MQKWVDEFVG AGRCEAITCE HSVNEIYIRH QWQEKVKTLV NVHSSVYGTC RQQLETGIAE KPLRDRLNLP LLLRYEKRYC SKFSFIIATT EEDGKQLKEF SPNSQISVIP NGVDLSHFPC RSSDPGGKQL IFVGAIDNFP NIDAVRFLTL EILPRVQERY PDTTLALVGA RPVREVQEYS TRPGVKVTGR VPSIAEYLHQ ATICVIPMRT GFGIKNKTLE AMAAGTPVVA SNRGLEGLAV DGAGKPLRAL RANYVEEYVE AIGRLFEDRQ LRQTLSENGR SLVETEYTWK IQGQRYERVL LG
|
| |