Gene Information |
Locus tag | Tery_1097 |
Symbol | |
ID | 4242178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1725763 |
End bp | 1726743 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638106323 |
Product | glycosyl transferase family protein |
Protein accession | YP_720935 |
Protein GI | 113474874 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
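The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The record does not state either convention explicitly, and the helper names `cds_length` and `protein_length` are hypothetical.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
gene_len = cds_length(1725763, 1726743)
print(gene_len, protein_length(gene_len))  # prints: 981 326
```

Under those assumptions the result agrees with the Gene Length (981 bp) and Protein Length (326 aa) fields.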
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000713189 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.938252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTAC CATCAATTTC CGTTGTTATT CCTACCTACG GGCGGGAGGA ACCTCTAATA GATACGATCG CTGATGTTAT TAAACAAGAC TACCCTAATT TTGAAGTTTT AGTAGTTGAC CAAACTGGGA CTCATAAACC AGAAACTCAA GTTTATCTGG AAAAACAAGC TAATGCTGGT AGGATAAAAT TGTTCCGCCT GACTTGGGCC AGCCTACCAG GAGCAAGGAA TTATGCAGTC AGACGTTGTA GTGGCGAAAT CATCCTTTTT ATTGATGATG ATGTACGTTT GCAAAAAGGA TTTTTGGCGG CTCATGCTGG TAACTATATT GATCGCCCAG AGATAGGAGC AGTGGCAGGA CGAGTATTTG ATCGCATGAA ATTAGGAGAT TCTGGTGGAG AGTTGGAAAT AGAAGATTTA CCTCCAGAAG CGTCTGACCC TGGTATTGCT TGGTATCATA TAGATTTGGT ACATACGGTT AAAGCTCAAC GAGTCATCTC AGCAAGAGGG TGCAATATGT CATATCGCCG AGAAGTTTTT ACTAAATATG GTTTGAGTTT TGATGAGCGG TTTCGTGGCA GTGCAGTAAG GGAAGAGTCT GATTTTTGTT TGAGGTTACG ACGAACTGGC TATCACATTT GGTTTGACCC AGAAGCTTCT TTAGTTCACT TGGGTGAAGA AAGTGGGGGT TGTCATGATA TTTCTACAAG GTCTCTCAAA TATCAAATTA CTTTTTATCA CAATCACTTT TTTATGGGAT TAAAAAATCT CACTCCTCAT CAATGTCTGC GTTTTTTTGG CAAATTATTT GATTGTCATG TGTTGGGGAA CCCCCCTTGT TACAAAAGTG GTTCTCCTAT AAAAATAATT ACTCGCGGCG GTTTCTATAT TCTTGGTTTT TTTAGTGCTG TTCAAACAAG GATTAAATCT ATTTGGGATG ACGGGCAAAT TTATACAAAA AAAGTCAACA CTCAAGAGTA A
|
Protein sequence | MNLPSISVVI PTYGREEPLI DTIADVIKQD YPNFEVLVVD QTGTHKPETQ VYLEKQANAG RIKLFRLTWA SLPGARNYAV RRCSGEIILF IDDDVRLQKG FLAAHAGNYI DRPEIGAVAG RVFDRMKLGD SGGELEIEDL PPEASDPGIA WYHIDLVHTV KAQRVISARG CNMSYRREVF TKYGLSFDER FRGSAVREES DFCLRLRRTG YHIWFDPEAS LVHLGEESGG CHDISTRSLK YQITFYHNHF FMGLKNLTPH QCLRFFGKLF DCHVLGNPPC YKSGSPIKII TRGGFYILGF FSAVQTRIKS IWDDGQIYTK KVNTQE
|
| |
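As a rough illustration of how the two sequence fields relate, the sketch below translates the first 30 bases of the gene sequence and compares the result with the start of the protein sequence. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The names `translate`, `gc_fraction`, and `clean` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the Gene sequence field above.
prefix = "ATGAATTTAC CATCAATTTC CGTTGTTATT"
print(translate(prefix))  # prints: MNLPSISVVI
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same kind of comparison against the Protein sequence and GC content fields.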