Gene Information |
Locus tag | Tery_2111 |
Symbol | |
ID | 4243947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3296981 |
End bp | 3298258 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638107219 |
Product | glycosyl transferase family protein |
Protein accession | YP_721820 |
Protein GI | 113475759 |
COG category | [C] Energy production and conversion; [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | [TIGR01426] glycosyltransferase, MGT family |
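
The coordinate and length fields in this table are tied together by simple arithmetic. Assuming 1-based, inclusive coordinates (an assumption, though the record's own numbers are consistent with it), the gene length is End bp minus Start bp plus one, and the protein length of an unspliced CDS is the codon count minus the stop codon. A minimal Python sketch of that consistency check follows; the function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(3296981, 3298258)
print(length, protein_length(length))  # 1278 425
```

The strand does not enter this calculation: a minus-strand gene has the same span, only its reading direction differs.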
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000775281 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.550046 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCATT TTGGTCTGAT TTGTCCGGCA TTGACAGGAC ACCTAAATCC AATGCTTCCC ATAGGACAAG AATTAAAAAG GCGTGGTCAT CGTGTCACGA CGATAGGGAT ACTTGACGCT GAAGCTAAGA CACTAGCAGC AGGATTAGAA TTTGTTGCCT ATGGCACGGA AGAATATTCT AAGGGCAGCA CAGCAGAAGC TTTAAATCAC CTGAGTAAAC TCAGCGGGTT AGCTGCATTT CGCTATACAA TTACACTATT AAAAGACTGG ACAAATGTTT TGCTTCGGGA TGCTCCGCAA GTCATCAAAA ATGCTGGTGT AGATGCTTTG TTAATCGACC AGGCTTCATT AGGAGAATCT ATAGGGGATT TTCTAGACAT TCCCTTTATT ACTATTTGTA GTGCACTGGT ACTCAATCAA GATGAGAATG TTCCCCACCC TGTAAGCAAC TGGAAATATA ACCCCGCCTG GTGGGCAAAA CTGCGTAATA GAGCTACTTG GAGTTTCTAT CAAATCTTAG GCAAACCTAT TAACAAGGTA GTAGCTGAGT ATCGTCGTCA ATGGAATTTA CCTTTGTACT CTGACCCCAA TGATGCTTAT TCTCTACTGG CTCAAATTAG TCAGCAACCT GCTGAGTTAG AATTTCCCAG AGAAAATTTA CCTAAGTGTT TTCATTTCAC AGGACCTTAT CATTATTCAG GTACTAGAGA ACCTGTTTCC TTTCCTTGGG AACAGTTGAC AGGTAAACCT TTAATTTATG CCTCTATGGG AACTATACAA AATCGTTTGG TTGAGGTATT TTATCAAATT ACAGCAGCTT GTGAGGGGTT GGATGCTCAG TTAGTTATTT CTCTGGGAGG TTCTGCCACT CCAGAATCTC TACCCAACTT AGCAGGAAAT CCTCTAGTTG TTGAATATGC ACCCCAATTA GAAATACTGC AAAAAGCTAC TCTCACTATT ACTCATGCAG GTATGAATAC AACTCTAGAA TGTTTAAGTA ATGCAGTACC AATGGTTGCT ATTCCTATTG CTAACGATCA ACCAGGAGTA GCGGCACGAA TAGCTTGGGC TGGAGCTGGA GTAGCGATAA CACTGAAACG TTTAACAGTA CCTCGGTTAC GAACAGCTAT TTCTCAGGTG CTCACACAAC CGTCATATAA GCAAAATGCT TTGAGATTAC AGAAAGCAAT TAAACGAGCA GGTGGAGTCA CTCGTGCTGC TGATATTATT GAACAGGCAG TATCAACAGG TAAACCAGTT TTAACAGGAA CTATATAA
|
Protein sequence | MTHFGLICPA LTGHLNPMLP IGQELKRRGH RVTTIGILDA EAKTLAAGLE FVAYGTEEYS KGSTAEALNH LSKLSGLAAF RYTITLLKDW TNVLLRDAPQ VIKNAGVDAL LIDQASLGES IGDFLDIPFI TICSALVLNQ DENVPHPVSN WKYNPAWWAK LRNRATWSFY QILGKPINKV VAEYRRQWNL PLYSDPNDAY SLLAQISQQP AELEFPRENL PKCFHFTGPY HYSGTREPVS FPWEQLTGKP LIYASMGTIQ NRLVEVFYQI TAACEGLDAQ LVISLGGSAT PESLPNLAGN PLVVEYAPQL EILQKATLTI THAGMNTTLE CLSNAVPMVA IPIANDQPGV AARIAWAGAG VAITLKRLTV PRLRTAISQV LTQPSYKQNA LRLQKAIKRA GGVTRAADII EQAVSTGKPV LTGTI
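
The gene sequence above is printed in 10-base blocks and already reads in the coding direction (it starts with ATG and ends with TAA), even though the gene lies on the minus strand. The sketch below shows one way to strip the block spacing, compute a GC fraction, and translate the cleaned sequence. It uses the standard codon assignments, which translation table 11 shares for all 64 codons (table 11 differs in which start codons it permits). The helper names are hypothetical, and only short fragments copied from this record are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, third base each cycle through T, C, A, G,
# with the third base changing fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N; a trailing partial codon is ignored.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
head = "ATGACTCATT TTGGTCTGAT TTGTCCGGCA"
print(translate(head))           # MTHFGLICPA
print(gc_content("ATGACTCATT"))  # 0.3
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence row (after removing its spaces) is a quick way to confirm that the two rows agree.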