Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3434 |
Symbol | |
ID | 4243377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5252486 |
End bp | 5253415 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638108410 |
Product | glycosyl transferase family protein |
Protein accession | YP_723000 |
Protein GI | 113476939 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.134306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.508878 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATCAC CTAAGATACC AGTATCAGTT CTAATTCCTG CTAAGAATGA AGAGAAAAAT TTACCTGCTT GTCTAGAAAG TGTAGCAATA GCAGATGAAA TATTTGTGGT AGATTCTCAA AGTAGTGATC AGTCTACTGA AATAGTTGAA AGTTATGGAG CAAATTTAGT ACAGTTTCAT TTTAATGGAA CTTGGCCGAA AAAGAAAAAT TGGTCTTTGG AGAATTTACA ATTTCGGAAC GAGTGGGTAT TAATAGTTGA TTGTGATGAA CGTATTACTT CAAAACTTTG GGATGAAATT GATAGAGCAA TTCAGGATCC AAATTATAAT GGTTATTATA TCAACCGGAA AGTATTTTTT CTAGGCAAAT GGATTCGTCA TGGAGGGAAA TATCCTGATT GGAACCTCAG ATTATTTAAA CACAAAGAAG GTCGCTACGA AAACCTAAAA ACTGAGAGTG TGCAGAATAT TGGGGATAAT GAGGTACATG AACACGTTAT TTTAACTGGC AATGCAGGCT ATTTAAAAAA TGATATGCTT CACGAAGATT TTCGTGATTT ATTTTGCTGG ATCGAAAGAC ATAACCGTTA TTCTAATTGG GAAGCGCAGG TATATTACAA TGTACTAACT GGTCAAGGTG ATAATGAAAC TATTGGCGGT AGTTTATTTG GAGATGCAGT AAAGCGCAAG CGTTTTCTCA AAAAAATATG GGTGAGACTA CCATTTAAAC CTTTCTTGAG ATTTATTTTA TTCTATTTTA TTCAACTAGG ATTTTTGGAT GGTAAAGCAG GATATATTTA TGGTAGGTTA TTGAGTCAAT ATGAGTATCA AATTGGTGTT AAGCTTTATG AGCTTAAAGA CTGTAATGGT CAGTTGAATG TTGCTCAAAA GAAACGACAA AATACTAATT CTGAACGAGT TAGTTGTTGA
|
Protein sequence | MISPKIPVSV LIPAKNEEKN LPACLESVAI ADEIFVVDSQ SSDQSTEIVE SYGANLVQFH FNGTWPKKKN WSLENLQFRN EWVLIVDCDE RITSKLWDEI DRAIQDPNYN GYYINRKVFF LGKWIRHGGK YPDWNLRLFK HKEGRYENLK TESVQNIGDN EVHEHVILTG NAGYLKNDML HEDFRDLFCW IERHNRYSNW EAQVYYNVLT GQGDNETIGG SLFGDAVKRK RFLKKIWVRL PFKPFLRFIL FYFIQLGFLD GKAGYIYGRL LSQYEYQIGV KLYELKDCNG QLNVAQKKRQ NTNSERVSC
|
| |