Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4954 |
Symbol | |
ID | 4246608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 7547130 |
End bp | 7548305 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638109765 |
Product | glycosyl transferase family protein |
Protein accession | YP_724341 |
Protein GI | 113478280 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00767556 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCC TTGTAGAAAT ACTTCAAATA ATTTGCCTCA TACCCATCAT TAGTGGGTGT ATATATTTAA TCTTAAGTGT CTGGACAATT AGAAATTTTT TCAGAAAAAC CACTACAGAA ATAAATGCCA CAGAAAAATT TCAGCCTCCT GTAACTGTAC TTAAACCAAT ACGGGGGATC GAAAAAAACC TGAAGTCAAA TCTGCATACT ATTACTATTC AAGATTGGCC AGAGTATCAA GTAATTTATT CTATTCAAGA CCCTCAAGAT TCAGCTCTCC CTATTCTTGA CGAGCTTCAA GCAGAAGTAG ACAACCAGAA AATTTCCGTT GTTATTGACA ATAAACAAGC AGGAGCTAAT GGCAAGGTTA ATAACTTACT TGGTGCAATA GCACAAGCAC GTCATCAGAT TATTATTATT AGCGATAGTG ATACTAATCT TAAACCTGAC TATATCAAAA ATATCATATC TCCTTTATCA AATCCTAATG TGGGAGCTGT CTGCACTCTC TTTAAAGTCA AAAGTGCTTA TAGATGGTTT GAGAAGATGG AATTGTTAAC AATAAATGCT GACTTTATTC CTAGTGTTAT ATTCGCAGCA GTCACGGGAG CATCCAATGC TTGTTTGGGA CCCTCGATCG CTATAAGTCG CAGCACATTA CAAGAACTGG GTGGCCTTGA GAGTCTGGCA GATTATCTTG TAGAAGATTA TGAATTAGGA CGGAGGGTAT GGACTTCTGG AAAAAAAATG GTGCTTTTGC CATATACTAT TGATGTGACT GTAGACTTAA AGAATTGGCA AGAGTGGTGG ACTCATCAAG TCTATTGGGA TCAGAATACA TATTTGGCAC GTCCCTGGCC TTTTATTGCA ACTATATTAA TCCGGGCAGT ACCTTTTGCT ATTTTGTTCG CTATAGTGAG AATGGGGGAT TTACTCGGAT TAGGAGTATT AGGGTTTACT TTGGCTTTAC GACTTTTCAG CGCTGGGATA ACTTTAAAAG AGTTGAAAGA TGTAGAAGGT TTTCAAAGTC TTTACTTATT ACCTTTACGT GACACTTTCG GTTTAATATT TTGGTTTTTG GCGTTGACTA AGCGTACAGT GGTATGGAGG GGTGTTAAAT ACAAATTGGT CGATCATGGA AAAATGGTTC CCGTCAGCAA AGGAGTAGGG AATTAG
|
Protein sequence | MPTLVEILQI ICLIPIISGC IYLILSVWTI RNFFRKTTTE INATEKFQPP VTVLKPIRGI EKNLKSNLHT ITIQDWPEYQ VIYSIQDPQD SALPILDELQ AEVDNQKISV VIDNKQAGAN GKVNNLLGAI AQARHQIIII SDSDTNLKPD YIKNIISPLS NPNVGAVCTL FKVKSAYRWF EKMELLTINA DFIPSVIFAA VTGASNACLG PSIAISRSTL QELGGLESLA DYLVEDYELG RRVWTSGKKM VLLPYTIDVT VDLKNWQEWW THQVYWDQNT YLARPWPFIA TILIRAVPFA ILFAIVRMGD LLGLGVLGFT LALRLFSAGI TLKELKDVEG FQSLYLLPLR DTFGLIFWFL ALTKRTVVWR GVKYKLVDHG KMVPVSKGVG N
|
| |