Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1372 |
Symbol | |
ID | 4245472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2104828 |
End bp | 2106021 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638106545 |
Product | glycosyl transferase family protein |
Protein accession | YP_721156 |
Protein GI | 113475095 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | [TIGR03469] hopene-associated glycosyltransferase HpnB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.704698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCAAA AATTTCTCTT AATAACAGTA ATTTCTAACT TCATAATATG GATATACTTA TTAATATTTC GCGGTAATTT TTGGCTGGCA AACCAACAAC TATTATCAAA ATCAAAAACA AAAATTGAGA ATACGGAAAA TTTGCCATCA ATTTATGTGA TAATTCCTGC TAGAAATGAA GAAAAATTAC TAAAAATTAC CCTAAATTCT TTATTAAATC AAGATTATTC AGGGATATTA AAAATAATAT TAGTAGATGA TCACAGTAAA GATAATACAA TCAATATAGC GAATTCTTTG GCTCAACAAG GTCATAATTC TACAAAGCTA GAAGTTATTT CAGCAGCAGA TTTACCTAGT AATTGGACAG GAAAACTATG GGCAATTAAT GAAGGAATTA ACTATGCAAA AAAACAAACT CCAGCCCCAG ATTATTTTCT ATTAACAGAT GCAGATATTG AACATTTCCC TACTAATATT CGCCAACTTG TTGTCAAAGC AGAACAAGAA AATTTAGCCT TAGTTTCTTT AATGGTAAAA CTACAATGCG AAACAATAGC CGAGAAATTA ATGATTCCCG CATTTGTATT TTTCTTTCAA AAGTTATATC CATTTAAATG GGTAAATAAT CCCCAAAATA CTACTGCAGC TGCTGCTGGA GGTTGCATAT TAGTTCGTCA TAAAAATTTA GATCAAGTTG GAGGAATAGA GGTTATTAAA AATGCTTTAA TAGATGATTG TAATTTAGCT AAAATAGTTA AACAAAAATC CACAAATAAA AATATCTGGT TAGGGCTAAC TAATGATACG AAAAGCCGAC GTTCTTATCC TGATTTAATG AGTATTTGGA ATATGGTAGC TCGTACTGCT TTTACTCAGT TAAATTATTC TCCATTCTTG TTATTAGTAA CAGTAATAGG AATGAAATTA GTTTATTTAA TTCCCTCATT AGGAATAATT TTGGGAGTTA TTTTTGGTTG GTGGCCAGTA GTAGTGATCG CGATCTTAGC AAGATTATTA ATATTTTTAG CTTACTTACC TATTATTAGA TTTTATGGAC TTTCACCAAT ATATGCCATG AGCTTACCCA CTGTTGCTTT GATTTATATA TTAATCACAA TAGATTCAGC TTGGCGACAC TGGCGAGGGC GAGGCGGTTA TTGGAAAGGA CGAGTTAATA CCAGTATATT CTGA
|
Protein sequence | MFQKFLLITV ISNFIIWIYL LIFRGNFWLA NQQLLSKSKT KIENTENLPS IYVIIPARNE EKLLKITLNS LLNQDYSGIL KIILVDDHSK DNTINIANSL AQQGHNSTKL EVISAADLPS NWTGKLWAIN EGINYAKKQT PAPDYFLLTD ADIEHFPTNI RQLVVKAEQE NLALVSLMVK LQCETIAEKL MIPAFVFFFQ KLYPFKWVNN PQNTTAAAAG GCILVRHKNL DQVGGIEVIK NALIDDCNLA KIVKQKSTNK NIWLGLTNDT KSRRSYPDLM SIWNMVARTA FTQLNYSPFL LLVTVIGMKL VYLIPSLGII LGVIFGWWPV VVIAILARLL IFLAYLPIIR FYGLSPIYAM SLPTVALIYI LITIDSAWRH WRGRGGYWKG RVNTSIF
|
| |