Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4043 |
Symbol | |
ID | 4242071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6247182 |
End bp | 6248372 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638108949 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_723530 |
Protein GI | 113477469 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.121484 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.172478 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTC TAGTATTAAG TTGGGAGTTC CCACCGAGAA TTATAGGAGG AATAGCACGT CATGTTGCAG AACTATACCC AGAATTGGTG AAATTAGGAC ATGAAGTTCA TTTAGTTACA GTCAAGTCTG GTGAAGCACC AATGTATGAA ATTGTAGAGG GAATAGAAGT CTACAGAGTG CCAGTTGGAC CAAGCCACAA CTTTTTTCAT TGGATAGGAA ATATGAATGA AGCTATGGGT CGTTATGGAG GAAAACTAAT TAAAGAAGAG AAAAATTTTG ATATAATTCA TGCCCATGAT TGGCTAGTTG CAGATGCGAC TATCGCTCTT AAACATATCT TTAAACTACC ACTAATAGCT ACTATTCATG CCACAGAAAA TGGTCGCCAT AATGGTATTC ATAATCGTAG CCAACAGTAT ATTCATGAAA AGGAAAAAGA GTTAATTTAT AATGCTTGGA GAGTGATTGT TTGCTCAAAC TATATGCGAG GAGATGTAAC AAGAACTTTA GATAGTCCTT GGGACAAAAT AGACGTAATT TATAATGGAA TTTGTCCTGA AAAAAAACCT ACTCTAAATC AGTTTGATTA TCTACATTTC CGTCGGCATT TTGCAACAGA TGAAGAAAAA ATTGTTTACT ACTTAGGTAG AATGACTCCA GAAAAAGGTT TGTCAGTGTT AATTCATGCA GCACCTAGAG TAATTGAAGA AATGGGAGAT AGGATAAAAT TTATTATGAT TGGTGGTGGC AAAACTGACT ATTGGAAACA GGAAATCTGG AATTTAGGAA TTTCGGAAAG ATTCTATTTC ACAGGGTTTA TGTCTGAGGA AAAATTAGAT AAATTCCAGG CGATCGCAGA TTGTGCAGTA TTTCCTAGCT TATACGAACC GTTTGGTATT GTTGCCCTAG AAAGTTTTGC AGCAAGGGTG CCAGTGGTGG TTTCAGATAC CGGTGGTTTG CCAGAAGTGG TAGAACATGG TAAAACGGGT ATTGTTACTA AAGTTGGTAA TCCTACTTCT CTAGCATTGG GTATTCTAGA AGTTTTGAAA GGCCGTAGCT TTGTCAAAGA GTTGGTGAAT AATGCTTATC AGGAATTAGA GAATAAATTT TGCTGGGGTA AAATAGCAAA ACAAACTGAT AGAGTGTATC ACAGAGTACT AGCAGAAAGG ACGCAAGTGA CTTGGAAATA G
|
Protein sequence | MKILVLSWEF PPRIIGGIAR HVAELYPELV KLGHEVHLVT VKSGEAPMYE IVEGIEVYRV PVGPSHNFFH WIGNMNEAMG RYGGKLIKEE KNFDIIHAHD WLVADATIAL KHIFKLPLIA TIHATENGRH NGIHNRSQQY IHEKEKELIY NAWRVIVCSN YMRGDVTRTL DSPWDKIDVI YNGICPEKKP TLNQFDYLHF RRHFATDEEK IVYYLGRMTP EKGLSVLIHA APRVIEEMGD RIKFIMIGGG KTDYWKQEIW NLGISERFYF TGFMSEEKLD KFQAIADCAV FPSLYEPFGI VALESFAARV PVVVSDTGGL PEVVEHGKTG IVTKVGNPTS LALGILEVLK GRSFVKELVN NAYQELENKF CWGKIAKQTD RVYHRVLAER TQVTWK
|
| |