Gene Information |
Locus tag | Tery_2863 |
Symbol | |
ID | 4244934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4464598 |
End bp | 4466469 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 638107912 |
Product | sulfotransferase |
Protein accession | YP_722509 |
Protein GI | 113476448 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
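The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, not something the record states.

```python
# Values copied from the record above.
start_bp = 4464598
end_bp = 4466469

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1872
print(protein_length_aa)  # 623
```

Both results agree with the Gene Length (1872 bp) and Protein Length (623 aa) fields.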
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.211639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.192148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCGA AAAATAATTT AAATTTGAAT ACATCTGAAG AGTATGCAAA CCTTGGTAGT ATATATGTTC AAGAAAATAA TTGGGATTTA GCAATTGAAA ACTACCGTAA AGCTATCATA CTAAAACCTA ATGTTTCATG GTATTACTAT CACTTAGCAC AAGCTTTATC TCAAAAAAAA GATTGGGAAA AAGCTATAAA AAATTATGAT AAAGCAATAG AGTTAGATCC TAATTTTGCT TGGTCTTATT ATAATTTGGC AAATGCGTTA AGTAAGCTAG AAAAATGGGA TGAGGTGGTT AAAGCTTACC AAAATGCAAT CAGAGTCGAT GTAAATTTTT CTTGGTCTTA CTACAATTTA GGTGATGCTT TAATCAAGTT AAAAAAATGG GATGAAGCTA TATATAACTA TCTCTATGCT ACTAAACTTC AGACAGAATT ACCAGGAATT TATAGTAAAC TAGGAGATGC TATAGAAGAA AAACATAAAT CAAGCTTAGA TGGAAAAATT AAGGATTTCT ACAAAGATAT TCAAGATATT AAGCAATATT ATATTGACGA TAAATCACTA CTACTTTTAC AACAAAACCC TGATTTACTT GTACAAGTAG CAGATGGTTT AACTAAAGCG AATCAAATTA ATGGGGCAAT TATTTTATAT AAAATAGCTC TAGATATTAA TCAAGATTAC CTAAAAATTT TTGAAAAACT CAAGCAAGTA TTAGAGAAAA AAAATCAGTT AGAACGAGAA ATATTAGAAG TAAGAAAAGA AATAGAATTA AAGCAAAATA GTAGGTCTTA CTATAATTTA GGTATTGCCC TAACTCGAGA AAAAAAATGG AATGAAGCAG TTATTGCTTA CCGTCAAGGT ATTGAAATTG AACCGGATTT TCATTGGTGG TTTTATCACA ATATATGGGA AGCCTTTGCT AGGGAAGATA AACTAATGGA AATACTCAAT TTTTTTCAGA TGTTTCAGAA AGCTAATCCC GATTCTTTTT GGTCTTATTT AAATATAGGA GAAGCTCTAA CTTGTTTGGG TAAAATTGAT GAAGCTATAC CTTATTATCA AACAGCTTGT TATCAGCAAA CTACAAAAAA ATACCCTAGT TTAGTATCGC AACCATGGAA TTTAGAACAA GTGCAGGGAC CGAATTTTAT AATTATTGGA GTGCAAAAAG GAGGTACTAC TTCCCTTTTT GGTTATCTGA CTCAACACCC ACAAATAATG TCTCCTATTA AAAAAGAAAT TGATTTTTGG TCTTGGAAAT TTAATGAGTC AATTAATTGG TATCTGGCTC ATTTTCCAGT AATTCCAGAT GGAAAAAAAA TCTTGGCTGG GGAAGCTAGT CCTAGTTATT TTAATCATCC TGATGCTGCT AGGAGAATTT ATCAATTTTT TCCAAAAATT AAGTTGATTA TACTTTTGAG AAATCCTGTA GTTCGAGCTA TATCTCAATA TTATACCTGG AGAAGATTCA ACTGGGAAAA CCGATCTTTA GAAGAGGCAA TTGAATCAGA TTTAGATAAG CTAATCAATA ATCCAGAAAA AGTTAATTAT TGGATGGGAG AACAGAATTA TTTGGCAAAG GGAGTATATA TTGAATTTTT AAAAGAATGG ATGAGTTTAT TCCCAAGGGA ACAGTTACTA ATTTTGAAAA GTGAAGATTT TTATGCTGAT CCACAAGCAA TTGTACAGCA AGTTTTAAAG TTTTTAGATT TGCCAAGATA CGAACTATTA GAGTATAAGA ATTATAATCC TGGTAATTAT TCACAAATCG ATCCATTAAT GGATAAAAAA 
TTAAGTAATT ATTTCCAAGT TCATAATCAA AAATTAGAAG AATATTTAGG GATAAAATTT AACTGGGAGT AA
|
Protein sequence | MSSKNNLNLN TSEEYANLGS IYVQENNWDL AIENYRKAII LKPNVSWYYY HLAQALSQKK DWEKAIKNYD KAIELDPNFA WSYYNLANAL SKLEKWDEVV KAYQNAIRVD VNFSWSYYNL GDALIKLKKW DEAIYNYLYA TKLQTELPGI YSKLGDAIEE KHKSSLDGKI KDFYKDIQDI KQYYIDDKSL LLLQQNPDLL VQVADGLTKA NQINGAIILY KIALDINQDY LKIFEKLKQV LEKKNQLERE ILEVRKEIEL KQNSRSYYNL GIALTREKKW NEAVIAYRQG IEIEPDFHWW FYHNIWEAFA REDKLMEILN FFQMFQKANP DSFWSYLNIG EALTCLGKID EAIPYYQTAC YQQTTKKYPS LVSQPWNLEQ VQGPNFIIIG VQKGGTTSLF GYLTQHPQIM SPIKKEIDFW SWKFNESINW YLAHFPVIPD GKKILAGEAS PSYFNHPDAA RRIYQFFPKI KLIILLRNPV VRAISQYYTW RRFNWENRSL EEAIESDLDK LINNPEKVNY WMGEQNYLAK GVYIEFLKEW MSLFPREQLL ILKSEDFYAD PQAIVQQVLK FLDLPRYELL EYKNYNPGNY SQIDPLMDKK LSNYFQVHNQ KLEEYLGIKF NWE
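The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, along with two simple checks. The function names are hypothetical helpers introduced here for illustration, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
def normalize(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """Check whether the sequence ends with a table 11 stop codon."""
    return seq[-3:] in {"TAA", "TAG", "TGA"}


# First two blocks and last two blocks of the gene sequence above.
head = normalize("ATGTCGTCGA AAAATAATTT")
tail = normalize("AACTGGGAGT AA")

print(head)                  # ATGTCGTCGAAAAATAATTT
print(gc_fraction(head))     # 0.25
print(ends_with_stop(tail))  # True
```

Applied to the full gene sequence, `gc_fraction` could be compared against the GC content field reported in the record.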