Gene Information |
Locus tag | Tery_2861 |
Symbol | |
ID | 4244932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4460035 |
End bp | 4462080 |
Gene Length | 2046 bp |
Protein Length | 681 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638107910 |
Product | sulfotransferase |
Protein accession | YP_722507 |
Protein GI | 113476446 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
| |
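The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed gene length, and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4460035, 4462080)
print(length)                  # 2046
print(protein_length(length))  # 681
```

Both results agree with the Gene Length (2046 bp) and Protein Length (681 aa) fields in the record.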
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.392476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAATA ATAAAAAGGT AGAAACAGTA GCTATTAATT TTCATCAATT AGCAGAATCT AGTCTAGCCC AAGGAAAATT GGATGAAGCT TATGCAGCTT GTTTAAAAGC ATTAAATAGC CAACCAGAAT TTGCACCAGC TTACAAAACT ATAGGCAATA TTTTACAGGT CAAAGGAGAT ATAGAGGCAG CCAAAAATTA TTATTTTAAA GCCATAACAA TATTTCCTGA TTTTGCTGAA GCCCATGCTA ATATTGGTAG TATGTACGCT AAACAAAGGG ACTGGGAAAA AGCATTTTTT TATTACCAAA AAGCTATCCA TATTAAGCCT AACTTAGCGA TAGTTTATCG AAATTTAGCG AAAGTCTGCG AGTGTACTGA AAAAGAAGAA TTAGCCACAG AATATACTTA TAAAGCACTT ATCCTAGAAC CGGAATCAGC TACAGCTATA GAGCATTTGA ATATAGGAAA AAAATTATTA GAATTAAACA AAATAGAGGC AGCAATTAAA TGTTACCGTA ATGCTGTTAA AATTAATCCT AATTTGTCAG CAGGATATCA AAATTTAGGA GAATTGTTGG TAAAAAATGG GGAATTAGAG TCAGCATTAA TAGCTTTACG CGAAGGAATT AAGATAGATG CTAAAAACCC TAGATGTTAC TACTTACTCG GGGAAGTTTG GCAAAAACAG GGACAATATA AGTTAGCAAT TTCAGATTAT AGTCGTGCTA TAGAATTAAA ACCAGAAAAT CATTTATTCC ACAAAAAATT AGGAGATGTT TGGGAAAAAA TGGGTAAGCT AGATGTGGCA ATATCCTGTT ACGAAAAAGC TATAGAAATA AATCCAAATT TCTTTTGGAG TTACCATAGT TTAGGTAATG TCTATACTAA ACAACAAAAA TGGGATAAAG CGATCGCTGC TTACGACAAA GCAACTATTA TTAACCCAAA TTTCTCTAAT ACATACTATA ATTTAGCTGA TGCTTTTTTA CACAATTCTC AAAAAGAGGA AGCTATTATT ACTTACTTAG AAGCTATTAG ACTTAGACCA GAACATTCTT GGTATTCTCA TCATTCAGTA TTATGGAAAC ACTTACTAAA AAGTCGGCTT GAGGAAGTAT TAAATTTATA TCAAGATGCC ACAAAAAAAG AGCCAAATAG TATTTTGTGT CATCTTAACC TAGGAGAAAT TTTTACAGAA AAAGGAAATA TAAAAGAAGC AATTAACAGC TATCAAACAG CTTGTTACAA CAAAACAAAA AAATCAAATC CTGCCTTTGT CGACAAGTAT TGGAACTTTG ATAATGTAGC GCGTCCTAAT TTCATTATTA TTGGTTCTCA AAAAAGTGGT ACGACTTCTT TAGCAAGTTA TATTAGTCAA CACCCCCAAG TATTACCAGC TATTAAGAAA GAAACCCATT TTTGGTCACG GGAATTTAAT CAAGGAATAG ATTGGTATCT GGCTCATTTT CCTCCCATTC CTAAGTCGCA AAATTTGATT ACTGGGGAAG CTACTCCTAA TTATTTAGTC ACTGATAAAA TTCCAGAAAG AATCTATAGT TTACTGCCTA ATATTAAACT ATTGGTGATT TTAAGAAATC CAGTAGATAG AGCTTTTTCT CAATATCATC ATTGGCAGAG ATTAAACTGG GAAGACCGCT CTTTTGAAGT TGCAATTAAT CAGGAATTAG AAATACTGAA AACTACTCCT AAACAACCCC AAGGAGATAG AAAATATTGG CGACTATCAG GAAATTATAT AGGGAGAGGT GTTTATATAG AATTTATACA GAAATGGATG 
GGATTATTTC CTAAGAAACA ATTTTTAATT TTGAGAGGAG AAGACCTTTA TCAAACGCCC GATAATACCA TGAAGCAAGT ATTTGATTTT TTAGGTTTGC CAGAACATAA ACTGGCAAAA TATAAAAAGT TAAATTCTGG TTCTTATACA CCAATTTCTG ATTTGCTGCG TCAAAGATTA TCTAAATATT TTCAACCTCA TAATCAGAGA TTAGAAGAGT ATTTGGGTAT AAAGTTTAAT TGGTAA
|
Protein sequence | MSNNKKVETV AINFHQLAES SLAQGKLDEA YAACLKALNS QPEFAPAYKT IGNILQVKGD IEAAKNYYFK AITIFPDFAE AHANIGSMYA KQRDWEKAFF YYQKAIHIKP NLAIVYRNLA KVCECTEKEE LATEYTYKAL ILEPESATAI EHLNIGKKLL ELNKIEAAIK CYRNAVKINP NLSAGYQNLG ELLVKNGELE SALIALREGI KIDAKNPRCY YLLGEVWQKQ GQYKLAISDY SRAIELKPEN HLFHKKLGDV WEKMGKLDVA ISCYEKAIEI NPNFFWSYHS LGNVYTKQQK WDKAIAAYDK ATIINPNFSN TYYNLADAFL HNSQKEEAII TYLEAIRLRP EHSWYSHHSV LWKHLLKSRL EEVLNLYQDA TKKEPNSILC HLNLGEIFTE KGNIKEAINS YQTACYNKTK KSNPAFVDKY WNFDNVARPN FIIIGSQKSG TTSLASYISQ HPQVLPAIKK ETHFWSREFN QGIDWYLAHF PPIPKSQNLI TGEATPNYLV TDKIPERIYS LLPNIKLLVI LRNPVDRAFS QYHHWQRLNW EDRSFEVAIN QELEILKTTP KQPQGDRKYW RLSGNYIGRG VYIEFIQKWM GLFPKKQFLI LRGEDLYQTP DNTMKQVFDF LGLPEHKLAK YKKLNSGSYT PISDLLRQRL SKYFQPHNQR LEEYLGIKFN W
|
| |
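The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes GC content, and translates a coding sequence. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; table 11 shares these
# amino acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = "ATGTCTAATA ATAAAAAGGT AGAAACAGTA"
print(translate(prefix))  # MSNNKKVETV
print(gc_percent(prefix))
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content field, though the exact rounding used by the database is not stated.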