Gene Information |
Locus tag | Tery_4770 |
Symbol | |
ID | 4246424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7326380 |
End bp | 7329043 |
Gene Length | 2664 bp |
Protein Length | 887 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638109620 |
Product | sulfotransferase |
Protein accession | YP_724196 |
Protein GI | 113478135 |
COG category | [R] General function prediction only; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats; [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
| |
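The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length_bp(7326380, 7329043)
print(length, protein_length_aa(length))  # 2664 887
```

Both results match the Gene Length (2664 bp) and Protein Length (887 aa) fields of the record.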
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.185793 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAAAA CAATCTCAGC AATTCATTTA AATCAAAAGG CAGAAATCTA TTTAGCTCAA GGAAAATTGG AAGCAGCAAT AACAGCTTGC TATCAAGCAT TAGAAATTGA GCAAAATTTT CCACTTACCT GCAAAATATT GGGAAATATT TTACAAAGAA TGGGTGAGAT AGATAAAGCA AAAGAATGGT ATATAAAAGC TATCAGTCAA CAACCAAATT TGGCGGAAGC TCATGCTAAT TTGGGGAGTA TATATGCACA ACAAAAACAA TGGCATTTAG CAATTGAATG TTACCGGGAA GCTATTGGGA TAAAACCAAA TATTCCTGGT TTTTACCGTA ATTTAGGGAA AATTTGGCAG GAACTAGACA AAGTAGAATT AGCTAGAGAT TGTCAGGAAC AAGCATTGAG TTTAGAAGCA CATTATCCTC AAGCTTCAAA ATATTTAAAA CAGGGAAAAA AGCTCTTAGA AAATGGTGAG AGAGAAGAGG CGATCGCCTA TTTCCAAAAA GCCATAAATT TTAATCCATC TTTGGTAGAT GCTTATCAAA ATTTAGGAGA TATTTCACTG AAAACAAAAG ATTTCAATGA GGCAATAAAT TATTATCAAA AAGCTATTGA GCTAAAACCA GACTTATGGA TAGTTCATTA TAAACTAGGA AAATTATTTC AAGAAATAGG AGAATTAGAT ACAGCTACCA TTGAGTTTAA TCTAGCAATA GAACTGAATC CTAGTTTTAT CTATTCTTAC AAAAATTTAG GAGATATTCT GCACCATAAA AAAGACTTAG ATGTCGCTAA AAATTGCTAT AAAAAAGTTA TAGCAATCCA ATCAGATGTT TGGGATGCTC ATCGAAAAAT AAACGAAATT CTTCTAGCGC AAGAAAGATT AAATGAAGCA ATTATTGGTT GTCAGTTGGT GATAAAAATC AACCCAAAAT TATCTTGGCC TTATAAAATT ATGGGAAATA TTTATACTCA AAATAAAGCA TGGGATAAAG CAATTGTAGC TTATCGTTGT TTTCTAGAAA TAGAATCAGA TAAGGATTGG GTTTACGAAA AATTAGGGGA TGCTTTAAAA GAAAAAGGTC TGATAGATGA AGCTATATAC AGCTACCAAA AAGCTATAGA AATTAATCCC AATAATTACT GGTTTTATTA TAGTTTAGGA AAAGCTCTAT GTAAACTATC TAGATATGAA GAAGCTATAA CAGCTTATCA ACGGGGAATT AAAATTGACC CTAATTTATA TTTTGCTTAC CACAATTTAG GGGTAGCTTT AGTAGAGTTA AAAAGATGGA ATCAAGCTAT AGTTGCCTAC CGTCAGGCAA TTAAAATTAA ACCAGATTCT TATTGGTCTC ATTACAACTT AGGGGAAATT TTTCTCAAAT TACAGGAGTG GGATAAAGCT GTAGAAACTT ACCGCTATGC AATTGAGAAT AATCCTAATT CTCCCTGGTA TTATCAATAT TTAGGAATTG TACTCAGAAA ACAAGGAAAA ATACAAGAAG CGATCGCTTG TTACCGAAAA GCAATAGAAA TAAAACCAGA CTGGCATCGG TTTTACTCTT TATTAGGAGA TATTTTGCTA GAAATAGGTG ATTCAGAGGA GGCGATCTCT TGTTATATAA AAGCAATTAA ATTACAACCA AATGCTACTG CAGCTTATCG ACAATTAAGA GGTATCTATA TTTTTAAATT AGCTCAACTT AGACCTCATC AATTAAATGA ATTAGTTAAA TGTTATCAAG AAGCTATTAA ATTACAGCCA AACTTTCCAG AAGTATATAT AAATTTAGCA 
GACATTCTTA CAGGCAAAGG TGAACTTGAT ACTGCAATTA ATTATTATCA GAAAGCTACC TATAACAAAC TTTTAGTCTC TCATCCAGAA TTTGTCAAAA ATCATTGGGA TTTTCAAGAA TTTGGTCAAC CAAGTTTTGT TATTATTGGC ACAGTTAAAG GGGGAACATC ATCACTTTAT AATTATCTAT GTCATCATCC CAATGTAATT CCTGCTCTAC AAAAAGAAAT CAATTTTTTT AACAATAAAT TTAATCAAGG AATAGATTGG TATTTAGCAC ATTTTCCTCA ACTACCTGAG CAGGGAAAAT TTATAACTGG GGAAGCTACC CCTAACTATA TGTATTCTGA TGAAATAGGA AAAAAGTTAT TAGATAACTT TCCTAAAATT AAAATAATTG CGATTTTAAG AAATCCAGTA GATCGAACAA TTTCTCACTA TTATATGGCT AAAAGATTGG GACAAGAGTC AAAGAAATTT ACAGAATTTG TACCCCAAGA AATGAAATTT CTTAGACGAC TAAACAACAA CTATCAAAAT TATCAAAGAC TAATTAAGGA AATGTCAGCT TATTTCAGGG GAAGCTTATA TATACATTTT CTGAAAAAAT GGATAAATCT TTTTCCTAAA GAACAGCTTT TAATATTGAA AAGTGAAGAT ATGTATGAAA ATCCGGCAGG GACAACTAAA AAAGCTTTTG ATTTTCTAGG TTTACCAAAT TATCAACTAT TAGAATATAA AAAATATTTT CCTGGTTATT ATGCTCCAAT AGATGCTAGT TTGCGTTGTC AAATTGCTGA GTTATTTCAA CCTCATAATC AAAAATTAGA AGAGTCTCTT GGCATAAAAT TTAACTGGGA TTAA
|
Protein sequence | MGKTISAIHL NQKAEIYLAQ GKLEAAITAC YQALEIEQNF PLTCKILGNI LQRMGEIDKA KEWYIKAISQ QPNLAEAHAN LGSIYAQQKQ WHLAIECYRE AIGIKPNIPG FYRNLGKIWQ ELDKVELARD CQEQALSLEA HYPQASKYLK QGKKLLENGE REEAIAYFQK AINFNPSLVD AYQNLGDISL KTKDFNEAIN YYQKAIELKP DLWIVHYKLG KLFQEIGELD TATIEFNLAI ELNPSFIYSY KNLGDILHHK KDLDVAKNCY KKVIAIQSDV WDAHRKINEI LLAQERLNEA IIGCQLVIKI NPKLSWPYKI MGNIYTQNKA WDKAIVAYRC FLEIESDKDW VYEKLGDALK EKGLIDEAIY SYQKAIEINP NNYWFYYSLG KALCKLSRYE EAITAYQRGI KIDPNLYFAY HNLGVALVEL KRWNQAIVAY RQAIKIKPDS YWSHYNLGEI FLKLQEWDKA VETYRYAIEN NPNSPWYYQY LGIVLRKQGK IQEAIACYRK AIEIKPDWHR FYSLLGDILL EIGDSEEAIS CYIKAIKLQP NATAAYRQLR GIYIFKLAQL RPHQLNELVK CYQEAIKLQP NFPEVYINLA DILTGKGELD TAINYYQKAT YNKLLVSHPE FVKNHWDFQE FGQPSFVIIG TVKGGTSSLY NYLCHHPNVI PALQKEINFF NNKFNQGIDW YLAHFPQLPE QGKFITGEAT PNYMYSDEIG KKLLDNFPKI KIIAILRNPV DRTISHYYMA KRLGQESKKF TEFVPQEMKF LRRLNNNYQN YQRLIKEMSA YFRGSLYIHF LKKWINLFPK EQLLILKSED MYENPAGTTK KAFDFLGLPN YQLLEYKKYF PGYYAPIDAS LRCQIAELFQ PHNQKLEESL GIKFNWD
|
| |
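The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The sketch below is one possible way to do this with only the Python standard library. It assumes the displayed gene sequence is already the coding strand (it begins with ATG and ends with TAA, even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are permitted, so the standard codon table is used here. The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last three display blocks of the gene sequence in this record.
print(translate("ATGGGGAAAA CAATCTCAGC AATTCATTTA"))  # MGKTISAIHL
print(translate("GGCATAAAAT TTAACTGGGA TTAA"))        # GIKFNWD
```

The two outputs match the first ten and last seven residues of the protein sequence. Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field.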