Gene Tery_4770 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4770 
Symbol 
ID4246424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7326380 
End bp7329043 
Gene Length2664 bp 
Protein Length887 aa 
Translation table11 
GC content31% 
IMG OID638109620 
Productsulfotransferase 
Protein accessionYP_724196 
Protein GI113478135 
COG category[R] General function prediction only
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats
[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.185793 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAAAA CAATCTCAGC AATTCATTTA AATCAAAAGG CAGAAATCTA TTTAGCTCAA 
GGAAAATTGG AAGCAGCAAT AACAGCTTGC TATCAAGCAT TAGAAATTGA GCAAAATTTT
CCACTTACCT GCAAAATATT GGGAAATATT TTACAAAGAA TGGGTGAGAT AGATAAAGCA
AAAGAATGGT ATATAAAAGC TATCAGTCAA CAACCAAATT TGGCGGAAGC TCATGCTAAT
TTGGGGAGTA TATATGCACA ACAAAAACAA TGGCATTTAG CAATTGAATG TTACCGGGAA
GCTATTGGGA TAAAACCAAA TATTCCTGGT TTTTACCGTA ATTTAGGGAA AATTTGGCAG
GAACTAGACA AAGTAGAATT AGCTAGAGAT TGTCAGGAAC AAGCATTGAG TTTAGAAGCA
CATTATCCTC AAGCTTCAAA ATATTTAAAA CAGGGAAAAA AGCTCTTAGA AAATGGTGAG
AGAGAAGAGG CGATCGCCTA TTTCCAAAAA GCCATAAATT TTAATCCATC TTTGGTAGAT
GCTTATCAAA ATTTAGGAGA TATTTCACTG AAAACAAAAG ATTTCAATGA GGCAATAAAT
TATTATCAAA AAGCTATTGA GCTAAAACCA GACTTATGGA TAGTTCATTA TAAACTAGGA
AAATTATTTC AAGAAATAGG AGAATTAGAT ACAGCTACCA TTGAGTTTAA TCTAGCAATA
GAACTGAATC CTAGTTTTAT CTATTCTTAC AAAAATTTAG GAGATATTCT GCACCATAAA
AAAGACTTAG ATGTCGCTAA AAATTGCTAT AAAAAAGTTA TAGCAATCCA ATCAGATGTT
TGGGATGCTC ATCGAAAAAT AAACGAAATT CTTCTAGCGC AAGAAAGATT AAATGAAGCA
ATTATTGGTT GTCAGTTGGT GATAAAAATC AACCCAAAAT TATCTTGGCC TTATAAAATT
ATGGGAAATA TTTATACTCA AAATAAAGCA TGGGATAAAG CAATTGTAGC TTATCGTTGT
TTTCTAGAAA TAGAATCAGA TAAGGATTGG GTTTACGAAA AATTAGGGGA TGCTTTAAAA
GAAAAAGGTC TGATAGATGA AGCTATATAC AGCTACCAAA AAGCTATAGA AATTAATCCC
AATAATTACT GGTTTTATTA TAGTTTAGGA AAAGCTCTAT GTAAACTATC TAGATATGAA
GAAGCTATAA CAGCTTATCA ACGGGGAATT AAAATTGACC CTAATTTATA TTTTGCTTAC
CACAATTTAG GGGTAGCTTT AGTAGAGTTA AAAAGATGGA ATCAAGCTAT AGTTGCCTAC
CGTCAGGCAA TTAAAATTAA ACCAGATTCT TATTGGTCTC ATTACAACTT AGGGGAAATT
TTTCTCAAAT TACAGGAGTG GGATAAAGCT GTAGAAACTT ACCGCTATGC AATTGAGAAT
AATCCTAATT CTCCCTGGTA TTATCAATAT TTAGGAATTG TACTCAGAAA ACAAGGAAAA
ATACAAGAAG CGATCGCTTG TTACCGAAAA GCAATAGAAA TAAAACCAGA CTGGCATCGG
TTTTACTCTT TATTAGGAGA TATTTTGCTA GAAATAGGTG ATTCAGAGGA GGCGATCTCT
TGTTATATAA AAGCAATTAA ATTACAACCA AATGCTACTG CAGCTTATCG ACAATTAAGA
GGTATCTATA TTTTTAAATT AGCTCAACTT AGACCTCATC AATTAAATGA ATTAGTTAAA
TGTTATCAAG AAGCTATTAA ATTACAGCCA AACTTTCCAG AAGTATATAT AAATTTAGCA
GACATTCTTA CAGGCAAAGG TGAACTTGAT ACTGCAATTA ATTATTATCA GAAAGCTACC
TATAACAAAC TTTTAGTCTC TCATCCAGAA TTTGTCAAAA ATCATTGGGA TTTTCAAGAA
TTTGGTCAAC CAAGTTTTGT TATTATTGGC ACAGTTAAAG GGGGAACATC ATCACTTTAT
AATTATCTAT GTCATCATCC CAATGTAATT CCTGCTCTAC AAAAAGAAAT CAATTTTTTT
AACAATAAAT TTAATCAAGG AATAGATTGG TATTTAGCAC ATTTTCCTCA ACTACCTGAG
CAGGGAAAAT TTATAACTGG GGAAGCTACC CCTAACTATA TGTATTCTGA TGAAATAGGA
AAAAAGTTAT TAGATAACTT TCCTAAAATT AAAATAATTG CGATTTTAAG AAATCCAGTA
GATCGAACAA TTTCTCACTA TTATATGGCT AAAAGATTGG GACAAGAGTC AAAGAAATTT
ACAGAATTTG TACCCCAAGA AATGAAATTT CTTAGACGAC TAAACAACAA CTATCAAAAT
TATCAAAGAC TAATTAAGGA AATGTCAGCT TATTTCAGGG GAAGCTTATA TATACATTTT
CTGAAAAAAT GGATAAATCT TTTTCCTAAA GAACAGCTTT TAATATTGAA AAGTGAAGAT
ATGTATGAAA ATCCGGCAGG GACAACTAAA AAAGCTTTTG ATTTTCTAGG TTTACCAAAT
TATCAACTAT TAGAATATAA AAAATATTTT CCTGGTTATT ATGCTCCAAT AGATGCTAGT
TTGCGTTGTC AAATTGCTGA GTTATTTCAA CCTCATAATC AAAAATTAGA AGAGTCTCTT
GGCATAAAAT TTAACTGGGA TTAA
 
Protein sequence
MGKTISAIHL NQKAEIYLAQ GKLEAAITAC YQALEIEQNF PLTCKILGNI LQRMGEIDKA 
KEWYIKAISQ QPNLAEAHAN LGSIYAQQKQ WHLAIECYRE AIGIKPNIPG FYRNLGKIWQ
ELDKVELARD CQEQALSLEA HYPQASKYLK QGKKLLENGE REEAIAYFQK AINFNPSLVD
AYQNLGDISL KTKDFNEAIN YYQKAIELKP DLWIVHYKLG KLFQEIGELD TATIEFNLAI
ELNPSFIYSY KNLGDILHHK KDLDVAKNCY KKVIAIQSDV WDAHRKINEI LLAQERLNEA
IIGCQLVIKI NPKLSWPYKI MGNIYTQNKA WDKAIVAYRC FLEIESDKDW VYEKLGDALK
EKGLIDEAIY SYQKAIEINP NNYWFYYSLG KALCKLSRYE EAITAYQRGI KIDPNLYFAY
HNLGVALVEL KRWNQAIVAY RQAIKIKPDS YWSHYNLGEI FLKLQEWDKA VETYRYAIEN
NPNSPWYYQY LGIVLRKQGK IQEAIACYRK AIEIKPDWHR FYSLLGDILL EIGDSEEAIS
CYIKAIKLQP NATAAYRQLR GIYIFKLAQL RPHQLNELVK CYQEAIKLQP NFPEVYINLA
DILTGKGELD TAINYYQKAT YNKLLVSHPE FVKNHWDFQE FGQPSFVIIG TVKGGTSSLY
NYLCHHPNVI PALQKEINFF NNKFNQGIDW YLAHFPQLPE QGKFITGEAT PNYMYSDEIG
KKLLDNFPKI KIIAILRNPV DRTISHYYMA KRLGQESKKF TEFVPQEMKF LRRLNNNYQN
YQRLIKEMSA YFRGSLYIHF LKKWINLFPK EQLLILKSED MYENPAGTTK KAFDFLGLPN
YQLLEYKKYF PGYYAPIDAS LRCQIAELFQ PHNQKLEESL GIKFNWD