Gene Tery_2861 details


Gene Information

Locus tag: Tery_2861
Symbol:
ID: 4244932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4460035
End bp: 4462080
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 32%
IMG OID: 638107910
Product: sulfotransferase
Protein accession: YP_722507
Protein GI: 113476446
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
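
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4460035
end_bp = 4462080
gene_length_bp = 2046
protein_length_aa = 681


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the fields are mutually consistent: 4462080 - 4460035 + 1 = 2046 bp, and 2046 / 3 - 1 = 681 aa.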


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.392476
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAATA ATAAAAAGGT AGAAACAGTA GCTATTAATT TTCATCAATT AGCAGAATCT 
AGTCTAGCCC AAGGAAAATT GGATGAAGCT TATGCAGCTT GTTTAAAAGC ATTAAATAGC
CAACCAGAAT TTGCACCAGC TTACAAAACT ATAGGCAATA TTTTACAGGT CAAAGGAGAT
ATAGAGGCAG CCAAAAATTA TTATTTTAAA GCCATAACAA TATTTCCTGA TTTTGCTGAA
GCCCATGCTA ATATTGGTAG TATGTACGCT AAACAAAGGG ACTGGGAAAA AGCATTTTTT
TATTACCAAA AAGCTATCCA TATTAAGCCT AACTTAGCGA TAGTTTATCG AAATTTAGCG
AAAGTCTGCG AGTGTACTGA AAAAGAAGAA TTAGCCACAG AATATACTTA TAAAGCACTT
ATCCTAGAAC CGGAATCAGC TACAGCTATA GAGCATTTGA ATATAGGAAA AAAATTATTA
GAATTAAACA AAATAGAGGC AGCAATTAAA TGTTACCGTA ATGCTGTTAA AATTAATCCT
AATTTGTCAG CAGGATATCA AAATTTAGGA GAATTGTTGG TAAAAAATGG GGAATTAGAG
TCAGCATTAA TAGCTTTACG CGAAGGAATT AAGATAGATG CTAAAAACCC TAGATGTTAC
TACTTACTCG GGGAAGTTTG GCAAAAACAG GGACAATATA AGTTAGCAAT TTCAGATTAT
AGTCGTGCTA TAGAATTAAA ACCAGAAAAT CATTTATTCC ACAAAAAATT AGGAGATGTT
TGGGAAAAAA TGGGTAAGCT AGATGTGGCA ATATCCTGTT ACGAAAAAGC TATAGAAATA
AATCCAAATT TCTTTTGGAG TTACCATAGT TTAGGTAATG TCTATACTAA ACAACAAAAA
TGGGATAAAG CGATCGCTGC TTACGACAAA GCAACTATTA TTAACCCAAA TTTCTCTAAT
ACATACTATA ATTTAGCTGA TGCTTTTTTA CACAATTCTC AAAAAGAGGA AGCTATTATT
ACTTACTTAG AAGCTATTAG ACTTAGACCA GAACATTCTT GGTATTCTCA TCATTCAGTA
TTATGGAAAC ACTTACTAAA AAGTCGGCTT GAGGAAGTAT TAAATTTATA TCAAGATGCC
ACAAAAAAAG AGCCAAATAG TATTTTGTGT CATCTTAACC TAGGAGAAAT TTTTACAGAA
AAAGGAAATA TAAAAGAAGC AATTAACAGC TATCAAACAG CTTGTTACAA CAAAACAAAA
AAATCAAATC CTGCCTTTGT CGACAAGTAT TGGAACTTTG ATAATGTAGC GCGTCCTAAT
TTCATTATTA TTGGTTCTCA AAAAAGTGGT ACGACTTCTT TAGCAAGTTA TATTAGTCAA
CACCCCCAAG TATTACCAGC TATTAAGAAA GAAACCCATT TTTGGTCACG GGAATTTAAT
CAAGGAATAG ATTGGTATCT GGCTCATTTT CCTCCCATTC CTAAGTCGCA AAATTTGATT
ACTGGGGAAG CTACTCCTAA TTATTTAGTC ACTGATAAAA TTCCAGAAAG AATCTATAGT
TTACTGCCTA ATATTAAACT ATTGGTGATT TTAAGAAATC CAGTAGATAG AGCTTTTTCT
CAATATCATC ATTGGCAGAG ATTAAACTGG GAAGACCGCT CTTTTGAAGT TGCAATTAAT
CAGGAATTAG AAATACTGAA AACTACTCCT AAACAACCCC AAGGAGATAG AAAATATTGG
CGACTATCAG GAAATTATAT AGGGAGAGGT GTTTATATAG AATTTATACA GAAATGGATG
GGATTATTTC CTAAGAAACA ATTTTTAATT TTGAGAGGAG AAGACCTTTA TCAAACGCCC
GATAATACCA TGAAGCAAGT ATTTGATTTT TTAGGTTTGC CAGAACATAA ACTGGCAAAA
TATAAAAAGT TAAATTCTGG TTCTTATACA CCAATTTCTG ATTTGCTGCG TCAAAGATTA
TCTAAATATT TTCAACCTCA TAATCAGAGA TTAGAAGAGT ATTTGGGTAT AAAGTTTAAT
TGGTAA
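
The gene sequence above is printed in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how a GC fraction could be computed from such text; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first block of the sequence, not the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First 10-base block of the gene sequence, used only as a small sample.
sample = clean_sequence("ATGTCTAATA")
print(gc_fraction(sample))  # 0.2
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.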
 
Protein sequence
MSNNKKVETV AINFHQLAES SLAQGKLDEA YAACLKALNS QPEFAPAYKT IGNILQVKGD 
IEAAKNYYFK AITIFPDFAE AHANIGSMYA KQRDWEKAFF YYQKAIHIKP NLAIVYRNLA
KVCECTEKEE LATEYTYKAL ILEPESATAI EHLNIGKKLL ELNKIEAAIK CYRNAVKINP
NLSAGYQNLG ELLVKNGELE SALIALREGI KIDAKNPRCY YLLGEVWQKQ GQYKLAISDY
SRAIELKPEN HLFHKKLGDV WEKMGKLDVA ISCYEKAIEI NPNFFWSYHS LGNVYTKQQK
WDKAIAAYDK ATIINPNFSN TYYNLADAFL HNSQKEEAII TYLEAIRLRP EHSWYSHHSV
LWKHLLKSRL EEVLNLYQDA TKKEPNSILC HLNLGEIFTE KGNIKEAINS YQTACYNKTK
KSNPAFVDKY WNFDNVARPN FIIIGSQKSG TTSLASYISQ HPQVLPAIKK ETHFWSREFN
QGIDWYLAHF PPIPKSQNLI TGEATPNYLV TDKIPERIYS LLPNIKLLVI LRNPVDRAFS
QYHHWQRLNW EDRSFEVAIN QELEILKTTP KQPQGDRKYW RLSGNYIGRG VYIEFIQKWM
GLFPKKQFLI LRGEDLYQTP DNTMKQVFDF LGLPEHKLAK YKKLNSGSYT PISDLLRQRL
SKYFQPHNQR LEEYLGIKFN W