Gene Tery_4958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4958 
Symbol 
ID4246612 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7553468 
End bp7558333 
Gene Length4866 bp 
Protein Length1621 aa 
Translation table11 
GC content44% 
IMG OID638109769 
ProductTPR repeat-containing protein 
Protein accessionYP_724345 
Protein GI113478284 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAAAA AACCCAGAGA AGCATATATC AACCTCATCC AGCAACTGCT AAGTTCCCCC 
AACGGTGAGG TAGAAAAAAT CCTCGAAGCC AACCAAGAGT TGGTAGATGA AGGGTTACTC
GAAACAATGG AACTTTGTGC ACAACAATTC GCAGCAGCTC CTAATTTTTT ACGCCATCTC
AGGAGTCAGC TTGCAGAAAT ACTAGAAACT TCAGAATATT CTCCCTCAAC TCATTACTCA
TCGGCAGAAT ATTTAGACTT TTTGATGGAA GTATTGCAGG CAACTGCAGA AAGTAATGGT
GACTCGAAAG TTGTCTACCC ACTCCTACAG CAAAACTTAG ATAAACTTGA TGATCACTTT
GCTGAGATAT TGCAAAACTG GGCAACTGCT AAATTTTCTG AAGTGGAGGC GGATCTAGCA
AAAGACATTG CTATAGATAT TGGTGAGTTC AGTAACCTGA TCTGGCAATT TCCTCTGGGC
AGCAAAGCCA ACAATATGGA AATTAGTATT GCCGGTTGTG AAGTAGTGCT AAAAGTTTTG
ACATATAAGA GTCACCGGAA AAGTTGGGCG GCTATTCAAA ATAATCTCGG TATTGACTAC
ACTCAGAGAA TAAGGGGAGA CCAAGCCGAA AATATTGAAG TCGCTATTGC TGCATACGAA
CAAGCTTTGC TGGTGCGGAC CCAAACAGAC TTCCCCATGG ACTGGGCAAT GACTCAAAAT
AATCTCGGCA TTGCCTACAC TCAGAGAAGA AAGGGAAACA AAGCTCAAAA TATTGAAGCG
GCCGTTGGTG CATTCCAACA AGCTTTGCTG GTGCGGACCC AAACAGACTT CCCCATGGAC
TGGGCAATGA CTCAAAATAA TCTCGGCATT GCCTACAGAA ACAGAATAAG GGGAAAGAAA
CCCGAAAATA TTGAAGCGGC TATTGGTGCA TTCCAAAAAG CTTTGCAGGT GTACAGCCAA
ACAGACTTCC CCATGGACTG GGCAACAACT CAAAATAATC TCGGCAATGC CTACACTCAG
AGAAGAAAGG GAGACAAAGC CGAAAATATT GAAGCGGCTA TTGGTGCATT CCAACAAGCT
TTGCTGGTAT ATACCCAAAC TGACTTCCCC ATTGACTGGG CAATGACTCA AAATAATCTT
GGCTATGCCT ATACTCAGAG AAAAAGGGGA GACCAAGCCG AAAATATTGA AGCGGCTATT
GGTGCATTCC AACAAGCTTT GCTGGTGCGG ACCCAAACAG ACTTCCCCAT GGACTGGGCA
ATGACTCAAA ATAATCTCGG TATTGGCTAC ACTCAGAGAA AAAGAGGAGA CCAAGCCGAA
AATATTGAAA CTGCCATTGC TGCCTACGAA CAAGCTTTGC AGGTGCGGAC CCAAACAGAC
TTCCCCATGG ACTGGGCAGA AACTCAAAAT AATCTCGGCG TTGCCTACAG TAAAAGAAGA
AGGGGAGACA AAGCCCAAAA TATTGAAGTG GCTATTGCTG CATACGAACA AGCTTTGCTG
GTACACACCC AAACTGACTT TCCCCTCCAC TGGGCAACAA CTCAAAATAA TCTCGGCATT
GCCTACAGAA ACAGAATAAG GGGAGACAAA GCCCAAAATA TTGAAGTGGC TATTGCTGCA
TACGAACAAG CCTTGCTGGT ACACACCCAA ACAGACTTTC CCATGGAATG GGCACAAACT
CAAAATAATC TCGGCATTGC TTACAGAAAC AGAAGAAGGG GAGACAAAAC CCAAAATATT
GAAGCCGCCA TTGCTGCATA CGAACAAGCT TTGCTGGTAC ACACCCAAAC TGACTTTCCC
ATAGAATGGG CAGCAACTCA ACAAAATATC GGCGTTGCCT ACAAAGACAG AATAATGGGA
GACCAAGCCC AAAATATTGA AGCCGCCATT ACTGCATACA AACAAGCTTT GCAGGTACAC
ACCCAAACAG ACTTTCCCAT AGAATGGGCA GCAACTCAAC AAAATATCGG CGTTGCCCAC
AAAGACAGAA TAATGGGAGA CCAAGCCCAA AATATTGAAG CCGCCATTGC TGCATACAAA
CAAGCTTTGC AGGTGCACAC CCAAACAGAC TTTCCCATGG AATGGGCACA AACTCAAAAT
AATCTCGGCT CTGCCTACAC TCAGAGAATA AGGGGAGATA AAGCCCAAAA TATGGAAGCT
GCCATTGCTG CATTCCAAAA AGCCTTGCTG GTGCGCACCC AAACTGACTT CCCGATGAAA
TGGGCAGCAA CTCAAAATAA TCTCGGCTCT GCCTACGCTC AGAGAGTAAT GGGAGACCAA
GCCGAAAATA TTGAAGCCGC CATTGCTGCA TTCCAAAAAG CCTTGCTGGT GCGCACCCAA
AGTGACTTCC CCATGAAATG GGTAGCAACT CTAAATAATT TTGGCTATGC CTGCAGTAAC
AGAATAAGGG GAGACCAAGC CCAAAATATT GAAGCCGCCA TTGCCGCATA CCAACAAGCT
TTGCTGGTGT ACACCCAAAC AGACTTCCCC ATCGACTGGG CAGCAACTCA AAATAATCTC
GGCTCTGCCT ACAGTGACAG AATAAAGGGA GACCAAGCCC AAAATATTGA AGCCGCCATT
GTCGCATACC AACAAGCTTT GCTGGTGTAC ACCCAAACAG ACTTCCCCAT AGACTGGGCA
GCAACTCAAA ATAATCTCGG CTCTGCCTAC AGTCAGAGAA AAAGGGGAGA CCAAGCCCAA
AATATTGAAG CCGCCATTGC CGCATACCAA CAAGCTTTGC TGGTGTACAC CCAAACAGAC
TTCCCCATAG ACTGGGCAAC GACTCAAAAT AATCTCGGCT CTGCCTACAG TCAGAGAATA
AAGGGAGACC AAGCCCAAAA TATTGAAGCC GCCATTGTCG CATACCAACA AGCTTTGCTG
GTGTACACCC AAACAGACTT CCCCATAGAC TGGGCAACGA CTCAAAATAA TCTCGGCTCT
GCCTACAGTC AGAGAATAAA GGGAGACCAA GCCCAAAATA TTGAAGCCGC CATTGTCGCA
TACCAACAAG CTTTGCAAGT GCGCACCCTA GAAGTATATC CTATCGACCA CCTCCAAACC
ACCCGCAACC TAGGCAACCT ATATTTCGAC AACCAAAACT GGCAACTCGC CGCCGACAAC
TATAAAAAAG CCATAACCGC AGTGGAACTC AGCCGGAGTT GGTCAAAAGA AAATGATCGC
CGCCAAGAAA TTATCGAAGA GTCTATAGAT GTCTACCGCA AGATAGTACA AGCTTACGTG
AATAGCGAGC AAATAGAAAA AGCCTTAGAA TATGTAGAGC ATTCTCGCTC CAAACGGCTA
GTAGATATAA TGGCAAGTAA CGATCGCTAC TCCCAAAGTA AAATACCAGG AGAAGTTGAA
GAACCCTTAA AAGAACACGA AGCTATTCAA CAGCAAATTC ATCAATTTTG GGAACAACAA
CAACAAGGAA AACGTCGAAT AGAGCCGAAA TATTTAGCGG TAGCAACTAG GGGGCGAGTT
GCCACAGAAA CCAGAAACAA ACGCATTACC CAATTAGAGA CTCAAAAGCA GGAAGTCTAT
AAAAAAATCC GCAGCTTCGA CCGAGTATTA GCAGAGGGCA TTCAAGTTGC CCCACTGGAA
TTTCCAAAAA TCCAAGCATT AATCCAGGAG CCCATCACAG CCATCTTGAG CTTTTATATT
ATCACCGACA ATACCTATTT ATTTGTATTG CGACAAGATG GGGTTAAAAT TCATATTTAC
AGCGGGTTGG GGGAGAAAGA ACTGCAAAAC TGGATTTGGG AAAAATGGTT TGAGTCTTAC
CTCTCATCCC CGGAAGAATG GCAACAACAG ATGCCTGAAT TTTTACAGGA TGTAAGTAAG
AAGTTAAAAT TAGAGGAATT ATGCAAATCT TATCTTCAAG ACATTGAAGA ATTGATATTA
ATTCCCCATT TATCCTTGCA CTTTCTCCCT TGGAACGCAA TGCCAGTAGC AGAGTCAGGA
GAAAACAAAT ATCTAGGTGA CCGTTTCCGT ATCCGCACCC TGGCCAGTTG CCAAATTCTA
GACTTTTGTA CCCAACGGGA AGAAATTCAA GGGGAAGTCA AACAAGGCAT TGTAGAAGAC
ACTCACAACG ACCTACCTTG TTCCAGTTAT GAAGCGCAAT ATATAGCCCA AATGTATGGA
GTCCCAGAAC ACCAACGCCT GCGAGGTGAA GCCGCAACTA TAGACAGTTA CATACTATTA
TTATCTCAGG TACAACGGTT ATTGTCAACC CATCACAGCG AATCTCGCCT AGACAACTGT
ATGGAATCAG CATTAGTCTT AGCGGACGGC AGACTTACCT TAGGGCAACT ATTATCTCCT
GCCTTCCGTT TCCCAGATCT AGACGAAGTA TTCATCGACT ATTGTGAGAC CAACTTAGGT
CAAGTGCAAA TCTCCGATGA CGTATTAACA TTAAACACAG GATTTTTATG TGCAGGTGCC
AGGGGTGTGA TCAGCAGTTT GTGGTCTGTA GATGACTTAG GAACATGTTT ATTTTCGATT
TTTTATCACC AACTGCGCCA AGAAGGAAAA AATCGTTCTC TGGCATTGCA ACTAGGACAA
CGACAGTTAC GAGAACTAAC AGGCAAAGAA CTCAAGAAGA AATATAAAAA GGAATTAGAA
AAAGCATTAG GGGAGAAGTT AGAAGTCACA TCTAAGCAAC TTCAGGAAAT AGAACCCAGA
CGTGATAGTT ATACCAAAAG TTCCGTGGAA TATCAGGAGT TAGAGGAGGA GCGGGAAAAA
CTTGTAGCTA TTTATGAGCG TATTTTTAAT ACTAAGAATA AGTACCTGAA AGCAGCTTGT
AAAAAACAAC ACCCCTTTGA GCATCCAGCG TATTGGAGTG GGTTTATTTG TGCAGGGTTG
AGTTAG
 
Protein sequence
MDKKPREAYI NLIQQLLSSP NGEVEKILEA NQELVDEGLL ETMELCAQQF AAAPNFLRHL 
RSQLAEILET SEYSPSTHYS SAEYLDFLME VLQATAESNG DSKVVYPLLQ QNLDKLDDHF
AEILQNWATA KFSEVEADLA KDIAIDIGEF SNLIWQFPLG SKANNMEISI AGCEVVLKVL
TYKSHRKSWA AIQNNLGIDY TQRIRGDQAE NIEVAIAAYE QALLVRTQTD FPMDWAMTQN
NLGIAYTQRR KGNKAQNIEA AVGAFQQALL VRTQTDFPMD WAMTQNNLGI AYRNRIRGKK
PENIEAAIGA FQKALQVYSQ TDFPMDWATT QNNLGNAYTQ RRKGDKAENI EAAIGAFQQA
LLVYTQTDFP IDWAMTQNNL GYAYTQRKRG DQAENIEAAI GAFQQALLVR TQTDFPMDWA
MTQNNLGIGY TQRKRGDQAE NIETAIAAYE QALQVRTQTD FPMDWAETQN NLGVAYSKRR
RGDKAQNIEV AIAAYEQALL VHTQTDFPLH WATTQNNLGI AYRNRIRGDK AQNIEVAIAA
YEQALLVHTQ TDFPMEWAQT QNNLGIAYRN RRRGDKTQNI EAAIAAYEQA LLVHTQTDFP
IEWAATQQNI GVAYKDRIMG DQAQNIEAAI TAYKQALQVH TQTDFPIEWA ATQQNIGVAH
KDRIMGDQAQ NIEAAIAAYK QALQVHTQTD FPMEWAQTQN NLGSAYTQRI RGDKAQNMEA
AIAAFQKALL VRTQTDFPMK WAATQNNLGS AYAQRVMGDQ AENIEAAIAA FQKALLVRTQ
SDFPMKWVAT LNNFGYACSN RIRGDQAQNI EAAIAAYQQA LLVYTQTDFP IDWAATQNNL
GSAYSDRIKG DQAQNIEAAI VAYQQALLVY TQTDFPIDWA ATQNNLGSAY SQRKRGDQAQ
NIEAAIAAYQ QALLVYTQTD FPIDWATTQN NLGSAYSQRI KGDQAQNIEA AIVAYQQALL
VYTQTDFPID WATTQNNLGS AYSQRIKGDQ AQNIEAAIVA YQQALQVRTL EVYPIDHLQT
TRNLGNLYFD NQNWQLAADN YKKAITAVEL SRSWSKENDR RQEIIEESID VYRKIVQAYV
NSEQIEKALE YVEHSRSKRL VDIMASNDRY SQSKIPGEVE EPLKEHEAIQ QQIHQFWEQQ
QQGKRRIEPK YLAVATRGRV ATETRNKRIT QLETQKQEVY KKIRSFDRVL AEGIQVAPLE
FPKIQALIQE PITAILSFYI ITDNTYLFVL RQDGVKIHIY SGLGEKELQN WIWEKWFESY
LSSPEEWQQQ MPEFLQDVSK KLKLEELCKS YLQDIEELIL IPHLSLHFLP WNAMPVAESG
ENKYLGDRFR IRTLASCQIL DFCTQREEIQ GEVKQGIVED THNDLPCSSY EAQYIAQMYG
VPEHQRLRGE AATIDSYILL LSQVQRLLST HHSESRLDNC MESALVLADG RLTLGQLLSP
AFRFPDLDEV FIDYCETNLG QVQISDDVLT LNTGFLCAGA RGVISSLWSV DDLGTCLFSI
FYHQLRQEGK NRSLALQLGQ RQLRELTGKE LKKKYKKELE KALGEKLEVT SKQLQEIEPR
RDSYTKSSVE YQELEEEREK LVAIYERIFN TKNKYLKAAC KKQHPFEHPA YWSGFICAGL
S