Gene Tery_2765 details

Gene Information

Locus tag: Tery_2765
Symbol:
ID: 4244798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4284496
End bp: 4287747
Gene Length: 3252 bp
Protein Length: 1083 aa
Translation table: 11
GC content: 43%
IMG OID: 638107824
Product: TPR repeat-containing protein
Protein accession: YP_722421
Protein GI: 113476360
COG category:
COG ID:
TIGRFAM ID:
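
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4284496
END_BP = 4287747
GENE_LENGTH_BP = 3252
PROTEIN_LENGTH_AA = 1083


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4287747 - 4284496 + 1 gives 3252 bp, and 3252 / 3 - 1 gives 1083 aa, matching the listed values.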


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCAA CTAAAGCAAT AGATTCGGAA ATAGCTGTTG AGAGGGTGGT AGGATTTGCC 
CAACAATTCG ACGGTACACA TCTTGATTTA GCTTGTCATG CAGCGTTTCC ACAGACACTT
ACTCCTGATT TGCTCTACCA AATTTGGCTT CGCTTTGTCC CCCAAGCTCC TTGGACGGCA
GTTGCTCGCA TACTTTTATC TCGCTTGTGC CGTGAGGTAG GCTATGAGCT TTATGAAATG
GATGTAAATG TCCGAAATTT GTTGTTGCAA GAGTTGAAGG AAGATAAGCG TTTTGATAAG
CCTCGGTTAA AAGAACTAGC AGATTTCTAC AGTAACTATG TGAAACAGCA ACTTGATGGT
GATGATTTGA GGAGGAGAGA CTTAGGAACG GCTGAGTATT GGATGTCCCT TGCTTGTAGT
CAGCCTAATC AGCTTAATCA TAAACTGGCA CTGGCTATTG AAGAGAGATT GAAGCAGAAA
AACTGGAAGG AGTTGTTTAG ATTTGGGTTA TTTATAGAAA GTTTTCCAAC TGCTTTAGCG
GAATTTGAAC CACCACTGAT TACCTACGCC CGTGGAATGG TGTCTTTTAC GAGTGGAGAT
TTGGAGGGTG CAACAAAACA ATTCTCTCAG CTTTCTAGGT GGGAACGTCA AGTTAAAATT
GCTGGAGTTA GTTTGTCAAT TCCTGATGAA ATTCCTCTAA TTTCTGTTGA GTTGTCTTTC
CTCGAAGAGT TACTAAATAT TGTTTCTGAT AATGATGATA ATCCTCAATG GAAAATTTAT
CCATTTTTGG AAGCAAATCT AGAGAGGTTA AATGAAGATT TAATTGGGTT ATTACAAGAA
TGGTCAACTA ATATACTGTT AAATACGGAA CCAGTTGAAT TACATAGAAT TGGTTCATCT
CTTACTAGAT TTAGCAATTT ATTAGGCAAT TTTACATTGG GAAATATAGC GATAAACTTA
GAAATTGCTA TTACTGGATA TCAGATTGCT TGTCAAATTT TTCGACGAGA AGAGTTTCCT
AAAGAGTGGG GAATTATTCA AAATCATCTC GGCATTGCCT ACAGTAACAG AATAAGAGGA
GACAAAGCCC AGAATATTGA ATCGGCTATT GCTGCATGCC AACAAGCTTT GATGGTGCTT
ACCCAAACTG ACTTCCCCTT TGAATGGGCA GCAACTCAAA ATAGCCTCGG CAATGGGTAC
AGTGAGAGAA TAAGGGGAGA CAAAGCCGAA AATATTGAAG TTGCCATTGC TGCATACGAA
CAAGCTTTGC TGGTGTACAC CCAAACTGAC TTCCCCATGG ACTGGGCAAT GACTCAAAAT
AATCTCGGCA ATGCCCACAG AGACAGAATA AGGGGAGACA AAGCCCAAAA TATTGAAGCT
GCCATTGCTG CATACCAACA AGCTTTGCTG GTGTACACCC AAACTGACTT CCCCATGGAC
TGGGCAATGA CTCAAAATAA TCTCGGCGCT GCCTACAGTG ACAGAATAAG GGGAGACAAA
GCCGAAAATA TTGAAGCTGC CATTGCTGCA TACCAACAAG CTTTGCTGGT GTACACCCAA
ACTGACTTCC CCATGGACTG GGCAAATACT CAAAATAATC TCGGCATTGC CTACAGAAAC
AGAATAAGGG GAGACAAAGC CGAAAATATT GAAGCCGCCA TTGCTGCATA CCAACAAGCT
TTGCTGGTGT ACACCCAAAC TGACTTCCCC ATCAACTGGG CAATGACTCA AAATAATCTC
GGCAATGCCT ACAGTAACAG AATAAGGGGA GACAAAGCCG AAAATATTGA AGCTGCCATT
GCTGCATACC AACAAGCTTT GCTGGTGCGC ACCCAAACTG ACTTCCCCAT CAACTGGGCA
ATGACTCAAA ATAATCTCGG CAATGCCTAC AGAGACAGAA TAAGGGGAGA CAAAGCCGAA
AATATTGAAG CCGCCATTGC TGCATACAAA CGAGCTTTGC AGGTAAGCAC CCAAACTGAC
TTCCCCATCG ACTGGGCCGG AACTCAAAAT AATCTCGGCA ATGCCTATAG TGACAGAATA
AGGGGAGACA AAGCCGAAAA TATTGAAGCT GCCATTGCTG CATTCCAACA AGCTTTGCTG
GTGTACACCC AAACTGACTT CCCCATGGAC TGGGCAACAA CTCAAAATAA TCTCGGCAAT
GCCTACAGTG ACAGAATAAG GGGAGACAAA GCCGAAAATA TTGAAGCTGC CATTGCTGCA
TACCAACAAG CTTTGCTGGT GCGCACCCAA ACAGACTTCC CCATGGACTG GGCAGGAACT
CAATATAATC TCGGCATTGC CTACAGTGAC AGAATAAGGG GAGACAAAGC CGAAAATATT
GAAGCTGCCA TTGCTGCATA CCAACAAGCT TTGCTGGTGC GCACCCAAAC AGACTTCCCC
ATGGACTGGG CAACAACTCA AAATAATCTC GGCAATGCCT ACAGTGACAG AATAAGGGGA
GACAAAGCCG AAAATATTGA AGCTGCCATT GCTGCATACC AACAAGCTTT GCTGGTGTAC
ACCCAAACTG ACTTCCCCAT GGAATGGGCA ACAATTCAAA ATAATCTCGG CAATGCCTAC
AGTAACAGAA TAAGGGGAGA CAAAGCCGAA AATATTGAAG CTGCCATTGC TGCATACCAA
CAAGCTTTGC TGGTGCGCAC CCAAACAGAC TTCCCCATGG ACTGGGCAAC AACTCAAAAT
AATCTCGGCG CTGCCTACAT TTACAGAATA AGGGGAGACA AAGCCGAAAA TATTGAAGCT
GCCATTGCTG CATACGAACA AGCTTTGCTG GTGTACACCC AAACTGACTT CCCCATGGAA
TGGGCAACAA TTCAAAATAA TCTCGGCAAT GCCTACAGAA ACAGAATAAG GGGAGACAAA
GCGGAAAATA TTGAAGCCGC CATTGCTGCA TACGAACAAG CTTTGCTGGT GTACACCCAA
ACTGACTTCC CCATGGAATG GGCAACAATT CAAAATAATC TCGGCAATGC CTACAGAAAA
ATTAATCAAG ACCTAGCACA AGCAGCTAAA GATATTAAAA ATCTACTCAA TCAACTTTCA
GAAGATTATC CTAACGATAG TTATAGAGTT TTAAGTGCTA AAGCTATGGA TGAAGTTGAT
AAAAATCCTC AGTTAAAATA TCGAATTATC CGAGGATTAA AAGCAGGAGG TTTAGCAGCT
TTAGAAAAAA TGATTGATCA TCCTGTTGCT CAGTTCCTTA TTGAAGGTGT AAAAGAAGTA
TTAAATCCTT GA
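
The gene sequence above is printed in groups of 10 bases, 60 per line. A minimal sketch of how such a listing might be joined into one string and summarized is shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last lines of the listing are included here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First and last lines of the gene sequence listing, copied verbatim.
first_line = "ATGAATTCAA CTAAAGCAAT AGATTCGGAA ATAGCTGTTG AGAGGGTGGT AGGATTTGCC"
last_line = "TTAAATCCTT GA"

if __name__ == "__main__":
    head = clean_sequence(first_line)
    tail = clean_sequence(last_line)
    print(len(head))              # bases on one full line
    print(head.startswith("ATG"))  # start codon at the 5' end
    print(tail.endswith("TGA"))    # stop codon at the 3' end
```

Passing the full listing to `clean_sequence` and then to `gc_percent` would give a value to compare against the GC content field in the record.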
 
Protein sequence
MNSTKAIDSE IAVERVVGFA QQFDGTHLDL ACHAAFPQTL TPDLLYQIWL RFVPQAPWTA 
VARILLSRLC REVGYELYEM DVNVRNLLLQ ELKEDKRFDK PRLKELADFY SNYVKQQLDG
DDLRRRDLGT AEYWMSLACS QPNQLNHKLA LAIEERLKQK NWKELFRFGL FIESFPTALA
EFEPPLITYA RGMVSFTSGD LEGATKQFSQ LSRWERQVKI AGVSLSIPDE IPLISVELSF
LEELLNIVSD NDDNPQWKIY PFLEANLERL NEDLIGLLQE WSTNILLNTE PVELHRIGSS
LTRFSNLLGN FTLGNIAINL EIAITGYQIA CQIFRREEFP KEWGIIQNHL GIAYSNRIRG
DKAQNIESAI AACQQALMVL TQTDFPFEWA ATQNSLGNGY SERIRGDKAE NIEVAIAAYE
QALLVYTQTD FPMDWAMTQN NLGNAHRDRI RGDKAQNIEA AIAAYQQALL VYTQTDFPMD
WAMTQNNLGA AYSDRIRGDK AENIEAAIAA YQQALLVYTQ TDFPMDWANT QNNLGIAYRN
RIRGDKAENI EAAIAAYQQA LLVYTQTDFP INWAMTQNNL GNAYSNRIRG DKAENIEAAI
AAYQQALLVR TQTDFPINWA MTQNNLGNAY RDRIRGDKAE NIEAAIAAYK RALQVSTQTD
FPIDWAGTQN NLGNAYSDRI RGDKAENIEA AIAAFQQALL VYTQTDFPMD WATTQNNLGN
AYSDRIRGDK AENIEAAIAA YQQALLVRTQ TDFPMDWAGT QYNLGIAYSD RIRGDKAENI
EAAIAAYQQA LLVRTQTDFP MDWATTQNNL GNAYSDRIRG DKAENIEAAI AAYQQALLVY
TQTDFPMEWA TIQNNLGNAY SNRIRGDKAE NIEAAIAAYQ QALLVRTQTD FPMDWATTQN
NLGAAYIYRI RGDKAENIEA AIAAYEQALL VYTQTDFPME WATIQNNLGN AYRNRIRGDK
AENIEAAIAA YEQALLVYTQ TDFPMEWATI QNNLGNAYRK INQDLAQAAK DIKNLLNQLS
EDYPNDSYRV LSAKAMDEVD KNPQLKYRII RGLKAGGLAA LEKMIDHPVA QFLIEGVKEV
LNP