Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2765 |
Symbol | |
ID | 4244798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4284496 |
End bp | 4287747 |
Gene Length | 3252 bp |
Protein Length | 1083 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638107824 |
Product | TPR repeat-containing protein |
Protein accession | YP_722421 |
Protein GI | 113476360 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCAA CTAAAGCAAT AGATTCGGAA ATAGCTGTTG AGAGGGTGGT AGGATTTGCC CAACAATTCG ACGGTACACA TCTTGATTTA GCTTGTCATG CAGCGTTTCC ACAGACACTT ACTCCTGATT TGCTCTACCA AATTTGGCTT CGCTTTGTCC CCCAAGCTCC TTGGACGGCA GTTGCTCGCA TACTTTTATC TCGCTTGTGC CGTGAGGTAG GCTATGAGCT TTATGAAATG GATGTAAATG TCCGAAATTT GTTGTTGCAA GAGTTGAAGG AAGATAAGCG TTTTGATAAG CCTCGGTTAA AAGAACTAGC AGATTTCTAC AGTAACTATG TGAAACAGCA ACTTGATGGT GATGATTTGA GGAGGAGAGA CTTAGGAACG GCTGAGTATT GGATGTCCCT TGCTTGTAGT CAGCCTAATC AGCTTAATCA TAAACTGGCA CTGGCTATTG AAGAGAGATT GAAGCAGAAA AACTGGAAGG AGTTGTTTAG ATTTGGGTTA TTTATAGAAA GTTTTCCAAC TGCTTTAGCG GAATTTGAAC CACCACTGAT TACCTACGCC CGTGGAATGG TGTCTTTTAC GAGTGGAGAT TTGGAGGGTG CAACAAAACA ATTCTCTCAG CTTTCTAGGT GGGAACGTCA AGTTAAAATT GCTGGAGTTA GTTTGTCAAT TCCTGATGAA ATTCCTCTAA TTTCTGTTGA GTTGTCTTTC CTCGAAGAGT TACTAAATAT TGTTTCTGAT AATGATGATA ATCCTCAATG GAAAATTTAT CCATTTTTGG AAGCAAATCT AGAGAGGTTA AATGAAGATT TAATTGGGTT ATTACAAGAA TGGTCAACTA ATATACTGTT AAATACGGAA CCAGTTGAAT TACATAGAAT TGGTTCATCT CTTACTAGAT TTAGCAATTT ATTAGGCAAT TTTACATTGG GAAATATAGC GATAAACTTA GAAATTGCTA TTACTGGATA TCAGATTGCT TGTCAAATTT TTCGACGAGA AGAGTTTCCT AAAGAGTGGG GAATTATTCA AAATCATCTC GGCATTGCCT ACAGTAACAG AATAAGAGGA GACAAAGCCC AGAATATTGA ATCGGCTATT GCTGCATGCC AACAAGCTTT GATGGTGCTT ACCCAAACTG ACTTCCCCTT TGAATGGGCA GCAACTCAAA ATAGCCTCGG CAATGGGTAC AGTGAGAGAA TAAGGGGAGA CAAAGCCGAA AATATTGAAG TTGCCATTGC TGCATACGAA CAAGCTTTGC TGGTGTACAC CCAAACTGAC TTCCCCATGG ACTGGGCAAT GACTCAAAAT AATCTCGGCA ATGCCCACAG AGACAGAATA AGGGGAGACA AAGCCCAAAA TATTGAAGCT GCCATTGCTG CATACCAACA AGCTTTGCTG GTGTACACCC AAACTGACTT CCCCATGGAC TGGGCAATGA CTCAAAATAA TCTCGGCGCT GCCTACAGTG ACAGAATAAG GGGAGACAAA GCCGAAAATA TTGAAGCTGC CATTGCTGCA TACCAACAAG CTTTGCTGGT GTACACCCAA ACTGACTTCC CCATGGACTG GGCAAATACT CAAAATAATC TCGGCATTGC CTACAGAAAC AGAATAAGGG GAGACAAAGC CGAAAATATT GAAGCCGCCA TTGCTGCATA CCAACAAGCT TTGCTGGTGT ACACCCAAAC TGACTTCCCC ATCAACTGGG CAATGACTCA AAATAATCTC GGCAATGCCT ACAGTAACAG AATAAGGGGA GACAAAGCCG AAAATATTGA AGCTGCCATT GCTGCATACC AACAAGCTTT GCTGGTGCGC ACCCAAACTG ACTTCCCCAT CAACTGGGCA ATGACTCAAA ATAATCTCGG CAATGCCTAC AGAGACAGAA TAAGGGGAGA CAAAGCCGAA AATATTGAAG CCGCCATTGC TGCATACAAA CGAGCTTTGC AGGTAAGCAC CCAAACTGAC TTCCCCATCG ACTGGGCCGG AACTCAAAAT AATCTCGGCA ATGCCTATAG TGACAGAATA AGGGGAGACA AAGCCGAAAA TATTGAAGCT GCCATTGCTG CATTCCAACA AGCTTTGCTG GTGTACACCC AAACTGACTT CCCCATGGAC TGGGCAACAA CTCAAAATAA TCTCGGCAAT GCCTACAGTG ACAGAATAAG GGGAGACAAA GCCGAAAATA TTGAAGCTGC CATTGCTGCA TACCAACAAG CTTTGCTGGT GCGCACCCAA ACAGACTTCC CCATGGACTG GGCAGGAACT CAATATAATC TCGGCATTGC CTACAGTGAC AGAATAAGGG GAGACAAAGC CGAAAATATT GAAGCTGCCA TTGCTGCATA CCAACAAGCT TTGCTGGTGC GCACCCAAAC AGACTTCCCC ATGGACTGGG CAACAACTCA AAATAATCTC GGCAATGCCT ACAGTGACAG AATAAGGGGA GACAAAGCCG AAAATATTGA AGCTGCCATT GCTGCATACC AACAAGCTTT GCTGGTGTAC ACCCAAACTG ACTTCCCCAT GGAATGGGCA ACAATTCAAA ATAATCTCGG CAATGCCTAC AGTAACAGAA TAAGGGGAGA CAAAGCCGAA AATATTGAAG CTGCCATTGC TGCATACCAA CAAGCTTTGC TGGTGCGCAC CCAAACAGAC TTCCCCATGG ACTGGGCAAC AACTCAAAAT AATCTCGGCG CTGCCTACAT TTACAGAATA AGGGGAGACA AAGCCGAAAA TATTGAAGCT GCCATTGCTG CATACGAACA AGCTTTGCTG GTGTACACCC AAACTGACTT CCCCATGGAA TGGGCAACAA TTCAAAATAA TCTCGGCAAT GCCTACAGAA ACAGAATAAG GGGAGACAAA GCGGAAAATA TTGAAGCCGC CATTGCTGCA TACGAACAAG CTTTGCTGGT GTACACCCAA ACTGACTTCC CCATGGAATG GGCAACAATT CAAAATAATC TCGGCAATGC CTACAGAAAA ATTAATCAAG ACCTAGCACA AGCAGCTAAA GATATTAAAA ATCTACTCAA TCAACTTTCA GAAGATTATC CTAACGATAG TTATAGAGTT TTAAGTGCTA AAGCTATGGA TGAAGTTGAT AAAAATCCTC AGTTAAAATA TCGAATTATC CGAGGATTAA AAGCAGGAGG TTTAGCAGCT TTAGAAAAAA TGATTGATCA TCCTGTTGCT CAGTTCCTTA TTGAAGGTGT AAAAGAAGTA TTAAATCCTT GA
|
Protein sequence | MNSTKAIDSE IAVERVVGFA QQFDGTHLDL ACHAAFPQTL TPDLLYQIWL RFVPQAPWTA VARILLSRLC REVGYELYEM DVNVRNLLLQ ELKEDKRFDK PRLKELADFY SNYVKQQLDG DDLRRRDLGT AEYWMSLACS QPNQLNHKLA LAIEERLKQK NWKELFRFGL FIESFPTALA EFEPPLITYA RGMVSFTSGD LEGATKQFSQ LSRWERQVKI AGVSLSIPDE IPLISVELSF LEELLNIVSD NDDNPQWKIY PFLEANLERL NEDLIGLLQE WSTNILLNTE PVELHRIGSS LTRFSNLLGN FTLGNIAINL EIAITGYQIA CQIFRREEFP KEWGIIQNHL GIAYSNRIRG DKAQNIESAI AACQQALMVL TQTDFPFEWA ATQNSLGNGY SERIRGDKAE NIEVAIAAYE QALLVYTQTD FPMDWAMTQN NLGNAHRDRI RGDKAQNIEA AIAAYQQALL VYTQTDFPMD WAMTQNNLGA AYSDRIRGDK AENIEAAIAA YQQALLVYTQ TDFPMDWANT QNNLGIAYRN RIRGDKAENI EAAIAAYQQA LLVYTQTDFP INWAMTQNNL GNAYSNRIRG DKAENIEAAI AAYQQALLVR TQTDFPINWA MTQNNLGNAY RDRIRGDKAE NIEAAIAAYK RALQVSTQTD FPIDWAGTQN NLGNAYSDRI RGDKAENIEA AIAAFQQALL VYTQTDFPMD WATTQNNLGN AYSDRIRGDK AENIEAAIAA YQQALLVRTQ TDFPMDWAGT QYNLGIAYSD RIRGDKAENI EAAIAAYQQA LLVRTQTDFP MDWATTQNNL GNAYSDRIRG DKAENIEAAI AAYQQALLVY TQTDFPMEWA TIQNNLGNAY SNRIRGDKAE NIEAAIAAYQ QALLVRTQTD FPMDWATTQN NLGAAYIYRI RGDKAENIEA AIAAYEQALL VYTQTDFPME WATIQNNLGN AYRNRIRGDK AENIEAAIAA YEQALLVYTQ TDFPMEWATI QNNLGNAYRK INQDLAQAAK DIKNLLNQLS EDYPNDSYRV LSAKAMDEVD KNPQLKYRII RGLKAGGLAA LEKMIDHPVA QFLIEGVKEV LNP
|
| |