Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2106 |
Symbol | |
ID | 4243942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3281950 |
End bp | 3285126 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638107214 |
Product | hypothetical protein |
Protein accession | YP_721815 |
Protein GI | 113475754 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.866139 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.660296 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGTAGTT CTCAAAAGGA GAATTCTGAT GATACAGATA TTCTAATTGG CAGTAATATT GATTTAATTT CTGTAGTACA AAATGCTCTG CCAATAGTTC AACATTTGCT TGACAATTTC TCTAGTAAAG TAGATTTTGA AGAGCAAATG AATCTGGCTT TTGGAGAGAG TTATGATGTT AGTAAAGCTG ATGCTTTAAT TGGTACTTGG CAAAATGAAC ATGCCGGCTT TTTGCCACAG ATAAAAATTG TTTCTGAGAG TAAAATTAAT GGAGCGAATG GTGCTTTTGC TGGAGAGACA CAGACCATTT ATTTAGCACA AGAGTTTGTT GAAGATAATG CTGGGAATGT AGGGGCGATC GCTCCTATAA TTCTTGAAGA GTACGCTCAT TATTTTGATG GGGAAGTTAA TAGTTTTGAT ACCCCTGGGG ATGAAGGGGA AATTTTTGTC AGTTTTGTTT TGGGAGAGGA GTTGAGCGAG TCTGAGTTTT TGCGGATGAA GGTCGAGGAT GATTGGGCCA CAGTCTTCTT AAATGGTAAT ACTATTACCA TCGAACAAGC AAACCTTTCC TGGCTCGGAG GTAGCGGTGA TTGGTATAAC CCTAGTAAAT GGAGCGGTGG TAAAGTCCCC AAGTCTAGCG ATAATGTAAC TCTGGAAGTT TTTGGTCAAA ACATCAAAAT TAACTTTTCC AAGGGAAACC CTAATATTAA AGATTTGTTT TTAGGGGCTA AGGATGGTGG AACTCTAACT TTAAATGGAC TAACCACTGT AGGGAACGAC ACAGATATTT TAGCCGAAGG AAAAAACAGT GTAGTTAAGT TGCCCGATCT AAAGACTTTC TCAGGCAAAG ACCTATATCA GCCTTCATCT ATTACGGTGA AAGATGGTGG CACACTTAAA GCAAATAAAC TGCTCACTAT GAAAGAGGTA GATTTATTTG CGAATAACGC CAGTCTTACT TTACCTGGAG TAAAAAATTT TAGCGGTAAA AGCGATACTG TCATTGAAGC AACGAATGAA GGCAAACTGA CTTTAGGTGC GAGAACTATT AGTGGTGATG TGGATATTAC CGCCACCGGA GAAGGAAGTG TAGTGAATTT GCCCCTGTTA AACAATTTTA GCGGCACAGA CGTTTATCAA CCCTCATTTA TTAAGGCTGA AAATAACGGC TCGGTGACAG CTAAGAAGCT GAAAACCCTC AAAGAGGTAG ATTTGTATGC CGATAACAGC ACATTGAGTT TGTCAGGAGT CAACAACTTT AGCGGTAAAA GCGATACTGT CATTGAAGCA ACGAATGAAG GCAAACTGAC TTTAGGTGCG AGAACTATTA GCGGTGATGT GGATATTACC GCCACCGGAG AAGGAAGTGT AGTGAATTTG CCCCTGTTAA ACAATTTTAG CGGCACAGAC GTTTATCAAC CCTCATTTAT TAAGGCTGAA AATAACGGCT CGGTGACAGC TAAGAAGCTG AAAACCCTCA AAGAGGTAGA TTTGTATGCC GATAACAGCA CATTGAGTTT TTCAGCAGTC AACAACTTTA GCGGTAAAAG TGATACTGTC ATTCAAGCCA GGAATGAAGG CAAACTGACT TTAGGTGCCA AAACTATTAG TGGTGATGTG GATATTACCG CCACCGGAGA AGGAAGTGTA GTGAATTTGC CCCTGTTAAA CAATTTTAGC GGCACGGACG TTTATAAACC CTCATTTATT AAGGCTGAAA ATAACGGCTC GGTGACAGCT AAGAAGCTGA AAACCCTCAA AGAGGTAGAT TTGTATGCCG ATAACAGCAC ATTGAGTTTG TCAGGAGTCA ACAACTTTAG CGGTAAAAGC GATACTGTCA TTCAAGCCAG GAATGAAGGC AAACTGACTT TAGGTGCCAA AACTATTAGT GGTGATGTGG ATATTACCGC CACCGGAGAA GGAAGTGTAG TGAATTTGCC CCTGTTAAAC AATTTTAGCG GCACGGACGT TTATAAACCC TCATTTATTA AGGCTGAAAA TAACGGCTCG GTGACAGCTA AGAAGCTGAA AACCCTCAAG GTAGTAGATT TGTCTGCCGA TAACGGTACC TTGAATTTAT CGGCAAATAG CTTTAGTGGT AAAAGTGATA CTGTCATTCA AGCCAGGAAT GAAGGCAAAC TAACTTTGGG TGCCAAAACT ATTAGTGGTG ATGTGGATAT TACCGCCACC GGAGAAGGGA GTGTAGTGAA TTTGCCCCTG TTAAACAATT TTTACGGCAC AGACGTTTAT AAACCCTCAT TTATTAAGGC TGAAAATAAC GGCTCGGTGA CAGCTAAGAA GCTGAAAACC CTCAATGTAG TAGATTTGTA TGCCGATAAC AGCACATTGA GTTTTTCAGC AGTCAACAAC TTTAGCGGTA AAAGTGATAC TGTCATTCAA GCCAGGAATG AAGGCAAACT GACTTTAGGT GCCAAAACTA TTAGTGGTGA TGTGGATATT ACCGCCACCG GAGAAGGGAG TGTAGTGAAT TTGCCCCTGT TAAACAATTT TAGCGGCACG GACGTTTATC AACCCTCATT TATTAAGGCT GAAAATAACG GCTCGGTGAC AGCTAAGAAG CTGAAAACCC TCAAGGTAGT AGATTTGTAT GCCGATAACG GTACATTGAA TTTATCAGCA AACAGCTTTA GTGGTAAAAG CGATACTGTC ATTAAAGCCA GGAATGAAGG TAAACTGACT TTAGGTGCGA GAACTATTAG TGGTGATGTG GATATTACCG CCACCGGAGA AGGGAGTGTA GTGAATTTGC CGCGCTTAAC CAATTTTTAC GGCACAGACG TTTATCAACC CTCATTTATT AAGACTGAAA ATAACGGCTC GGTGACAGCT AAGAAGCTGA AAACCCTCAA GGAAGTAGAT TTGTCTGCCG ATAACAGCAC ATTGAGTTTT TCAGCAGTCA ACAGCTTTAG CGGTAAAAGC GATACTGTCA TTAAAGCCAG GAATGAAGGT AAACTGACTT TAGGTGCGAG AACTATTAGT GGTGATGTGG ATATTACCGC CACCGGAGAA GGAAGTGTAG TGAATTTGCC GCGCTTAACC AATTTTTACG GCACAGACGT TTATCAACCC TCATTCATTC TGGCTGAAGA TGGCGGTAAA GTCAAGGTTA AGAAATTGAC AGGAATTACC AACGTAGATC CATTCGTAAC TGGTTAG
|
Protein sequence | MSSSQKENSD DTDILIGSNI DLISVVQNAL PIVQHLLDNF SSKVDFEEQM NLAFGESYDV SKADALIGTW QNEHAGFLPQ IKIVSESKIN GANGAFAGET QTIYLAQEFV EDNAGNVGAI APIILEEYAH YFDGEVNSFD TPGDEGEIFV SFVLGEELSE SEFLRMKVED DWATVFLNGN TITIEQANLS WLGGSGDWYN PSKWSGGKVP KSSDNVTLEV FGQNIKINFS KGNPNIKDLF LGAKDGGTLT LNGLTTVGND TDILAEGKNS VVKLPDLKTF SGKDLYQPSS ITVKDGGTLK ANKLLTMKEV DLFANNASLT LPGVKNFSGK SDTVIEATNE GKLTLGARTI SGDVDITATG EGSVVNLPLL NNFSGTDVYQ PSFIKAENNG SVTAKKLKTL KEVDLYADNS TLSLSGVNNF SGKSDTVIEA TNEGKLTLGA RTISGDVDIT ATGEGSVVNL PLLNNFSGTD VYQPSFIKAE NNGSVTAKKL KTLKEVDLYA DNSTLSFSAV NNFSGKSDTV IQARNEGKLT LGAKTISGDV DITATGEGSV VNLPLLNNFS GTDVYKPSFI KAENNGSVTA KKLKTLKEVD LYADNSTLSL SGVNNFSGKS DTVIQARNEG KLTLGAKTIS GDVDITATGE GSVVNLPLLN NFSGTDVYKP SFIKAENNGS VTAKKLKTLK VVDLSADNGT LNLSANSFSG KSDTVIQARN EGKLTLGAKT ISGDVDITAT GEGSVVNLPL LNNFYGTDVY KPSFIKAENN GSVTAKKLKT LNVVDLYADN STLSFSAVNN FSGKSDTVIQ ARNEGKLTLG AKTISGDVDI TATGEGSVVN LPLLNNFSGT DVYQPSFIKA ENNGSVTAKK LKTLKVVDLY ADNGTLNLSA NSFSGKSDTV IKARNEGKLT LGARTISGDV DITATGEGSV VNLPRLTNFY GTDVYQPSFI KTENNGSVTA KKLKTLKEVD LSADNSTLSF SAVNSFSGKS DTVIKARNEG KLTLGARTIS GDVDITATGE GSVVNLPRLT NFYGTDVYQP SFILAEDGGK VKVKKLTGIT NVDPFVTG
|
| |