Gene Information |
Locus tag | Tery_4219 |
Symbol | |
ID | 4245871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6506622 |
End bp | 6509042 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638109115 |
Product | heterocyst differentiation protein |
Protein accession | YP_723693 |
Protein GI | 113477632 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
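
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon (the gene sequence in this record ends in TAG).

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons; the final codon is
    treated as a stop codon when has_stop is True.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length = gene_length(6506622, 6509042)
print(length)                  # 2421, matching "Gene Length"
print(protein_length(length))  # 806, matching "Protein Length"
```

Both values agree with the record: 2421 bp is 807 codons, and dropping the stop codon leaves 806 aa.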
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.638928 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACTCAAG AATTTCATGT TTCAATTACT CCCTTGGGAG AAGACGAATA TTTTGTCCGG ACAGAAAAAG TCCCTGTTGG AGGCCCAGTG GCAGAAGAAA TAGTCCAATG GCCTGTAGAA AAATGGCTAA CTCAAGCCCG TCAACTATTT CGTGATCCAT TAACTGATCT TCTAGAAGAA AAGTCCAACT CTCAGGTAGT GATCCCTAAT CAGATTATGT CCAGGTCTGG GGAAGTAGTA AGTCCAAGTT TAGTAGCATT AGGCCAAGAA ATGTATCAAG AACTATTCAA AGATAGCCTG AGAGATAGTT GGAATTGTGC CCAAGGAATC GCTCACAATC GAGGTGAGGT TTTACAGTTA CGTTTAGGTA CTAAAGACAG ATATCTATCT AGTTTGCCTT GGGAGGTACT CCATGTAGGC GATCGCCCCC TGGCGACTGG TACTGACATT GTCTTCTCCC GATATCAACC TAATACCTGC TTACGGAAAC TTAACCGGAT ATTAACACCT GAAGAACCAT TAAAAATTTT AATGGCGATC GCGGTTCCCA GTGATAAGGA TAGTCTAGAA CTGGAAAAAG AATATAAGGT ATTACAAGAA GAACTGCAAA AAAATAGTGG GGCAACTCAG ATTTACCTAG ACATTCTCCG GCAACCAGGG CGAGAACAAC TTACCCAAGC ATTAGAACAA GGTAAATATC AAGTTTTCCA CTATGCAGGT CATAGCAACT GGGGTATTTC GGGTGGTGAA ATATCTTTAG TGAGTAATAT AACAGGTTTA GCAGAAAGTC TGAGTGGTAA GGATCTGTCA GGTTTGCTAG TTAACAATGG CATTCAGATG GCGATCTTCA ACTCATGTCG AGGTGCGTAT AACCATCTTA TTGAACCTAC TGATGAGGGA ATAGAGCTCA ACTTAGCAGA AGCGATGGTC AAACGGGGAA TTCCTGGGGT GCTAGCAATG GCAGAACGTA TTCCAGATGA AGTTTCTCTT ATCCTAACAC GTTTGTTTTA TCGTAATCTC AATCAGGGTT ATCCGATAGA CTTAAGTTTA AATCGGGCAA GACAAGGTTT GATTTCTGCC TATGGTTCAC ATCAACTTTA CTGGGCATTG CCTGTTGTTT ATCTACATCA AGATTTTGGT GGATATCTAA TTGAAAAACA AAGCGATTCT CAAAATGAAA TACAAGAGGA TATCCTGACG TCTATTGATG AAAAAATTGA AACTCCGAAT TTATCAGAAA ATTTAAATCC CGTACAACCA GATATTGATT TAGGGAACCT GGATGAGGAT CCTACTATGG CTAAAAATGA TAATTTAAAT TTCCAGTTTC AAGACAGTAA ATATCAAGTT ATGACACTTG AAGATATATT GCCAGAAAAA GAAGATATAT TGCCAGAAAA AGAAATAGGA AGGCAAATAA ATAATCAAGA CTATGAAAAA CAAGAAGATT TAATAACACC TACACAAAGT AGTGAAGAAT CAACTCATTT AATATTCACG ACATCAAAGA CTAAAAATAA GTCTCATAAG TTTCTTTGGC AAACAAGATT TACATTATTA ACATTAGGAT ATGCAATTTC AGGAATAATA TTATTTTTGG GAATTAACTA CATTGTTTTC AATAATCGTC AATCAATAAT GAGTGCTGAA TTACCTAAAG CACCTGCTCC AGCTATCAAA TATAAATACA ACAGAGTTAG ACTTGTTAAT CAGACAATTA ATACTACCCC AGAAGTACAG AACATGGCAA TTCAACAGTT TAATCAAAAT GAGCTTCGAG CTGGTAAAAT GGTAGTAGAA 
ACATTATTAG AAGAAGGAGA ATTATCTCAA GCAGCAGAAG TTATAGCTGT TGTGCCAGAA AAGCTTAGTG AAGATCGAGA AATTAATTTT CTTAAAGGTC GATTAGCTTG GGAATTATTC CAAGATAGAA ATCAACATAA TCTGATAGAT GAAGCTATAA ATTATTGGGA AAAAGCTGCT GCCCAGAGCC AAGAAAATCT TCAATATCAA AATGCTTTAG GCTTTGCTTA TTACACTAAA GGAGATATAG AAAAAGCCTA TGCAGCTTGG TTAAAAGTGT TGCATTTATC TGGAGAAATT GCCCCAGAAA TAAAAAATGT TAGCCCTGTT TCTCATAACT ACATGGGAAA TTTATCTGTC AAGAATAGAG AAGTTCTAAA TGCTTATGCA GGTCTAGGAT TAGTTACCCT TAAATATGCT CAAAATTTAC AAAAAATTCC TGATAAAGCT TTTAAATATG CTAGTAAAGT TATGAGAGAA GCTGGCCAAG AGTTTCATGT CCTACAACTA CAAAAAAATT GGTTGTGGTC TCCTAAAGCA CGTCAAGATT GGGATTTACT ATTGAAACTT CGAGAAAGGC AACAATTTGA ACATAATAAA AAACCAGAAG AGCTGGCTTA G
Protein sequence | MTQEFHVSIT PLGEDEYFVR TEKVPVGGPV AEEIVQWPVE KWLTQARQLF RDPLTDLLEE KSNSQVVIPN QIMSRSGEVV SPSLVALGQE MYQELFKDSL RDSWNCAQGI AHNRGEVLQL RLGTKDRYLS SLPWEVLHVG DRPLATGTDI VFSRYQPNTC LRKLNRILTP EEPLKILMAI AVPSDKDSLE LEKEYKVLQE ELQKNSGATQ IYLDILRQPG REQLTQALEQ GKYQVFHYAG HSNWGISGGE ISLVSNITGL AESLSGKDLS GLLVNNGIQM AIFNSCRGAY NHLIEPTDEG IELNLAEAMV KRGIPGVLAM AERIPDEVSL ILTRLFYRNL NQGYPIDLSL NRARQGLISA YGSHQLYWAL PVVYLHQDFG GYLIEKQSDS QNEIQEDILT SIDEKIETPN LSENLNPVQP DIDLGNLDED PTMAKNDNLN FQFQDSKYQV MTLEDILPEK EDILPEKEIG RQINNQDYEK QEDLITPTQS SEESTHLIFT TSKTKNKSHK FLWQTRFTLL TLGYAISGII LFLGINYIVF NNRQSIMSAE LPKAPAPAIK YKYNRVRLVN QTINTTPEVQ NMAIQQFNQN ELRAGKMVVE TLLEEGELSQ AAEVIAVVPE KLSEDREINF LKGRLAWELF QDRNQHNLID EAINYWEKAA AQSQENLQYQ NALGFAYYTK GDIEKAYAAW LKVLHLSGEI APEIKNVSPV SHNYMGNLSV KNREVLNAYA GLGLVTLKYA QNLQKIPDKA FKYASKVMRE AGQEFHVLQL QKNWLWSPKA RQDWDLLLKL RERQQFEHNK KPEELA
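
The "GC content" field can in principle be recomputed from the gene sequence as the share of G and C bases. The following is a minimal sketch, not the method the database necessarily uses: `gc_percent` is a hypothetical helper, and it assumes the sequence is supplied as plain text in whitespace-separated blocks, as it appears in this record.

```python
def gc_percent(sequence):
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block spacing and line breaks used in the
    record) is removed before counting; case is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, used here only as a small example.
print(gc_percent("GTGACTCAAG AATTTCATGT"))  # 35.0 for these 20 bases
```

Passing the full gene sequence to the same function would give a value that could be compared with the 37% reported in the record.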