Gene Information |
Locus tag | Tery_0221 |
Symbol | |
ID | 4241817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 339480 |
End bp | 340676 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638105565 |
Product | hypothetical protein |
Protein accession | YP_720182 |
Protein GI | 113474121 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0510] Predicted choline kinase involved in LPS biosynthesis |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.676637 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.491604 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTTG TCTTAAACTC TCGAAATGTC TACGACTATT TAGTAGAAAA TGGTTTATGT GATTCCCTAC AGGAAGGGGA GCCATATCCT TCCCAGAACT CTCAGAGTAA TGTTGAGCTA ATTAGTGCTA AGAATTTTAA TTTATTGGTG ACTTTGCCAG AGGGTCGAAA GCTTCTGGTT AAACAAGAAC AATATAATTC TCAAGGAAAA ACCCTTGGTG AGTTGTTTGG AGAATGGCGA ATTCAAAAAT TCTTACGGAC ATTTCCTGAA CTTGACCATT GGCGTGTCTT TCTGCCAGAG TTGTTATTTC ACGACCCTGA AAAGTCTATT GTGGTCTCTA CTTACCTGGA TAATTATCGA GATTTAAGTG ATTTCTATTC TAAGGAAAAT ACTTTCCCTA CAAAGGTTGC AGCTCAAATC GGTAATTTTT TGGGGACTGT TCACCGCGAT ACTTGGAATA GAGAAAATTA TCGAGAGTTT TTTGCTAGTG AAGTTGTTAG TACTAAACCT GCAGTCTCCC CACTTTTGGT GGATAGTCTC GAAAGGATAG GACCAGACAT TTTTGGTATT GCTCCTGTAG ATGGGTTGAG ATTTTTTGCT CTTTATCAAC GATATGACAG TTTGGGGAAA GCGATCGCTC AACTTCGCGA GACTTTATCT CCTACTTGTC TGACTCACAA TGACCTGAAA CTAAATAATA TTCTCATGGC TCAAAATTGG GAAAATTCCG GTGAAAATAT TGTGCGGTTA ATTGATTGGG AAAGGTCTAG TTGGGGAGAC CCTGCTTTTG ATTTGGGAAC GGCTATCAGC AGCTATCTGC AACTTTGGTT GGGTAGTTTG GTTATTAGTA ATTCTCTGAG TATAGAAGAA TCTCTCAAAT TTGCCACCAC TCCCCTTGAA TTACTTCAAC CGTCTATTGC TGCTTTAGCT AAAGCTTATT TTGACACTTT CCCAGAAATT TTGGCATATC GTCCTGATTT TTTGAGACAA GTAGTACAGT TTGCAGGTTG GGGTTTGATG ACAGGAATTT TGTCAATGGT TCAATATCAA AAGACTTTTA ATAATACGGG TATTGCGATG CTTCAGGTTG CCAAAGCATT ATTATGTCGC CCTGACTCAT CAATGTCTAC AATTTTTGGT GTTGCTATTG AAGAAGTATT GAGGAGTCAG GAGCCAGGAG TCAGTAGAAT GCATTAG
Protein sequence | MKFVLNSRNV YDYLVENGLC DSLQEGEPYP SQNSQSNVEL ISAKNFNLLV TLPEGRKLLV KQEQYNSQGK TLGELFGEWR IQKFLRTFPE LDHWRVFLPE LLFHDPEKSI VVSTYLDNYR DLSDFYSKEN TFPTKVAAQI GNFLGTVHRD TWNRENYREF FASEVVSTKP AVSPLLVDSL ERIGPDIFGI APVDGLRFFA LYQRYDSLGK AIAQLRETLS PTCLTHNDLK LNNILMAQNW ENSGENIVRL IDWERSSWGD PAFDLGTAIS SYLQLWLGSL VISNSLSIEE SLKFATTPLE LLQPSIAALA KAYFDTFPEI LAYRPDFLRQ VVQFAGWGLM TGILSMVQYQ KTFNNTGIAM LQVAKALLCR PDSSMSTIFG VAIEEVLRSQ EPGVSRMH
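The coordinate, length, and GC content fields in this record are related by simple arithmetic. As a minimal sketch (the function names are hypothetical and not part of any database API), the Python below checks that the reported gene length follows from the start and end coordinates, assuming they are 1-based and inclusive, and that the protein length follows from the gene length, assuming an unspliced CDS with a single terminal stop codon. It also shows one way a GC percentage could be computed from a sequence written in space-separated blocks.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values copied from the record above.
print(gene_length(339480, 340676))  # 1197
print(protein_length(1197))         # 398
```

Passing the full gene sequence string to `gc_percent` should give a value comparable to the GC content field, although the rounding convention used by the source database is not stated in the record.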