Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0790 |
Symbol | |
ID | 4243201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1256445 |
End bp | 1257632 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 638106071 |
Product | NHL repeat-containing protein |
Protein accession | YP_720683 |
Protein GI | 113474622 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.395295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTAT CTCTTAACAA TTACCCTAAT GAAAACCAAG AAATAAAACC ATATATCCTT GAACCTAAGG GAGCAGAAAT TATTCTTGGC GCAGCATATT CTAATTCCTT AGTTGTACCA CTAACACCCA GTAAAACCAC GATGTTCGGA CCCCGTGGGG CTTGCCTCAT CTCCGAAAAC GGCCCCCTGT GGGTAGCTGA TACAGGACAT CATCGGTTAT TAGGATGGCG ACAACGCCCC GAAACTGACG ATCAACCTGC TGACTGGGTG ATAGGGCAAC CTGATTTTTC TCAAGAGGGG CAAAACGCTA ATGGTCAAAC AACCGCAGCA ACTGTTAGCG TTCCTACTAG TATCTGTGCT TATGGGGAAG GGTTGGCAGT GGCGGATGCT TGGAATCATA GGGTCTTGAT TTGGAAACAG TTACCGGAAG ATAATAATGT TCCTGCTGAT ATTGTTTTGG GGCAAGCAGA TTTTTCTGAA AATGAATCTA ACCGAAGCAA GTTAGAAACT GCTGCTGACA GAATGCACTG GCCTTATGGT GTCATCTGTC ATCACGATCA GCTCTGGGTA GCGGACACGG GTAACCGTAG GGTATTGATG TGGCAGCAAT TACCAGAGGT TAACGGGCAA CCAGCAGATT TGGTTTTGGG ACAAACAGAT ATGAGTTGTC GCGATGAAAA TGGTGGCGGA GAGGCAACGG CAGCAAGTAT GCGTTGGCCT CATGATATTA CTTTTTGGGA AGAAAGTTTG GTTGTGACGG ATGCTGGCAA TAACCGGGTG ATGGTTTGGG ATGGTATTCC TACTGAGAAT AATCAACCTT GCTCTGTAGT GCTTGGTCAA TCTCAATTTG ACACGGTACA GTTAAATCAG GGGGTTTATT TTCCTTCTGC TGTTAGTTTG AGTATGCCTT ATGGTGTGGT GGCGACTGGG GAATGGTTGA TCGTTGCTGA TACTGCTAAT TCCCGGTTGT TGGGTTGGCG GGATGTTGTG GGGATGGGAA CTCCTGCACT GGCTTTGACT GGTCAACCTC ATTTTGAGAG TAAGAGTGAA AATGCTTTGA GTTTACATCC AACTCGGCAA AGTTTGTGCT GGCCTTATGG GATTTCTGTT TGTGGTAATA CTGCTGTTAT TGCTGATTCT GGTAATAATA GGGTTTTGTT GTGGTCTTTG ACATCCTCCC CAACTTAA
|
Protein sequence | MKVSLNNYPN ENQEIKPYIL EPKGAEIILG AAYSNSLVVP LTPSKTTMFG PRGACLISEN GPLWVADTGH HRLLGWRQRP ETDDQPADWV IGQPDFSQEG QNANGQTTAA TVSVPTSICA YGEGLAVADA WNHRVLIWKQ LPEDNNVPAD IVLGQADFSE NESNRSKLET AADRMHWPYG VICHHDQLWV ADTGNRRVLM WQQLPEVNGQ PADLVLGQTD MSCRDENGGG EATAASMRWP HDITFWEESL VVTDAGNNRV MVWDGIPTEN NQPCSVVLGQ SQFDTVQLNQ GVYFPSAVSL SMPYGVVATG EWLIVADTAN SRLLGWRDVV GMGTPALALT GQPHFESKSE NALSLHPTRQ SLCWPYGISV CGNTAVIADS GNNRVLLWSL TSSPT
|
| |