Gene Information |
Locus tag | Tery_2093 |
Symbol | |
ID | 4243927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3267818 |
End bp | 3268873 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638107202 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_721805 |
Protein GI | 113475744 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
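The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of a complete CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record for Tery_2093.
length = gene_length_bp(3267818, 3268873)
print(length)                     # 1056, matching "Gene Length"
print(protein_length_aa(length))  # 351, matching "Protein Length"
```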
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.391868 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATGCTA GCCAGCTTTT AAGTGCAAAT GAACTGTTAT TTAGACATTC CCAAGGAGAA AGAAATTTTC AGGGTGCAAA CCTAATTGCC GTTAATTTAA GTGCAGTCAA CCTCAATTGT AGCAATCTAA GTAATGCTAA TTTTAAAGAT TCATACTTAG GTAAAACAAA ACTAATTGGA TCTAATTTAA ATGGTGCAGA TTTTAGTTAT GCTAATCTGT CCGAAGCCAA ATTTATAGAA GCTAATTTGA GTGCTGCTAA CTTTACTAAA ACCACACTTA TTGCAACCGA TATAAGTGGA GGAATTTTGA GTGGGGCAAT TTTCTCAGAA GCTAATTTAA CAAGAGCTAT TCTCATTGGT ACTAGCATGG TTGGAACTTC TTTACTAAAT TGCTCAATAT TAACCAAAGC TAACCTCACA AGAGCTACTC TTTCTCGTGC TATTCTTAGT GGTGCTGACT TAACACAGGC TAACTTGAAT CGGGCAATTA TGACTGAAGT GGACCTAAGT GGCACTTTGC TAAATCAGGC AAGTCTAATT CGAGCCTATC TACAACGGGG TAATCTCAAT GGTGCAAAAC TAATCAAGGC AGATTTAACA GAAGCCACTT TAGTACAGGC CAACCTTTGT GCTTCTGATT TAACTGGAGC AGAGTTGCAA GGTGCAAATC TCAGTTATGC TAATTTAAGT GGGTCAAATT TGATGGGAGC GAATCTACAG GGAGCAAATC TCAGCAATAC TAATCTTAAT GGTGTTATTC TCCAACAGGC AGACCTGCAA GCTGCTGACT TGAGCAAAGC TAGCTTACGA GGTGCTAATT TAAAAGCTGT TAATCTCTCA GGGGCAAATT TATTGAAAGC TGACTTGCGC GATACTAACT TACAAAAGGC TAATCTTTAT GGCGCTGGTT TATTGTTAGT ATCTCTCAAA GGCGCCAACT TAAAAGAAGC CTGTTTATGT AATGCTAACT TAATTGGGTC TAGTTTAAAT CTTTCTAGTC TTCAGGATGT TTGCCTAGAA AAAACAATTA TGCCTAATGG TTCAATTCAT GAATAG
Protein sequence | MNASQLLSAN ELLFRHSQGE RNFQGANLIA VNLSAVNLNC SNLSNANFKD SYLGKTKLIG SNLNGADFSY ANLSEAKFIE ANLSAANFTK TTLIATDISG GILSGAIFSE ANLTRAILIG TSMVGTSLLN CSILTKANLT RATLSRAILS GADLTQANLN RAIMTEVDLS GTLLNQASLI RAYLQRGNLN GAKLIKADLT EATLVQANLC ASDLTGAELQ GANLSYANLS GSNLMGANLQ GANLSNTNLN GVILQQADLQ AADLSKASLR GANLKAVNLS GANLLKADLR DTNLQKANLY GAGLLLVSLK GANLKEACLC NANLIGSSLN LSSLQDVCLE KTIMPNGSIH E
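The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of plain Python. This is a minimal illustration, not the database's own pipeline. It assumes the sequence is pasted in the 10-base blocks shown above, and it relies on the fact that translation table 11 assigns amino acids to codons the same way the standard code does (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Amino-acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Trailing bases that do not fill a codon are ignored.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks of the gene sequence above.
print(translate("ATGAATGCTA GCCAGCTTTT"))  # MNASQL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record, and `gc_fraction` on the same input can be compared against the reported GC content.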