Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1403 |
Symbol | |
ID | 4243074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2133698 |
End bp | 2135026 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638106564 |
Product | TPR repeat-containing protein |
Protein accession | YP_721175 |
Protein GI | 113475114 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.116952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAAA TTAAAAACTT ATTTATAACC AGCAGTTTTT TTTGCCTTTA CCTTTCAACT TCTTGTTTAG CAAAAACTAA TGAGTTTCCT GAATATACTC TAAAAAAAAT ATCTATAAAT ATTATTTCTC AACGTCAAAT AGAAAAAGAA GAGGAGTTAG AATCTCTAGA TATATTTTCT CCTAATCCTC TAGAAAATAC TAAACCAGAT CCACTATTAC CAAATCCTCC TACTTCTGGA ATTATAACTG GAGAAAAACG AGAAAAACTT TCTCAGAAAC TCGAGCAACT TAATGTTGAA GCAGCAGCAT TATTAGCAGC AGGAGACAAA ACTAAAGCTT TTGAAATTTG GAACAGAGAA TTACGTTTGC GTCGTTATCT TGGTCAAGTA GAAGAGTTAG CAGCTTTGTC AAGAGTTGCT GAAATAGCTT GGGATAATAG TCAAAAATTA CAGGTGCAAT GGATTACAAA AAGACTCCAA ATTATACAAC CAGAAATTCA ACGGGAAGAA CCTCCTAATT TTGAATTATT ACAAGCATTA GGTTCAGCAT TTAATACAGT TCGAGCTAAA GATTCAGCAG TTGAAGTTTA TCAACAGGTT ATTAGTGAGG CAAGGAAACA AGAGGATTTT GTTACGCTAA AACAAGCTTT AATATCTGTA GGAGAAATAT ATTTGAATTG GTTGGATTAT GATAATGCAG CGATCGCCTA TGAAGAATTA TTAGATGTTC AGCAGCAGAG TAATTCAGAA TTCTCTAAAA TAGAAACAAC AAATCCTTCT ATAAATGGTA ATTTTGAAAA AGCAAATACT CTGAAACAAT TAGCTTTCAT TTATAGTCAA GGGGGGAAAC TTTTACCAGC AATTACAGCT AAAGAAAGAT TAGTAGATTT TTACGAAGGT CAACAAAATA TTGCAGAAGT TTCAGAGATA AAAATTTCCA TTGCTGAAAA TTATGAGGAA TTGGGAAGAC TTAATCTTGC CAGTCAATAC TATCAAGAAG CTTATAGTAT TGCTCAATCA ATTCAACAAT TTTTAACTGC CAGTGATGCT TTAGAAAATT TAGCAGCCCT CTATCTTTCT CAAGATGAAA AAGAGGCAGC AATTGAAATA TATAAGGTAC AATTATTAAT GGTTCAACAG TTTGTTAATG TTTATGGTAT GATGGAAATT TATGATCGGA TGGGGCAAGT TTATGCGCAG TTAAAGGCTT TTAGTTTCGC AATGTCATCT TTTCAAAAAG GTTTAGAATT ATCGAGACAA TTGGGTGGAT ATCGAGAAAG TTATTTTCTA CAAAAGCTTG AAAAATTGAA TAGAAATAGA AATAAGTGA
|
Protein sequence | MIKIKNLFIT SSFFCLYLST SCLAKTNEFP EYTLKKISIN IISQRQIEKE EELESLDIFS PNPLENTKPD PLLPNPPTSG IITGEKREKL SQKLEQLNVE AAALLAAGDK TKAFEIWNRE LRLRRYLGQV EELAALSRVA EIAWDNSQKL QVQWITKRLQ IIQPEIQREE PPNFELLQAL GSAFNTVRAK DSAVEVYQQV ISEARKQEDF VTLKQALISV GEIYLNWLDY DNAAIAYEEL LDVQQQSNSE FSKIETTNPS INGNFEKANT LKQLAFIYSQ GGKLLPAITA KERLVDFYEG QQNIAEVSEI KISIAENYEE LGRLNLASQY YQEAYSIAQS IQQFLTASDA LENLAALYLS QDEKEAAIEI YKVQLLMVQQ FVNVYGMMEI YDRMGQVYAQ LKAFSFAMSS FQKGLELSRQ LGGYRESYFL QKLEKLNRNR NK
|
| |