Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37891 |
Symbol | |
ID | 7202830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 334308 |
End bp | 335843 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182049 |
Protein GI | 219123474 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.934504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGCC TGGGGGGCGG ATCGTCTGGC CAGACACCCC TGCTGGATGG AGACGACTCA GGAGGACCCC TTCCCTTGAC CCAAACTTTG TCGTCGCTGC GTTTAGAGCC CACACAAAAT CTTTCAATAG TTCCGGAGAA GACAAGGTCA GAAAGGAAAA CGCCAATCAA ATTCACATCT TTTCAAATGC CTCGCATATC TTCCAGTTCC AGTGACCATG CTCCTATGCT ACCAGACCGG TCACCGGAAG CCATGCAGAG TGTGTCCCTA CAGAAGCTAT CACAACGATC TGTGAATACT TTTCGGGCTG CTGAGCATGG AAAAGTGGAT ATTTTGCCAA TTTCGATAGC ATCTTCTACT TTCAATAGAA AACGAGCCGT TTCCTCGAAT AACATGTCAT CAGGCTGGTA CGGGCAGAGT TCTATGCCAC AGTCCGAAAG GCCAATGCCT CGTACGCAAT CACTAAGATC ACGAAGATCG AGTAGCAAAC CTACCTGCTA TGATGATGAA CAAAACGACG GTCCGGATAA ATCGGTAACT CCTTACACTC GTCACGTCAG ACATGGATGG GAGGACGCCG ACTTCGGAAG TCCATACAAA CTATCACACA AATCGTTGCG ACGAAGCTTC ACTCTCAACA AGGCTTTTCT AACGCTGCAA CTCGGTCATT TCTTACAACT CGCGGTGGTT ATAGTTGTGT CCTTTTTAGT TTTCGACTCT TATCATAGAG CCATTACGAC GACCGACCAG CTAAGTAAAT TCAAGAACGA CGAATCCATG TTGCTATTAC ATCTTCATCG TGTGGAGCAG CAAGCCTTGA ATCTTCACGA AGAATTTGAC AGACTTAGCA AAAAAGATTT CGTCGAATCT CACAGCATTC AGAACGAGGA TGGTAGGATT GAGCGGAAAA ACGCGGTCGT GGTTGACTCA GAACTGATTC GAAAGCAGAC ACAACAGCTC AGGCAAATGG AAGAGGAGCT GAGCCACGAA GTACGAGCGC TTCAAGAAAG TATTCAAATC GCCGCTCGCA GCTGCATTGT TAGAACTTTT GGAGAGGGTC CAGTCCAGGT AATCCTGGAT CTTAACTTCG GTGAAAGGAA TATACAGGGT GGCACAAAAC TCACCATTCT GCTGTGGTAT GATACCCCTC ATGCAGCCTG GACGCTACTT GAACAAATTC GAAAGGGGAT TTGGGACGGT GCTAGCTTCA GACTTGACAA GGGGAGATCA ATTGCAGCGG CTCCGGAAAA TCCTGACAGG GAATCAAAGC TCGAGTTTAT CGAACATTCT CAGAAAAATC ACGACCCATG GACGATTGGG TTGAGCGATT TTGGCGATGA CGGAATCGGC CTATTTGTCA ATCTGAAGGA CAATTCGGCT TTCCACAAAC AGGATGTCTG TGTTGGGAAA ATCATCGACG GTTTCGATGC ACTGCAACAG CTAGTTGATC TTTCAAGGAG TCGAACAAAC AAAATATCGA TAGCCGCGGC GACTGCCTCT CATCTCACTA GAGAACATAC TTCTGGGCTC GTATAG
|
Protein sequence | MPRLGGGSSG QTPLLDGDDS GGPLPLTQTL SSLRLEPTQN LSIVPEKTRS ERKTPIKFTS FQMPRISSSS SDHAPMLPDR SPEAMQSVSL QKLSQRSVNT FRAAEHGKVD ILPISIASST FNRKRAVSSN NMSSGWYGQS SMPQSERPMP RTQSLRSRRS SSKPTCYDDE QNDGPDKSVT PYTRHVRHGW EDADFGSPYK LSHKSLRRSF TLNKAFLTLQ LGHFLQLAVV IVVSFLVFDS YHRAITTTDQ LSKFKNDESM LLLHLHRVEQ QALNLHEEFD RLSKKDFVES HSIQNEDGRI ERKNAVVVDS ELIRKQTQQL RQMEEELSHE VRALQESIQI AARSCIVRTF GEGPVQVILD LNFGERNIQG GTKLTILLWY DTPHAAWTLL EQIRKGIWDG ASFRLDKGRS IAAAPENPDR ESKLEFIEHS QKNHDPWTIG LSDFGDDGIG LFVNLKDNSA FHKQDVCVGK IIDGFDALQQ LVDLSRSRTN KISIAAATAS HLTREHTSGL V
|
| |