Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22658 |
Symbol | |
ID | 7195186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 24032 |
End bp | 25249 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183411 |
Protein GI | 219126326 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCGG CAGCGGCAAC TACTAGCAGC GCGGGGACGC TGCGGTTCGA AGATGGCGCC GTCCAGTTCC GGCAGCGCAT CGTGGTTAGC ATTCTCTCGC ACCGATCACT TCTGATACGC AATATTCGAG CGGAAGACAT TGATGCACCG GGTTTGCGGC AGTATGAAGC TTCATTTCTG CGATTAATTG ACAGCATGAC GAATGGTAGT CGTATTGAAA TCAACAGTAC GGGTACACAA CTTAGGTTAA GTCCAGGAGT GTTGACTGGA GGATCCATTG AACATACTTG TCCAGTTCCG ACACAGCCAA CAGATTCCCG CCCAGACGAA GAATCAATGG ATTCATCTCG CTCGATTGGT TGGTTTTTGG AAGGCATACT TCCGCTCGCC CCTTTTGGCA AGGAACCCTT GTCGGTTTCC TTTTTTGGCA TTACAGACGG CACTTGCGAC GTCGATCCTA CCTCAGACTA CCTTAAAGCC TCCGCTCTGC CCTTGTTTCA AAAGTTTGGC GTGGGTGTGA CGGATGCCGA AGATTTCCTG TCGCCACAAG CACCGAGTAT TCGCGTGGTG CGACGCGGTG CGGCTCCGGC CGGAGGCGGT CGCGTGGAGC TCTATTCTCC CGTCGTGCAA GTACTGCAAC CGATCGAATT TACGGATCCC GGAAAGTTCA AACGGGTCCG CGGGACAGCC ATTACCTGCA AAATTGTCTC GTCGAGTATG GCCGCACGGG TTGCCTTTGC CTCCAAAGGC CTCTTGCACC GATTGCTGCC GGATGTTTGG ATCCATACAG ACGCGCACAC CATCAAACAC CACAAATGCG GTCCGAGTCC CGGCTTGAGT CTCGTATTGA CGGCAGAATC TACCAACGGT GTTGTGTTGA CGGCCGAGTG CTGTTTGGAC TACCGTAAAG ATGCTAGTCG GGAATTACCG GAAGACTTGG GCACACGAGG ATCGGCTTTG CTGTTGAACG AAATTCGTAA AGGCGGTTGT GTCGATACCG GAATGCAAAG TCTGGCGCTA TTGTGGATGT GTTTATCACC CGAAGATGTG AGCCGAATAC GGGTCGGTAC CCTATCTCAG TACGCTGTAG AGTCGTTGCG GCTTTTCAAA CAAGCCTTCG GTGTGGAGTT CAAAGTCAAA CCGGATCATG CTACCAAAAC GGTCCTCCTA AGTTGCTTGG GTTCTGGTTA CCGAAACATG TCGCGGGCGG CAACGTAG
|
Protein sequence | MSAAAATTSS AGTLRFEDGA VQFRQRIVVS ILSHRSLLIR NIRAEDIDAP GLRQYEASFL RLIDSMTNGS RIEINSTGTQ LRLSPGVLTG GSIEHTCPVP TQPTDSRPDE ESMDSSRSIG WFLEGILPLA PFGKEPLSVS FFGITDGTCD VDPTSDYLKA SALPLFQKFG VGVTDAEDFL SPQAPSIRVV RRGAAPAGGG RVELYSPVVQ VLQPIEFTDP GKFKRVRGTA ITCKIVSSSM AARVAFASKG LLHRLLPDVW IHTDAHTIKH HKCGPSPGLS LVLTAESTNG VVLTAECCLD YRKDASRELP EDLGTRGSAL LLNEIRKGGC VDTGMQSLAL LWMCLSPEDV SRIRVGTLSQ YAVESLRLFK QAFGVEFKVK PDHATKTVLL SCLGSGYRNM SRAAT
|
| |