Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38525 |
Symbol | |
ID | 7203485 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 314367 |
End bp | 316608 |
Gene Length | 2242 bp |
Protein Length | 665 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182658 |
Protein GI | 219124747 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGAAG AGGAAAGCAA GAAGCGACCG CGAGACAACA TCGCCGAAAG CGAAAGTCTC AAGACGCCCA AACGCTTTTG GAATCGCGAA ATCCCCTCCA GCAGTCACTA TCACGTCTCG TGGTACGTTC GGTCGAGCCT TTTGGCTTCC GTGACATTTA TTTCTTCCAC ACCGTCTTGA TGTCAACTCA CAATCCGCTC TTTCTCTTTC CTCGTGGCAG GATGCACGCA CAGACGCTGA CGGCGGTCGT TACTTCCACC AAATACGGCT ACGTCGTCTC GGCTTCACAG GATGGTACGG TCAAGTTTTG GAAGCGTCTC GAAGTCGACG GTGAGCCGGT GGAAGGCCAA CATCCCTGTC TGGAGTTTGC CAAGTCATTT ACAGCCCACG CAGGTCCCGT GCTCGCACTC GCCATGGATC CCGACGAAGG AGTTTGTGCC TCCGTGGGGG CCGATAACGT TATCAAGTTT TACGACGTCA GCACATTCGA CGCGACCGCC ATGATACGTA CGGAACGACC GCTCGGTACG GCGTGCTGTT GGTTACGCTC GGCGAATCGG AGTGAAACCT TGTTGGCGGT CGGTGCCGCC GACACGGGGG ATATTTATCT CCACGCACCC GACAGATCGC GTGTCGTTCA AACATTAACG ATGCACGGGT CCAATATTGT CACGTGCCTG GCCTACAATG CGACACATCA TTGCGTAGTT TCAACCGATC AAAAAGGAAT TATAGAAATA TGGGATTCTT GGGGGACTCC GGATGTATCG GAAAGAACCA GCGACGCAGA TGAAGGCACT GAGGACGAGA AATTTGCCTT GCTTGTGGGA GGACCTCTGG TACCGAGCCG ACACGGCATC GTCTACGGAT CGAAAGTGGA TACTCAGTTG TACGAACTTG TTCGCAAAAA GACCTTTGCC ACTGCTATTG CCATTGAACC GACGGGTGAG CATTTCGCAA TTTACGGATC CGATCGGAAA ATTCGTATTT TCGAACACCG TACCGGGAAA GTTCGCGTCA CGTACGATGA ACGCCTCAAG GCATACGACC GCATTTTTGG AAACTCGCCG TTCCATCTCG ATACCATTGA ATACGGACAA CGAGCGGCTC TGGAACGGGA AATGGATGCC GAATCGACCG TGTATTCGGC TGGCATGGCA CTGTCGAAGA GCGCACCGGT TGGATGTGCG CCACAACGCA TTGCTTTCCA ATTTGATGGC AGCGGCAAAT ACCTGCTAAT TCCGGCACTC GTGGGTATTA AAGTGATTGA GTGGAGAAAG AACAAGCTCG TCAAAATGAT TGGGCAGGCT GATGCGAGCC AGATGCGCTT CCTCTCAATT TGCCTCTGTC CGGGTGATGC CAAAGTAAAT CGTCAGTTGC AGCTGTCCCG CAATGCTAGC AAAAAGACCA GCGCCGCGAC CGAAGCGGAT GAAATTGAAC GTGCCAGTGA TGTGCTGTTA GTAGCACTAG CTTACAACCA GCGGCGCCTA TATGTGTTTA GTCAGCTGGA TCCGGTGGAC GATCGGGATG CGCCAGACAA TGTGCTGGTA CGGAGAGACG TTTGGAACGA AGCGCCATCC GGTCAGGATC AAATCCATAC CGAAAGTCGG CATACGAACA CTACATCCCA AGCTTCTCGA GCTGTCATAC GGACGACTCT GGGAGACATA CATCTGCAGC TATTTTCGCA AGTACCCAAG ACGATTGAGA ACTTTGTTGG GCATGCCAAG TCAGGCTATT ACGACAATGT CATCTTTCAT CGAATTATCA AAGGTAAGTT GTCGACCAAC AACTAGTTCT TGGACTCTTC CAGAGCCTGT CTGCCACGCT AAACGTAACT CAATTTTCTG CTTAGGTTTT ATGCTTCAGA CTGGAGATCC ACTGGGAGAC GGTACCGGAG GGGAAAGTAT TTGGGGTGGG GAATTCGAAG ATGAGTTTGT GCCCGGTTTA CGGCATGATC GTCCTTTCAC TTTGTCGATG GCCAACGGTG CGTTTTGCAG CACTCTCTCC TTGACAATTA CGATGTTTAG ACTGACGCGT TTTTTGAACA GCTGGCCCAA ACACCAATGG ATCTCAATTT TTCATTACGA CAGTCCCTTG TCCGTGGCTA GATAACAAGC ATACAGTATT TGGACGGGTC ACTCGCGGCA TGGACGTGTG CACGGTTATT GAGAATACAA AGACGGATGA GTCAGACAAG CCTCTTGCGG ACGTTCAGAT CCAAAGTATT GACATAGAGT GA
|
Protein sequence | MPEEESKKRP RDNIAESESL KTPKRFWNRE IPSSSHYHVS WMHAQTLTAV VTSTKYGYVV SASQDGTVKF WKRLEVDGEP VEGQHPCLEF AKSFTAHAGP VLALAMDPDE GVCASVGADN VIKFYDVSTF DATAMIRTER PLGTACCWLR SANRSETLLA VGAADTGDIY LHAPDRSRVV QTLTMHGSNI VTCLAYNATH HCVVSTDQKG IIEIWDSWGT PDVSERTSDA DEGTEDEKFA LLVGGPLVPS RHGIVYGSKV DTQLYELVRK KTFATAIAIE PTGEHFAIYG SDRKIRIFEH RTGKVRVTYD ERLKAYDRIF GNSPFHLDTI EYGQRAALER EMDAESTVYS AGMALSKSAP VGCAPQRIAF QFDGSGKYLL IPALVGIKVI EWRKNKLVKM IGQADASQMR FLSICLCPGD AKVNRQLQLS RNASKKTSAA TEADEIERAS DVLLVALAYN QRRLYVFSQL DPVDDRDAPD NVLVRRDVWN EAPSGQDQIH TESRHTNTTS QASRAVIRTT LGDIHLQLFS QVPKTIENFV GHAKSGYYDN VIFHRIIKGF MLQTGDPLGD GTGGESIWGG EFEDEFVPGL RHDRPFTLSM ANAGPNTNGS QFFITTVPCP WLDNKHTVFG RVTRGMDVCT VIENTKTDES DKPLADVQIQ SIDIE
|
| |