Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48430 |
Symbol | |
ID | 7203610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 438511 |
End bp | 440272 |
Gene Length | 1762 bp |
Protein Length | 571 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182966 |
Protein GI | 219125391 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGATCTGCAC TTACAATAAT TTCATCGGCA AAGATCAAAT TGTGGCATGA CGGCCGATGA CGGTGTTACG GAAGGTACCA AGCCTTTCTC GCTGCGGCTC TTTTGGTTTT TGGATCGGGC GCAATCGTCA CACGGCATTC CTCATCAGGA CTACGCGTCG TATCATCGTT TTTGCACCGC TCGTCTCCAT CGGATCCGCC ACGCCAAGCC CGTCCGGTCT ATGTTGACGC ACAACCACAA ATACGTCGAA GGAGCTTCGG GGCGTCGCCA CGCGTACTGT CACCGCGATG TCCCGCGAAC TGTAGATCAC GAAAATGTAC TCTGGAACGT TGTGTGCCAA GCTGAACGGG CCTGGGCGAC GGCGTGTGAA TTGCAGCAGC AGCAAAGCCT GGTGTCTGTA ACTAAAAAGA AAAGCCCACA CTCGAATGTA ATGCGACGAT TGAATAAGGC ATTGCATTGG GCTAGCCAGC TCTCCGAGCT GGCCGGTACC TGCTGTGACG CAAATACCGT TCAAGAGTGC CTGGCTTACC AAGCTTGGAT GCTCGCGAAC AAGCATCTCG AGCAAAAGAG GCACGCGGAT GCCTATCGAT CGTTCAGGGC TGCTATGACC ACTTTGATAG ACTTGGCACA AACGGCATCG ACGGAGGGCT CTCCTGACGG CTTGGCGGCT GCCGATGTTT GGACTACCAA AGCCGAAACG TATATTCGTC CGCTAGTACG ATTTTGCCAG TACGAAGCTC GTGACGAACT AGACTCCTTG GAACTTGAAG ACTCTGCAAG TAGCACCGGC GCATCGACTA AGGCTACAAC GGATGGTATA AACATCGCAT TTCGAGGAAA AGTTGTTTCT TTGGGGTCGT ACAAGCAGCT GTCCGTGTTG TACTTGAAGA TGGAACAGGA GCTGGCACAT GCACAAGAAC TAGATGAAAC GCGCTTTCTG CAGTTGCTTT CAGACTTAGA CGAAGCTTTG CATGTGATCC AGTCTGACTA TGCCCACTAC GAGTCCTTGC CTGCTGGACC TGCCATCCAA GCGAAACGGG ATGAGCTGTT AGCCCTCAAG GGCTATTTCA ATTACAAAAA ACTTTCGGTC ATGCGTTCGC ACCAAGAGCA GCTTATCGAT AAATTGAAAG ACGATGCTGA AATTGTTCAC GTGTACGACA CACTTTTGCG AAATGTGCAG GCCATGGCAG ATCTGCCTAG TCAGCAGGAA GGCGAAGGGC TGAATGCCCT GGCCGTTGAA GAAGATCCTT ACTGGCTGGA GGCACAGGCC CACGTTGTAC GCATTCGGGC TTTGCGCTGC TTTCATGTCG CTCGTCTGTT TGAATTTGTT CTAGATGGCA CCCCGATGCA AGTTTTAGCA TTGCTCAAGC AGGCTCACAA GCTAGCCACA CGCGCTGAAG AAGAGGTAGC TGCTTGTGAT CTGGATGACA GCGATGCACA CATGCAACAA ATGAAAGGTT TGCAAACAAA GATAAAGACA ATGACTTGTC GAATGCAAGC TCGACGCTAC GTAGAATTGT CTTCTGGTAG CCATTCCTCA TCGACCAATC GTCCCATCTG GCTTCGGTTA AACGATTTGG ATGCCGGAAT GGTCATGGCA GATGACCCTC CAATGACTAT CCCAATACCC TGCAAGCCTA CATTCTACGA TATTGCGTGG CAGCGCATTG GGGGAGATTT CAGTATGGAT GCAGTTGATA AAGTTCTGGC CGCAAATCAA TCAAAAAAGT CTGGAGGTAT TCTGGGCTGG TTCAATAGCT GA
|
Protein sequence | MTADDGVTEG TKPFSLRLFW FLDRAQSSHG IPHQDYASYH RFCTARLHRI RHAKPVRSML THNHKYVEGA SGRRHAYCHR DVPRTVDHEN VLWNVVCQAE RAWATACELQ QQQSLVSVTK KKSPHSNVMR RLNKALHWAS QLSELAGTCC DANTVQECLA YQAWMLANKH LEQKRHADAY RSFRAAMTTL IDLAQTASTE GSPDGLAAAD VWTTKAETYI RPLVRFCQYE ARDELDSLEL EDSASSTGAS TKATTDGINI AFRGKVVSLG SYKQLSVLYL KMEQELAHAQ ELDETRFLQL LSDLDEALHV IQSDYAHYES LPAGPAIQAK RDELLALKGY FNYKKLSVMR SHQEQLIDKL KDDAEIVHVY DTLLRNVQAM ADLPSQQEGE GLNALAVEED PYWLEAQAHV VRIRALRCFH VARLFEFVLD GTPMQVLALL KQAHKLATRA EEEVAACDLD DSDAHMQQMK GLQTKIKTMT CRMQARRYVE LSSGSHSSST NRPIWLRLND LDAGMVMADD PPMTIPIPCK PTFYDIAWQR IGGDFSMDAV DKVLAANQSK KSGGILGWFN S
|
| |