Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48306 |
Symbol | |
ID | 7203783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 77828 |
End bp | 80154 |
Gene Length | 2327 bp |
Protein Length | 704 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182766 |
Protein GI | 219124975 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.485228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCACACGA TGCTCCAACA AGGCAGGAAC GTTACCCAGC GTACTCACAA TTCACAGTTC ATCGACGGAG ATTTCTCTCC ACTGGTGAGG TATCTTTGAG TGTGTAGTAC GGAGACGGAG TGGTACAATT AACCTACCCG TATTTCAGAG ACAACGTCAA TTACAGTAGT ACTGGTTAGC AACCGACGCA GTGTTACGAT TGCAGCATTC CCATGACCAG CAGTAACTCC TTATCAACCA GGCCTTGCCA GATTTTGATC CGTTCCAAGA CAGCGTTGGA ACTCTTCGAC GTCCCGGAAC ATCCGAGCGA TCCGATTTCA GCGTTGCGCC CTCCTTTGCT CCAAGGACCC ACGACATTTC TCAAAACTTC CCCCGACGGA GCCTTTGCTT ACGTTCATAT TGCTGGACAC GGTATTGTCC GGATCGACTT GCGGGCGAGC AACAATGGCA GCACCGCGAC GGACACCAAC ACCGCTCCAT TCTCCATTTC CAAAGAAAAT ACTCCTACAT TCTTGTCCGA TACCGCTGCT GTCCAAATGA TGGACGTGTC ACCACTCGGG ACGTTTCTGT TGACGTGGGA GCGATACTAT GCCGACAAGA AGGAACCGAA TAATTTGAAA GTCTGGAACG CGGCAGACGG CCAACTCCTC GCCGCCTTTG CCCAAAAGAC GTTGAAACGG GAAGGTTGGC CCTACGTGCA GTGGACTCAC AACGAAGCCT ACGCCTTTCT CCACACGCCC TCCGAGATTC GCATTTATAC CCCCGAGTCC CTTTTGCACA ATCGCTACGT TGACAATTTG CGGATTCCGG GCATCCTCAC CCTCAATCTA CCCCGCACAC GTATCCTTTC CAGTACGGAA GCTCACAGCG CGCCTTCCTA CATGTTTACC TCCTTTTGTG GTGGTACCAA GGACAAACCC GCCCGCGCAT CGTTGCACGA GTACGCGCCC GGTACAGGCA CCGGATCCAG CAGCGCGGCC AATCCCTATC CAGCCTTGCT ATCTAAAAGC CTATTTCAGG CCGAGGAATG CGTCACGCAT TGGAACCCTG CAGGCGATAC GGCTCTCTTG ACAATTCAAA CCTCAGTGGA CGCTTCGGGT CAATCCTATT ACGGATCGTC GCAGCTCTTT TTGCTCGCCA AGCATCTGCC CACGGTCCAG GCCGTGCCTC TGCCGCAAGA AGGTCCCGTA CACGATGTCG CATGGATGCC TAGCCTAGAC ATTAACAAGC CTTCGTCTTT CCTTGTCATT GCTGGTCGCA TGCCCTCACT CGCGTCGATG CATCAAGGGG CGGACGGCAA GGCGACCTTT CTGTTCGGCA GTGCTCATAG GAACGTTATT GACTGGGCAC CACACGGACG CTTCGTATTA CTGGGAGGCT TCGGCAATCT AAGTGGGAGT ATGAGTCTGT GGGATCGCAA CAAGCTCAAA CTAATACCGT CGAGCAGTGC GCAGAATAGT AACGTGAACG GCACTTTGCG GGCGGACGCC GTCGTTGGAC ATGGCTGGGC ACCCAACTCG CGTGTTTTCG CCGTTAGTAC CTGCAGTCCA CGGATGAACG TTGACAATGG TGTGCGTATC TTTCGATACA ATGGAGACGA ATTGCTCAAC GTACCTTGGA AAAACGAACA GTATCGCCCA AATCAGTTAT TGGAGGCGGC CTTTGTACCG GCCAAACCGC TCGTGTACCC TGATCGACCA CAGACGCCCA TCAAAGAAGG ACGAGCGAAC GATACTACTG ATAGCACTTC CATCACCTCG GACACGATAA AATCGGCTAC GGCGAAGCCT GCTGGTAGTG GTGGTCGCTA CGTACCACCA TCCATGCGCG GCCGTGCCTC AGCTGGCGGT GGGACCGGTA GCTCACTGGC CGAACGAATG CGTCGCGAAA AAGAAGGCAA CCTGCAGAAG GCTGGCAAGG TCGAAGATGA TAGCAAAAAG CCCAAAGCTT CGACCGTCAC ATCAGCTTTG ACAGGACGGT CTATTCCTGG CCTGGTCATT CAATCCAAAA CCAAAAGCAA GTCGGCTCTC AAGAAGGAAA AGACGAAGTT GAAACAAGCG CAACAAGACG AAGTTAACCG CAAAGTTGAA GCGCAGAAAG CCTTGCTGAT GCAAGGTGAG CCAATCGCAC CCGTTGACAT GACAAAGACG CCCGAAGCTG TGGAGCCATC AGTAGACCCG GAAAAACGTG CTCGCAAACT CAAGAAAATG CTGAAGCAGA TCGAGGAGTT AAAACAGAAA GCCATGGATA CATTAAATGA CGATCAACAG GCTAAGATCT CCTCGGAGCC CGAGCTTTTG GCAGAACTGG CAAGTTTAAA ATTATGA
|
Protein sequence | MTSSNSLSTR PCQILIRSKT ALELFDVPEH PSDPISALRP PLLQGPTTFL KTSPDGAFAY VHIAGHGIVR IDLRASNNGS TATDTNTAPF SISKENTPTF LSDTAAVQMM DVSPLGTFLL TWERYYADKK EPNNLKVWNA ADGQLLAAFA QKTLKREGWP YVQWTHNEAY AFLHTPSEIR IYTPESLLHN RYVDNLRIPG ILTLNLPRTR ILSSTEAHSA PSYMFTSFCG GTKDKPARAS LHEYAPGTGT GSSSAANPYP ALLSKSLFQA EECVTHWNPA GDTALLTIQT SVDASGQSYY GSSQLFLLAK HLPTVQAVPL PQEGPVHDVA WMPSLDINKP SSFLVIAGRM PSLASMHQGA DGKATFLFGS AHRNVIDWAP HGRFVLLGGF GNLSGSMSLW DRNKLKLIPS SSAQNSNVNG TLRADAVVGH GWAPNSRVFA VSTCSPRMNV DNGVRIFRYN GDELLNVPWK NEQYRPNQLL EAAFVPAKPL VYPDRPQTPI KEGRANDTTD STSITSDTIK SATAKPAGSG GRYVPPSMRG RASAGGGTGS SLAERMRREK EGNLQKAGKV EDDSKKPKAS TVTSALTGRS IPGLVIQSKT KSKSALKKEK TKLKQAQQDE VNRKVEAQKA LLMQGEPIAP VDMTKTPEAV EPSVDPEKRA RKLKKMLKQI EELKQKAMDT LNDDQQAKIS SEPELLAELA SLKL
|
| |