Gene Information |
Locus tag | PHATRDRAFT_46514 |
Symbol | |
ID | 7201593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 570527 |
End bp | 572492 |
Gene Length | 1966 bp |
Protein Length | 595 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180858 |
Protein GI | 219120230 |
COG category | |
COG ID | |
TIGRFAM ID | |
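
The Gene Length entry above agrees with the Start bp and End bp entries if the coordinates are read as inclusive and 1-based: 572492 - 570527 + 1 = 1966. The record does not state its coordinate convention, so that reading is an assumption. The sketch below shows the arithmetic, along with one common way to compute a GC fraction from a whitespace-grouped sequence string. The function names `gene_length` and `gc_fraction` are hypothetical helpers, not part of any database API, and the record does not say how its GC content value was derived.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive, 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G or C bases, ignoring the whitespace that groups the bases."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Coordinates taken from the Start bp and End bp rows of this record.
print(gene_length(570527, 572492))  # 1966, matching the Gene Length row
```

Passing the Gene sequence field to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content row.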
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.710645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACGACGAAG AAGCTGTCAA AATTTCTATT TCTCACTGCG CCGTTGGCAA AGTTTGCATT CCTCGCTACG CTGTTGTTCT TATGCCAAAG GCAACTTCAA TTTCTGCGGA CGACTAACGG TTTTTTCGAT TCCATGCCGG AACACTCATT ACCGTTGGAA GAACAGGTAG ATAGACAGCA CTCGTGTACT ATCTTCTACA ATGTATACAC TGCAGAAGGC CATATGAGGC CAGCCCTCAC AATTATCCGC AGGCAATTGC ACCAAATCGC ATCGTCTTTG GTGGAGGTGT TCGGCCAAGC AGCGCGAAAG ATTCCTCTCA AAATCAATAC GATTGGATAT AATGTGAGCA CTCGAAGCAT TATGAGTGCG TGTCGGAGCC ATACCGGACT GAAGTGTCAA CACTTAGACC ATTTTTTAGA CGGAAATTTC GAGGATGTGA CGCTTAGTGC TATACATGAT CACTGCCAGA CGCAGGCTGA TAATCATGTT GTCGTGTACG TGCACTCCAA AGGCACTTAC CACCCTTCCG AAAAGAATAA CATTTGGCGT GACCTTATGA CCAATGCAGT CCTCAGTAAG GGCTGCTTGG AATTAGGTCG CGGCTACCAG AATAATACGG AGAGCAGCTG CAATGTTTGT GGGTTTTTGT TTCAACCAAT CTGGACGTTC ATGTTTCCGG GAAATATGTG GAGCGCACAA TGCGGATATG TCCGAAAGCT AAAGCATCCT AGGCTTTTCA AAGCAAACAT GGACTTTATT GCAGATGATG CCTTAGACTT GATGCAGCAC GGCCACTTTA CCATGAATTT GGCGCCACCT TTGCCCGTGG ATATTGGTCG CCTCGGTCAA GGAAGGCATG CGGATGAGCA TTGGATCGGA TCCCATCCTT CTATCTTTCC TTGTGATGTT GCACCAAACT CGAATTTATG GAAATGGGTG TCGCCGCCTC CCGAGGGTTT CGTGCCTGGA CACGCTCCTC CGATAAGATG GTCTCCTGCG CCTAGGTACA GTATTTTTGG AGAGTGGGGT TTCTATCGAG ACGAAAACGA ATCGCAAACC AAGCGAATGC TCCGCACGCC GGAGTGGCGC TCAAGGGAAT ATTTCCTACT TCCTGGCTTT CTCTGGAAAT GGCTTCGCTG GTACAACGAA GTGCCTCCGC CAAATTCGTT TGTCTGGAAG TGGTTTCCTG ATGGAGAAAT GTGGCTTGAG CGCTCTCGAC AATTAAATGC TGAACAATTG CTTACCTATA TTTCAGGAGC TCTCGACAGT CCTCTTCCGA CAGTCGCCAC TTTCGAGTCT TCGGTGACAA ATATTCGTCC GCCTCTTGTG TACTTCATGC ACATCGATTT GCCGGCAAAA GAAGATAATC AAAACCGGAT TGTGCAACAG CAACTTGCCT ATTTGAGTTC ATCTTTCAAA ACGAATACAA CAATTTTCGT GTACACCAGC GGAGGTCTTT CATCATTGGA TACAACAGTG TTAGAAAGCT GGTGTAGGTT ACAACATCAG CTGAACTGTA AACATGTTCA GCATTATCCT GCTGCTCTGG ATATTATCAC GCAAACGCGT GTGCAAGACT TCTGTCGATT GCACGAAACC TTTCAAGTCG CATTTCTCAG TGGGCCTTCT ATACCCATAC AGTCTCTCGA TGCACTTCAT GCGTGCTGGG AAGATCATAG AGCAAGTGGA GAAGAGGAAA GCTCATGTGA TGTGTGTCTA TCCAATTCCG GGCTAGAGAA TCACGACAGG ATGCTGTGGA TCTCCCAGTG CTCCTTTGTG AAGGACTTGA TTGCTTCGAC 
TGAACATGCG GCAGCTATGT CTAGCAGAAA GATGAGAAAA GAAATAAAAA TTGGAAAATA CTGGGTTGTA AGCAAGAGAC CGAACACAAC TGGAGCTGCC CGTGTATGTT CATCTGGCTA GGTGATAAAA AAGTTCTTTA TAAATGTAAT GCTTCCGTTT CATTCT
|
Protein sequence | MPEHSLPLEE QVDRQHSCTI FYNVYTAEGH MRPALTIIRR QLHQIASSLV EVFGQAARKI PLKINTIGYN VSTRSIMSAC RSHTGLKCQH LDHFLDGNFE DVTLSAIHDH CQTQADNHVV VYVHSKGTYH PSEKNNIWRD LMTNAVLSKG CLELGRGYQN NTESSCNVCG FLFQPIWTFM FPGNMWSAQC GYVRKLKHPR LFKANMDFIA DDALDLMQHG HFTMNLAPPL PVDIGRLGQG RHADEHWIGS HPSIFPCDVA PNSNLWKWVS PPPEGFVPGH APPIRWSPAP RYSIFGEWGF YRDENESQTK RMLRTPEWRS REYFLLPGFL WKWLRWYNEV PPPNSFVWKW FPDGEMWLER SRQLNAEQLL TYISGALDSP LPTVATFESS VTNIRPPLVY FMHIDLPAKE DNQNRIVQQQ LAYLSSSFKT NTTIFVYTSG GLSSLDTTVL ESWCRLQHQL NCKHVQHYPA ALDIITQTRV QDFCRLHETF QVAFLSGPSI PIQSLDALHA CWEDHRASGE EESSCDVCLS NSGLENHDRM LWISQCSFVK DLIASTEHAA AMSSRKMRKE IKIGKYWVVS KRPNTTGAAR VCSSG