Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36165 |
Symbol | |
ID | 7201301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 706577 |
End bp | 707923 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180694 |
Protein GI | 219119886 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAAGG CTCTGGAGCG GCGCATACAG ACCGGAAAAC TAAGACAGCT TCCCTCCCTT GGTACTTTCA CAAACGATTC TACGAAGCAT GCGTTGCCCA CAGGGAAAGT GGTTGATTTT TCTAGCAACG ACTATCTGGG ATTGGCGCAC AGTACGACTC AACACAAGCT GGTTCAAAGG ACTTACAACG ACCTCTTCGC AGAACGACCA CACGACGAAA CCGAAGCGCC CTACGCAACA CGGGCCGTAC TCGGCGCAAC CGGATCGCGA TTACTATCCG GTGATAGTGG TATGTTTCAT GATCTCGAGT CAAAATTGGC TCTACTGCAC CGTCGAGAAG CCGCACTATT GTGTAATTCG GGATACGATG CCAACCTGAC CATGGTGTCC TGTTTGCCAT GTCGCATCAT TGTCTATGAT GAATACATTC ATAACTCACT GCATATGGGG ATTCGCTTGT GGCAACAACA GTCATTGACA AACAGTGCGG CAGACCAACA AAGCACTTTG CGTAAACAAA CGTTTTCCTT TCGTCACAAC AATGTCGAAG ATTTGCGAAG CGTTTTGGAC TCGATTGTAC AAGATGCTCC TGAAATCGTC ATTTTAGCTG AAAGCGTCTA CAGTATGGAC GGTGATGTCG CGCCGCTACA CTCCTTGCTT GATGTGGCCT TAGAGTGCAA TGCCAGTGTC GTCGTAGACG AAGCACACGG TTTGGGTGTT TTCGGTTTTC GGGGCTTGGG TGTACTATCG AAGGAACATC AAACACTCAA CAGCCACCCT GCGTTACTTG CTTCCATCTA TACCTTCGGA AAGGCTGCTG GTTGTCATGG GGCTGTTATC TGTGGCAGTA CAATCTTGAA ATCTTACCTT TTAAACTTTG GATACCCTGT AATTTATTCC ACATCCTTGC CGATGCATTC GCTCGTATCC ATCGATTGCG CCTACGACAC GATGGCGAGT ACTCGCGGCG ATTCGTTGCG CACTCATCTA TTTCAACTGG TGCAGGTGTT TCGATCACTG CTGTTATCGG CACTGAATCT TCATGGAGCC TCTCGCACCG ACCTTGCTTT GTCACCGTCA ACATCACCAA TCCAGGCGTT GCTTATTCCA GGCAACGCGA CTTGTGCCGC CATATGCGAC ACCGTTCACC AGCTGTCACG TCAGCAGTTG CGTTTGTATC CCATCAAGTC TCCGACCGTG CCAGTCGGTC AAGAGCGTAT TCGTATCGTT TTACACTCAC ACAATTGCAC TTCAGAAGTG CAGTGGTTGG TACAACTTCT GACTCAAGCT CTGCAATCGC ATGGCTTGTT GAAGACAGGT CCCAGCTCTA TACTCGCTAA GCTATAG
|
Protein sequence | MRKALERRIQ TGKLRQLPSL GTFTNDSTKH ALPTGKVVDF SSNDYLGLAH STTQHKLVQR TYNDLFAERP HDETEAPYAT RAVLGATGSR LLSGDSGMFH DLESKLALLH RREAALLCNS GYDANLTMVS CLPCRIIVYD EYIHNSLHMG IRLWQQQSLT NSAADQQSTL RKQTFSFRHN NVEDLRSVLD SIVQDAPEIV ILAESVYSMD GDVAPLHSLL DVALECNASV VVDEAHGLGV FGFRGLGVLS KEHQTLNSHP ALLASIYTFG KAAGCHGAVI CGSTILKSYL LNFGYPVIYS TSLPMHSLVS IDCAYDTMAS TRGDSLRTHL FQLVQVFRSL LLSALNLHGA SRTDLALSPS TSPIQALLIP GNATCAAICD TVHQLSRQQL RLYPIKSPTV PVGQERIRIV LHSHNCTSEV QWLVQLLTQA LQSHGLLKTG PSSILAKL
|
| |