Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38508 |
Symbol | |
ID | 7203477 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 273386 |
End bp | 275482 |
Gene Length | 2097 bp |
Protein Length | 578 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182503 |
Protein GI | 219124423 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTATGA CACCTGTCCA GCCTACGCCG ATCACCCGCA CTCCAGACAT TGCCTTTCTA ACAAACGGTC TTGTAGAATA AGACCAAGAG AAATATCAAG CGACGAAGAG AATTTTGCAT AATTGATCCG TTTGTGCTCG GGTTTGGGTC TTCGTGCTTC GAGGAAAACA AACCGGTGAT CAACTGGTTG GTCTATCTAC ACGAACTGTA TCATTTGTGT GGGTGGCAGG CCCGTGTCGA CGACATTCTA AGCTGCTCAG CCACATTCGC GCTAAGCAAC TCAAAACCAT CTACTCACTG ATATATTCCG ATAAGTGCTA TTACTGTAAT CTTAATCGCT GTCGACGTGA AACCACGCAC ATCCAACGAA TTGAAATGAT GCGATCAAGG AGGAAACGAC GTCAAACTAC AACGACAAAC TTTCTAGCAA AATTGACGCC TACGTTCTTG CCCCTTATGC TGCTTTTTTT GTCCGTGCTT TTGACTAAGG GCTTTCCGAT AGTAGCACAA ACCAAACCCG GTGGTGGTGC GACGAGTAGC GGTGGCCAAC AATCAATACC ATGGAAACTC TCCCTGAAGG CAGACAATTC AAATGAGGGC GAACCCGCCA ACAAACCCTC CGGTTCTCGG AAAAGACGAG GACGGAAAGT CAACGGCCAC GACCCGCACG ACCATTATTC ATCTGGGAAT TCAACCGCGA CTGCTCCTGC AGAAGCAGTA GCCGAGTCAA ACAAAAATAA AGAAATGATG GAAGGCTTGC TCAAGCGTGT GACTCAACTT GAGCAAATGG TCGCTCGGCA GTCGGTGGAG GTTAGACGTC TCAAGGAAGA GTGCAAGGAT TTGACCGAAG CGGCTGCAGC TTTTGCTCGT GTGGTGGAAC TGTTGCGGGA TGCTGGTCTT CAAACAGGTG GTAGCCCTAA GGACCTGCAA CAAAAAGAGG TTGATGACAA GCAGGCTGCA ATGGAAGCCA ATTCAGAGAA ACGGATCATT GAGTATTTTG ACGACTCAGA AATCCTCGGT AAGGCGCCTG CTTCCGTAAT CGAAGCGGCG GACGCCGCGG GTTCCGCCAT CCTGGCCATG ATGCTAGGCG GCAAGCAACG TATGTTAGTT GACGTGCGAG ACGCCGAGCT TTCTCGTGAT CCCGAGACGC TGGTACAATT CATCGAGCTC GCCATTTTAC CCGTCGCGGC CGGTTTGGAA GGTCTCAAAT CACAGCGGAA CCGGCTCAAA ATTGTTTTTC CTACTGTATC CCAACTGCTA GTCTATCGGA AGACTATGGC ATTGGCAGCG CCAGAAGTAG TTGCGTTGAG CACATTTGGA TTAGAACCAG TCGAAAAGCA GGATAATTTG GTCGTAATCG TTGCGCCCTC ACCTGATGAC GAAGAAGGTT TGATGGCGAT GAACGAACTA CTGCATCCGA CGGATCCCAA CAGAATATCG ATAAAGCAGC CAGTGGTTAT TTTAAATCAC CATATGGTCC CCATTTCTGG ACCTGTCGCT GATTTTGAGG TAGCTTACCA CTTAAGGCTT CTATCAGTTC AATACATGTC CGGTAATGAC GGTGCAGCGC AGGAATACTT CAAGCAGTTT GAAGGATCCG CTCCTCCATT ACCTGTAAAT ACGCCGAGAA ATGAGACCGG CGAGAAAGAT TCCGCGGCCG ATGCAAACGC CACAACAATA GCTGACACAG ACGATTTTGA TCGTCCCGGT GACGCCTTGT TGGAAGCTGC CATGGAGCAT GCCCAACAAG TTGGTATGTC GCAAGGAGTG ACTAGGGCCA TGGTGATACG GGCCTATCCT AAACCGTGGC ACGTCTTTGT CGATACATCG CCTGGCACCG ACGCAGACTT TGTTGTGGCG GCGACCTATG ACAACGAACC GTCACCACAA GAAGTCAATA TAGCGATTGT GGAGTGTCTA GAGGGCAGCG AGCGGGAAGA TGAGCTAGTG GCACAGCAAA TGCAGGAAGC TCTTGAGTCG GGACAATTGG ATAGGGTCTC GAAGATGTTG GAAATGCTGG ATTTGGAAGA CGGTGAAGAA GACGAAGATG ATGAAGGTGA CAGCGCTTGG GGTCTCTTTG GCGAAGACAC TGTTTGA
|
Protein sequence | MAMTPVQPTP ITRTPDIAFL TNAKLTPTFL PLMLLFLSVL LTKGFPIVAQ TKPGGGATSS GGQQSIPWKL SLKADNSNEG EPANKPSGSR KRRGRKVNGH DPHDHYSSGN STATAPAEAV AESNKNKEMM EGLLKRVTQL EQMVARQSVE VRRLKEECKD LTEAAAAFAR VVELLRDAGL QTGGSPKDLQ QKEVDDKQAA MEANSEKRII EYFDDSEILG KAPASVIEAA DAAGSAILAM MLGGKQRMLV DVRDAELSRD PETLVQFIEL AILPVAAGLE GLKSQRNRLK IVFPTVSQLL VYRKTMALAA PEVVALSTFG LEPVEKQDNL VVIVAPSPDD EEGLMAMNEL LHPTDPNRIS IKQPVVILNH HMVPISGPVA DFEVAYHLRL LSVQYMSGND GAAQEYFKQF EGSAPPLPVN TPRNETGEKD SAADANATTI ADTDDFDRPG DALLEAAMEH AQQVGMSQGV TRAMVIRAYP KPWHVFVDTS PGTDADFVVA ATYDNEPSPQ EVNIAIVECL EGSEREDELV AQQMQEALES GQLDRVSKML EMLDLEDGEE DEDDEGDSAW GLFGEDTV
|
| |