Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45154 |
Symbol | |
ID | 7200335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 336242 |
End bp | 339409 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179408 |
Protein GI | 219117227 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACGGT TGGGCACGGA CTTCAACGCC ACTGGTTCCG TCATGAGCAA ACGGGAACTG GAAGAACTCC GGCGGAAAAA GGGAGAATGC GTCCGTTGCG GACAAAAATG TTTCCAGAAG AAGCTCTTCA AAATGATTCC CATTACGGAT CACGGCAAGG TGCTCAACGG GCGCTGTTTG GGTTGCAATC CTCTGCCCGG GGACGGCGAC GCCTCGGGCG TACTGCCGGC CGTATCGCGA CCCGCCACGA TGCAAGATCT CCAACGCTTC AATCGGTCAC AGAACAATCT CGTTTTGCCG CCGAATACCG CTAACACCGC TACCGGGAGT GTTGCCGGAG GAAGCGTTTC GGGGAGTGCA AACGGAGGGG GGTCCCGGTC CAATTCGTCA CGCACGTTTA CCCCCATGAC GAACACACCG ACGGGGAGCC ACAACGGTAA CGTTGGTACG GGTAATTCTT CCACTACACC CCGTCGCAGC GCCCCTCGGG CATCCTCTTC GCGGGGCTTG ATCAGGGGAC AGTCGGAACG TGCCAGGGCA TCGTCGGGTA CGGCCGCTTC GATTACCACC GCCACGGGAC GGTCGATGGA GAGTCTCTTG CAACAGCAAT CGGAAGAAGT CCGATTGGAA TCGTCACAGC TAACGCCGTC ATCGTCGCAC CCCGACCAAG ATCACATTCC CGAACAGTGT CGTCAACAGT TGCATTACGG GTCCACGGCA TCGCCGTCAA CTGGTTTCTT GTCCCAACGT AGGAACGGCT TGCAACGTGG GGGTAGTTTT GCGCGGACCG GACGGGGTTC GCCGGAAGTT TACGGAGAGA CTCTGTCCAT GTCGCGGAGT TGCTCCACGT CTCCGGCCAA TACACCAACC GGATCATCGC CGCGGTACGA CATGGAACAC CGACAAAGCA GTGTCCGCAG TGGATTGTCC CACATTAGTT TGGATTCGGC CTTGTCCGGC GCCTCCGCGA TATCGTCGCC GCATCCACAC TGGCAACAAA TCGAGGAAAG TCCCGTCAAT AGGAGGAACA GTGTCAACAC CAGTATCGAC GGCAGTATCA GCAGTAGTGC ACACGTGCGT CAATCACCAC ACTGTAGCAA TAGTGGTGTC GATACGAGTG TGAATGGTTC CGGTGACTTG GTCCACACGG GGCCGACCAC ACACCCTTAC TACGAGCACG ACACCTTCGC GGTCCCGTCA TCGCGCGCTA CGGATACCTC CTACGAATCG CTTCCATCTC ACCACATCTC GACGGCGTCC ACGGACGACG TATCGGGAAA TGATGAACTA CACGATGCAC CATCTCCTGT CACGGAACAG AGACCGGATG TGGATTTGCC ACAGCAACAA CCAGACACCA ATTTAACGGC TCATCCAACT GCCGCAGAGC TTTTGGATTA CTATCGTCGA ACCTTTCGAG AAGCATCCCA ACGCTCGTTA CAGCACCACG GTAGCAACGG TACTGGATCG GTCGCTGGGG GCCGGATGAC TAGCGGGTCG CTGAGTGGCA GCGGTGGCCC CGAAACGACA GGAATGCCAG CAGGGGAGAG CGATCATCTT CCTTCCCACG GTGTGTTGAA CCGCGGCGGT GTCGTGTCTT TTCAGCACTT GCAACATCAA GGGAATCAAG CACAAGGTCC GCAGAGTCTC ACTGGTACCG TCAATAGTAG TTCGCATCAT CATCGACGTG GAAGTTCACG GCCCAGCAGC TCGCGATCTT TGGATTCGAT GAGTAGTTTT GGGGAAGAAG TGAACGCCAG CATTTTGAAT AGCAATGATG CGGAGAGCGC CCCGAGCGAT ACGAGCTTTC GAGAAAGCTC TGCGGTATCA CAAGATCCTG GCGTCGCCCG CATTCAACAA GCCGGTGTGG ATTTCGTTGA AGTTCTCAAT ACTTTACGGG ATCTCCCGGA CTCTTTGCGT ACACAAACAG CTGGTTTGCA CGTATTGTCT GAATTGACCC TGAGTGAAGA AGATTCGGAG ACGTTGCTCA ATATTGGAGT GGTGCAGGTA ATCTTAGACG CAATGCGTCG TTACGCCCAC GATACGTCTC AGGTCGAATT GCAAACGGCC GCCTGTCGAG CTATTCTAAA CGTTACGGGA ACGTCGGAAG CGCAAATAAA TTTTGTGCAA AACCAAACGG TTGAACATGT GTCCACCCTG ATGCAAAATC TTTTGGAGAA TGCGACCGTG CAGGAATATG CAATGGCAAC CATCGCCAAT TTGAGTGTCC TTGAAGCGAA CTTGCCGATT CTGATAGAAG AACACTCGGT CACACGCATT GTTGAAGCTA TGAACAAGCA TTCCGAAAAT CGTCAAGTCC AAATAAAGGG TTGTTCCGCC ATTACCAACA TGGCTTCACA CACGACGCCT TTGAAAAAGA CCATCATGGA CCAAGGAGGA GGCGGAGCTG TGGTCGTTTC CATGGTGATG CATCCTGGCG ATGTCGAATT GCAGGAAAAA GCACTGCAGG CCTTGCGCAA TTTGTCGGCA AACTCAGACG AGAACAAAAT GGAGCTAGCT CGCATTGGCG GGATCGAATC AGTGATTGGT GCCATGCAAG TCCACCGCGA CGAAGCTGGT ATTCAAAAAA CTGGATCCTG GAGCTTGTCC AACTTAGCCG GTTTCGTTGA TAACAAAAGG ATAATTGGCG AGTGCGGGGG AGTGGACGTG ATTGTGCGAG CTATGTGGGT ACATTCGGAT GAAGTATCGG TTCAAGAATG GTGCTGTCGA GCCTTGTTTA CCTTGGCACT GGAGCCCCAG AACCGATTGG TAGTTTTGGA CGTGGGTGGG ATCTCGGCTG TAGTTAATGC TATGCAAGCA CATGTAGATT CATCGACGGT TCAGGAAATG GGATGTGCTG TCTTGTGCAA TCTAGCAACA GATCAAGCAA CCAAGCTTCG CATTGTAGAC GAGGAAGCCT TGGATGCCAT CGTGTTGGCC ATGGTCCTAT TTGGCGACGA AATCAAAGTA CAACAGCAAG GATGTCAAAT TTTATCACAG CTTTGTGTTG CCGAAAACCT TAAATCATTA CAAGCGTCAA ACGCGGGAGA GCTAGCGCTG GCAGCGGCGC ACAAATTCCC GGAATGCGAC GCGCCAGCAC AGTGGTTGTT GAATTCGCTC GAAGAATTTG CTGCTGCGTA TATTGAGACT ACGGAAGCCC ACCATTAG
|
Protein sequence | MERLGTDFNA TGSVMSKREL EELRRKKGEC VRCGQKCFQK KLFKMIPITD HGKVLNGRCL GCNPLPGDGD ASGVLPAVSR PATMQDLQRF NRSQNNLVLP PNTANTATGS VAGGSVSGSA NGGGSRSNSS RTFTPMTNTP TGSHNGNVGT GNSSTTPRRS APRASSSRGL IRGQSERARA SSGTAASITT ATGRSMESLL QQQSEEVRLE SSQLTPSSSH PDQDHIPEQC RQQLHYGSTA SPSTGFLSQR RNGLQRGGSF ARTGRGSPEV YGETLSMSRS CSTSPANTPT GSSPRYDMEH RQSSVRSGLS HISLDSALSG ASAISSPHPH WQQIEESPVN RRNSVNTSID GSISSSAHVR QSPHCSNSGV DTSVNGSGDL VHTGPTTHPY YEHDTFAVPS SRATDTSYES LPSHHISTAS TDDVSGNDEL HDAPSPVTEQ RPDVDLPQQQ PDTNLTAHPT AAELLDYYRR TFREASQRSL QHHGSNGTGS VAGGRMTSGS LSGSGGPETT GMPAGESDHL PSHGVLNRGG VVSFQHLQHQ GNQAQGPQSL TGTVNSSSHH HRRGSSRPSS SRSLDSMSSF GEEVNASILN SNDAESAPSD TSFRESSAVS QDPGVARIQQ AGVDFVEVLN TLRDLPDSLR TQTAGLHVLS ELTLSEEDSE TLLNIGVVQV ILDAMRRYAH DTSQVELQTA ACRAILNVTG TSEAQINFVQ NQTVEHVSTL MQNLLENATV QEYAMATIAN LSVLEANLPI LIEEHSVTRI VEAMNKHSEN RQVQIKGCSA ITNMASHTTP LKKTIMDQGG GGAVVVSMVM HPGDVELQEK ALQALRNLSA NSDENKMELA RIGGIESVIG AMQVHRDEAG IQKTGSWSLS NLAGFVDNKR IIGECGGVDV IVRAMWVHSD EVSVQEWCCR ALFTLALEPQ NRLVVLDVGG ISAVVNAMQA HVDSSTVQEM GCAVLCNLAT DQATKLRIVD EEALDAIVLA MVLFGDEIKV QQQGCQILSQ LCVAENLKSL QASNAGELAL AAAHKFPECD APAQWLLNSL EEFAAAYIET TEAHH
|
| |