Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35054 |
Symbol | |
ID | 7199994 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 965847 |
End bp | 968336 |
Gene Length | 2490 bp |
Protein Length | 621 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179549 |
Protein GI | 219117509 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGTG TCGAAATTGG ATACTGTACG TTTCCACAAA AAGTCACATC ACATATATCA CGCGTGATGT GACATTGCCC TCGCGACGAA TGTTCGCACC GTCGTCGGAG AGACGACACG GCGACAACGT GTGAGTCCTT ACAGGTCGGT GCCTCGTTTC CGCGAGAAAG GTCCAAACGC ATCTCTCGCG CAGTGGTCCT TCCTATTCGA GCGTCGTCCT TTTGGGGTTT CCCTCACACT GTCCCTACGC GTTCATTCTC AGTTTGACTG GCTTTTGGCT ACTGCTACTA CTGCTGCTGC TACAAAGGCC CGTTGGTCGT CAAAGACAAG CGGGTGTTAC GAGTCGACAG TCGTGTAGAC CACTTCGGTG CTACAGTTAG TCAAGTGCGC AACGAATCCG GAAGCCAAAG CAAAGATACC AGCGGATCCG TTGTTTGTTT CTTAGTGGCA CGTTTGGAAC GAGTTCTTGA TACGTATTGT TTGTTTCGAG CCAACAGAGT TTTTGCAGAG AACTGTCGCA ATCAATCTGA AAAAGCAACC AATTGTGTCT TTAGCTCGGT TTTGCTTGCC GACAGTTGCG ACAGTCTTTC TCCAACAAAC AACTTGTCCA GAGAGTTTTC GAAATCACTG TTGGTATTCT GAGCTGTCGA CCGTCGATTG ACACACTACA CCCAGTAAAC CTTTGGGCCG ACACTTTTCG CAAGTATCTA CCGCGAATTT TGTTGTTTGT AACGTTTACA GCACCAGTTG CGAGGGCACG GACATTCACA CATCGCTCGG CGCCTATGCA GTTTCAATAC AATTCAGATT ACCCCGGTGG TGGTCCGGAC GATAACCGCC TCTTTTCTAC CGCCAAAGCC GCAGCAGACG CCGCAAATCC TCCCGATTTT CCTCCGGCAC ATCCCATTCC GTGTGCCCGG ATCACTACCC GTGTGTTTCA TCCCAACCGA AATCAAGTTG CTACCGTGCA AAACGTTCTG GTACGGACCG TCCTCCAAAA TCCATTACAT TCCTCACCAG CGGTGGCACT ATACAATGCC GCCGAAAACC ACGAGGACTC TTCCATGAGC AGTAGTGACG GGTTTTCCGA CAGCAACGAT GACGACAGTA TGCTGTTGGA CGACGGGTCT CGGGGTGCCT CGGGGCCGAT TGGCTTTCCG GGCATCGCTC CGGCATCGGC TTCCACCAGA ACGGTTCAAC AACAACCCAA TGGCGACGAC GATATGGACA AGGATGGTGA CGATCGCGCA TACTGGATAC AACGGACCAT TCGTGATGCG ATTTACGGGC ACGTGTTCAT GGCGGTGGTG CTGCGGAGAC GAGTACCCAG TCAAGCCGGC AACGACAATG CCGAATGGGA AGTTACCGCA CAACACTGTG CCGTTAAGGA AATGAGCTGG CAGCATATTC GGAAAGAACG CGATCGCTTG GCCGAAGACC CCATAAAGGA AGTCTCTGCT ATGCAATATC TGGTATCCTG GCATCGATCC GAACGGAAAG AATGTGAGCA ACAAGTATTG TCCTCCGCAT CTACAGAACA TCCTCGAGAC GGCGTCTCTC GATCCGTACG AGCCATGGTG GCAACCAACA TTATGATGCC ACTGGATTTG CTATCGGATG ATCGGAATTT ATACAGCGTC ATGCCCTACT GTAATGGTGG CGAGCTTTTT GAGCGACTCG ATATGAATGA ACGATTTAGT GAACCGGAAG CGCGGTATTG GATGAATCAA GTTTTGAATG TACGTATCAA TACAAATGGA GTCATTCGGA AGTTGCCCAT GAAGCACAAT TCTGACCGTG TTCTTGTTTT AAACTTAGGG TATTGAAACT CTACAGAATG CTGGAATATG CCATCGTGAC ATGAGCTTAG AGAACCTTTT GGTACACGAA AATGGTGCGC TGATTATTGA TTTGGGTATG TGTTTGCGGG TCCCTGTGCA GAAGGAGCAT GGAAGCGACA CTCCGGAGGA ACAGGCACAA TTTCTGTCGC AGTCGTTTGA CACGATGAAT ATGAACGGAA ACAACTCAAC GGCGCTATTA ACGCCCACAT CATCCTTGAC AACTACCACC ACAACTATTC GCGGAGGTGC AACCATATGC CGGAAGCAGC CGCGCCGATT GATTACTCCG CAAGGGACCT GCGGTAAATG GATATATATG TCACCGGAAA TATATAAGAA CGCTGCACCT TTTGATGGCT TTGCCGTGGA TATGTGGGCT GCAGGAGTGA TTTTATTTCT CATGCTGACA GGATTTCCGC CATGGGAGCG CGCGTGCCAG ACGGACGAAC GCTTCAAATA TATGACTGCT GGGTATCTGG TTCAGATGCT GACGGAGTGG GACATTGGCC TTAGTCCGGA CGCGATGGAT TTACTGCAGC GAATGTTATT TCTAGATCCT AAAGACCGCT TGAGCTTGGA GCAAGTGCGG GCACATCCGT GGATGGTCAA TGGACCGAGT CAACCGCCAG CGCCACTAGC CGAGTTTTGA
|
Protein sequence | MKRVEIGYCP LVVKDKRVLR VDSRVDHFGA TVSQVRNESG SQSKDTSGSV VCFLVARLER VLDTYSPVAR ARTFTHRSAP MQFQYNSDYP GGGPDDNRLF STAKAAADAA NPPDFPPAHP IPCARITTRV FHPNRNQVAT VQNVLVRTVL QNPLHSSPAV ALYNAAENHE DSSMSSSDGF SDSNDDDSML LDDGSRGASG PIGFPGIAPA SASTRTVQQQ PNGDDDMDKD GDDRAYWIQR TIRDAIYGHV FMAVVLRRRV PSQAGNDNAE WEVTAQHCAV KEMSWQHIRK ERDRLAEDPI KEVSAMQYLV SWHRSERKEC EQQVLSSAST EHPRDGVSRS VRAMVATNIM MPLDLLSDDR NLYSVMPYCN GGELFERLDM NERFSEPEAR YWMNQVLNGI ETLQNAGICH RDMSLENLLV HENGALIIDL GMCLRVPVQK EHGSDTPEEQ AQFLSQSFDT MNMNGNNSTA LLTPTSSLTT TTTTIRGGAT ICRKQPRRLI TPQGTCGKWI YMSPEIYKNA APFDGFAVDM WAAGVILFLM LTGFPPWERA CQTDERFKYM TAGYLVQMLT EWDIGLSPDA MDLLQRMLFL DPKDRLSLEQ VRAHPWMVNG PSQPPAPLAE F
|
| |