Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48280 |
Symbol | |
ID | 7203400 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 713565 |
End bp | 715083 |
Gene Length | 1519 bp |
Protein Length | 398 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182747 |
Protein GI | 219124932 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAAGG GCCGCTCCTC CCGATCCTTT GGCAAGTTCT CCATTGGCGA CCCTTTTACG CCAGGCGGCA TGACCTCCCA TATTACGGAC ATTCAATTCC TGTATTCATT GCAATGGAGT CGCTACGGCC AGCGGTAGAA AGTGCGAATT GACTGTAAAT TAATGTAATT GTAGATGTGG GGTTTTCTCG GATGTGCAGC GTGGACAACG TAGGCTTGTC GAAGGGGATC ATGTGTCAGA TGGTTTAAGC TGGATATAGT GACGACATGT AAAGACCAAT CCAAATACGT AGATAGGTCA GGTGTCTGGC CTTCATGCGT TTCCAGGAAG AGCGTGACGT AGGCAACGTA GTTCTTTGCT CACTTTCTCT TCGTTTTGGA TCTGGCTTTC ACTTTTGGCA TTTCTTTAGA AGTTCCGCCG CCATCTGCTA TCCGTTCTAG TAACATTGCG CCATACTCAC CGGAAGGACT CTCAAACTGT ACAAACTCGT CGCCGACCGT CCGTACGGGG AATATGCATG TTGCGTCGCG GATGTTGTCC TTTGTGTTTT GGGCAATCCT GTCTCTTTTG CTGGTGCTGA CAGCGAATGC ACTGGGATGT GTTGGTTTCG GTATGCAAGC GTCTCCGCTT TCTCAGTGCA CGAGTCGTCG TCGGTTCGCA CAAAAGAAGG GATCCCACAG TCACCTCTCG TCTCACCGCG CAATGACGCT CAAATTGGCG AAAGCTCTTT CCGACGATGA CACGAGTGCG ACGATTGTGA CAAAGTCCGC GGAGTCCTAC TTTGTCGACG CAGAATTCTT CGAATTGCAA ATATCACCCC ACAGACCCCT TGGTTGCACG GTAGAAGAAA GCCTCGGCGA AGGACGACAC GTATTCGTCA GCAAAGTCGT TCCGGATGGG AATGCCGCTA AAGCTGGAAT CGCAGTGGGA GACGTCCTTA TAGGGGTCAC GGCGGTCACT GGTGATCAAA AAATGGATGT CTCCGGTCTC GGAATCGAGA CGATGTAAGT GTGCGATTGC GTCAAGGACG GTGGAAAAAT ACGATCCGTA CGCTCAAAGA CTCTCTCGTT TCCTTGTTTT TTTCAGTAAA GGACTAGTCG CATCTCGACC GGAAAATGAA TCCCTATCTC TAAAGCTTGC GAGGGGAACA ACCGTCGTCG AGGATCACGA GCAAGCCATT GTGGATCTGT GTGGCAATGA GGACCAGAGC GAGTCGGAAG CTGAGCAATG CGTCTTGGAT TTTTTGAAGA GTGGCTACGA CTACGCCAAT GATTCTGATG ACAGCATGGA GACTGGTGAT GATGTTGACG CCGCAGACAA TGCTGCGGAG GAAGAGGACT TGGTAGGAAA TATGTACAGT ATGTGGAACG AAGACATGCC CGCCGCATCG CCTAAACCGG AACCCATACC TGCATCCGAG GCGGCTAGTG TAGTCAAACC ATGGTCTTCG CGATCAAGCC CATCCGGAAC TTTCATCCGA GATCCAACTA CTGGAAAGAT GAAAAATATA GACGCTTAA
|
Protein sequence | MSKGRSSRSF GKFSIGDPFT PGGMTSHITD IQFLYSLQWS RYGQRCGVFS DVQRGQQVPP PSAIRSSNIA PYSPEGLSNC TNSSPTVRTG NMHVASRMLS FVFWAILSLL LVLTANALGC VGFGMQASPL SQCTSRRRFA QKKGSHSHLS SHRAMTLKLA KALSDDDTSA TIVTKSAESY FVDAEFFELQ ISPHRPLGCT VEESLGEGRH VFVSKVVPDG NAAKAGIAVG DVLIGVTAVT GDQKMDVSGL GIETIKGLVA SRPENESLSL KLARGTTVVE DHEQAIVDLC GNEDQSESEA EQCVLDFLKS GYDYANDSDD SMETGDDVDA ADNAAEEEDL VGNMYSMWNE DMPAASPKPE PIPASEAASV VKPWSSRSSP SGTFIRDPTT GKMKNIDA
|
| |