Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42613 |
Symbol | |
ID | 7196294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 583241 |
End bp | 585017 |
Gene Length | 1777 bp |
Protein Length | 530 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177116 |
Protein GI | 219110729 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.253881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTAAATGGAA CTTCGAAACG CTAACTTCCG AGAAGCGAGT ATTGACTCAG ACAGTTGGCA TGAGGTTTCT CCCCCAATTT ACAGTTATAG TTAACAGTAA AAGGGTTTAG TGATCTCTAG CTGGTAAGTG GAATGGAAGA AAGGTCGCAA CATCGAGAAT CAGGAGCATC AGATACTCCA GACTATGTTT CCCACCATGA ATGGGCAAGT CGACGAAATG GACAACACAA GCCATGAAGA GCCTACCACA AGCTCAAACT CGTCATGTCC TTTATCTGTT GTATCAATTC CGGGACTTGG TGTGCCTCAG AGTCTCTCAA ACGCCTTACG GAGAATTGAA GTACCTCAAC TGGATGCGAT ACCAATATTG CTCGGTGAAG CCGATGCACT GATCACACCA ATTCGAGGAC GAGAAAGGTA TACCGACGAC GACTCTTCGC TCACTGTCCC CCACACTGTG CACACCGCTA CCAAGTTTCC CCATCTAAGT CGTCGGTGGC CTTCCAGGTT GATGCGCCTT ATCAGTGAGA AAAGCAACGA TGATGAGTGG TCTGCGGTGA TGGAACGACT ACAATCTCAC CCGGAAGAAA TTGCCATAAA AGGACCAGCT AATGGTATGA ATGCTTTTCA TGCTGCTTGC GTTCGGTACC CACCTCTTCA TGTGATCTCA GCAATGATTG CAGTCAGTAA TCCAGAAACC ATCGGAGCCG CTAACGCCAA CGGAGAAACA CCCCTTCATT TAGCGGCAGA CGGTGCCAGT GAGGATGTAC AGATGCTCCT GATCGATTGC GCTCCCAAAG CGGCGTTGGC CCAGGATAAA TATGGCGATT GTCCATTGCA TTTTGCAGCA AGATCTGGAG CGACTCGTCA CCTCATGCAG GCCCTTGTGC AAGCGGCACC AGAGTCAATA TCTATCGCAA ATCCACGAGG CGTCACGCCA TTTTCGCTCC TGCCTCGAAC TTTTTTGGAA GCAGAAGATC TAGACGAGAT CTTCGATGAC GAAAGTGATA ATTTTCGAGA CGACTGGGAT CTACACGTGC TATTTTTGAG CTGTAGCTAC GTTGAAAATG GATACACTGT GCCCGAGTTT TCACTACTGG AGGAGGATTA TCGAACAGAC CGGTTCCAAG ATTGGATAGT TCACGCCGCG GCTGTAACAG CAGCGTGTCC ACGACAAGTT TTGACTTTTT TATGCCGCAT GTTCCCGGAT CAAGCCCTCC GCCGCAACGA GAAAGGGTTT ACACCGCTCC TTTTGGCGAC ACAAACGCCT GCAATGGACG AGCCGGACGA GTGGAATGAG AACGAAGACG GTTATCGAGC TGAGATCGAC GCTGTCGAAG GCCGACTTCA GATCGAATCA CATGGTGTTA CCTTTGAGGA AGTTTCTCCA CAAATGGGCG ACTCGGCATT TATTTACCGG TCAACGTCTG CATTGCGACC TGGGTTGAAG TCCAAACAAA AGGACTCGGT CATAGATATA TTGCTTAGGT GGAGTCCACG GTCTATAATC TGTGAAGACG TACAGGGACG TCTACCTTTG GCGCATGCTC TTGTTTCTGG TCATTCGTGG CACACTGTCC GTAGTATCAT TGCTGCTTGT CCACGGGCTC TTGAAGTTCG CGACCGAGCT ACAGGCCTCT ATATGTGTCA GTTGAGCAGT ATCCACTCTC CAGATCTTGA TACAGTGTAC ACAATCGTTC GAAGCCATCC ACACTTCCTA CGATTGTCTG GAAATTTCCA AGCTGGAAGA TCCTGCAAAA GCTCGGTAGG AAATTGA
|
Protein sequence | MFPTMNGQVD EMDNTSHEEP TTSSNSSCPL SVVSIPGLGV PQSLSNALRR IEVPQLDAIP ILLGEADALI TPIRGRERYT DDDSSLTVPH TVHTATKFPH LSRRWPSRLM RLISEKSNDD EWSAVMERLQ SHPEEIAIKG PANGMNAFHA ACVRYPPLHV ISAMIAVSNP ETIGAANANG ETPLHLAADG ASEDVQMLLI DCAPKAALAQ DKYGDCPLHF AARSGATRHL MQALVQAAPE SISIANPRGV TPFSLLPRTF LEAEDLDEIF DDESDNFRDD WDLHVLFLSC SYVENGYTVP EFSLLEEDYR TDRFQDWIVH AAAVTAACPR QVLTFLCRMF PDQALRRNEK GFTPLLLATQ TPAMDEPDEW NENEDGYRAE IDAVEGRLQI ESHGVTFEEV SPQMGDSAFI YRSTSALRPG LKSKQKDSVI DILLRWSPRS IICEDVQGRL PLAHALVSGH SWHTVRSIIA ACPRALEVRD RATGLYMCQL SSIHSPDLDT VYTIVRSHPH FLRLSGNFQA GRSCKSSVGN
|
| |