Gene Information |
Locus tag | PHATRDRAFT_48313 |
Symbol | |
ID | 7203737 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 93222 |
End bp | 95294 |
Gene Length | 2073 bp |
Protein Length | 616 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182895 |
Protein GI | 219125244 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
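The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the source record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends in a single stop codon. The helper names `gene_length` and `coding_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases implied by a protein length (3 bases per codon).

    Assumption: one stop codon is added when include_stop is True.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record above.
length = gene_length(93222, 95294)   # Start bp, End bp
cds = coding_length(616)             # Protein Length in aa
non_coding = length - cds            # bases not accounted for by codons

print(length)      # 2073, matching the Gene Length field
print(cds)         # 1851
print(non_coding)  # 222
```

Under these assumptions, 222 bp of the gene span are not accounted for by codons, which is consistent with the record marking the gene as spliced, although the record itself does not say how those bases divide between introns and flanking sequence.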
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00387548 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACCATCATC TATACTTGCG TCGTGAACAC ACCGTCTCCG TTTCAGTATG CTAGTAGACA AGCAGAATCG AGCGTACGTT GCATGTGCAG CATGTGTGCT GATTGTCTCG CAAGGAACTT TGGCGAATGA GGAGTATCAA GTAGACGCCA TATCGGGAGA ATCCGTCTCG CTCTCCTTGT TGGAATGTGG CATTTGGCTG GCACCCTCAA CCATTCCAGG CGCCGGGCTC GGCATGTATG CTGGAAAAGA CTTCAAGTCT AACGAGAACC TGCAAGCTTC GGGTGACGTG ACGATCCCTA TTGTAGACAT GCCGCATCAT CAGAACGGAG ACTGGAAATT TCTCTGGGAC GAATATACTT GGAACGCGGT GAGTCTCGGC ATGATTCACG AAGGCTTGAT CGACGTAAAT GTTGCATCTC CTGGATTTGG TTCCGCTGCA AATTCCTTCC TACCGTTAGT TAATGTGGAG GAGTGGAACC CCGAATACGG CGCCTGTGGA CTTCATCGAT CTCGAGATCC TGGTGCGGGA GCCTTTTCAA CCTACCACAA TCGCATTTCT ACCGCCAAGA GAGATATTGC CGCAGGCGAA GAACTCTTTG TAGGTAAGAT TCCTATTTGA CTGTCATTAT CAACATGGTG GCCAATATGC ACAGGGAGGT CTCTCACAAT TTTGTGCGGT GTCTGTAAGA CTACGGAAGT AACTGGTTTA CTAGTCGAGG ACATCTAGGT CCCATCCCTC TTTACAAAGA CTTGGATCGG GCCACACACC TGTTTCAGCA ATTTAAAAAC TTGAAAACAA AGTTGCAATC TCCCACTTTG AATACATTGC TGACAGAAGT ATGGAATGAG TTTGTGAGTT ATAGCAACTA TGCAGATTCC AGAGTCATTG GTGCATTTCA TCACCATGAC GAGACGGAGC TCCGGCGACT GGAAAATCAA ACTTTGACCG AGATTCGTTT GCAGGATAGC TCGAGATCGA GAGAGTGGCT AGCTTTACAT GGGACATGTG GTGATCATAT TGTCGCTGGC GTGTCGACCA TTCGTCAAGC CGGCCGTGGA GCTTTCGCGA CGCGTGATCT CCCGGTCGGT ACAGTCGTAT CGCACATTCC TCTCATCCAT GTTCCGGACC GAAAACGATT CGAAATGTAC AACCTCGGGG TTGAAGACGG AGAAAAATAT CCCGATAAAG CTCTCGGGAT TTCTGGCTAT CAACTCCTAT TGAACTATTG TTACGGGCAT GCCAATAGTA CGTTGCTTTT ATGTCCGTAC GGACCTATGG TAAGCCTCAA CATCCAAGTG CTGCTTGTCG TGTGTCCTTT GATGCGTTCT CTCATCGTTA ACAGGTCAAT TTGGTGAATC ATAATAGGAC ATTGGCCAAC GTGGCCCTTC GATGGGCAGA TCCACGCCGC GGAAATCACA TGCCGGCCCT TTTAAAGGGA TCGATTGAGG AACTAGAAAA GGACAGGACG GCGAAGCTTG CCATGGAATT GGTTGCAACA CATGACATCC AAGAAGGAGC GGAGGTCTTT ATGGATTACG GCGATGACTG GGAAAAAGCG TGGCATCAAC ATATCAAGGA CTGGGAAGCA CCGCTAGGAT CCCATGCGTA CATATCGGCC GACGATATGA ATGGTGACAA AGAGATGCAC CTAAGGACCG AGTTCGAATT GCTTGATAAA CCTTACCCTC CCAATCTCGA ACTCAAATGT AACGAAGCTT TTGTTGACAG TGCCTGGGAA AATGCCTGGA ATGCGGGTAC TTTAAGAACC TTTTTGGACT CTAAAAAGGA AGACTCCTTT 
CTCCCGTGTG AAATTCTTCG CTCAGAATGG AAAAAGGGTA AGCTGCTATA CACTGCTGTA CTGCATCAGA AGGGGGTGAC GGACAGTGCA TCGGATCGAA ATGAGCTACT CGAACATGTT CCACAAGACG CGTTTCAGTT CCGAGACAAG CATTACACAA CAGACATGCA TCAAAAGAAC GCATTTCGAC ACGACATTCG AATACCTGAT AACATGCTTC CGAGTGTGTG GATGAATCGA AAAGAATAAT GTAAACATCA CATCAAAACG CCG
|
Protein sequence | MLVDKQNRAY VACAACVLIV SQGTLANEEY QVDAISGESV SLSLLECGIW LAPSTIPGAG LGMYAGKDFK SNENLQASGD VTIPIVDMPH HQNGDWKFLW DEYTWNAVSL GMIHEGLIDV NVASPGFGSA ANSFLPLVNV EEWNPEYGAC GLHRSRDPGA GAFSTYHNRI STAKRDIAAG EELFVDYGSN WFTSRGHLGP IPLYKDLDRA THLFQQFKNL KTKLQSPTLN TLLTEVWNEF VSYSNYADSR VIGAFHHHDE TELRRLENQT LTEIRLQDSS RSREWLALHG TCGDHIVAGV STIRQAGRGA FATRDLPVGT VVSHIPLIHV PDRKRFEMYN LGVEDGEKYP DKALGISGYQ LLLNYCYGHA NSTLLLCPYG PMVNLVNHNR TLANVALRWA DPRRGNHMPA LLKGSIEELE KDRTAKLAME LVATHDIQEG AEVFMDYGDD WEKAWHQHIK DWEAPLGSHA YISADDMNGD KEMHLRTEFE LLDKPYPPNL ELKCNEAFVD SAWENAWNAG TLRTFLDSKK EDSFLPCEIL RSEWKKGKLL YTAVLHQKGV TDSASDRNEL LEHVPQDAFQ FRDKHYTTDM HQKNAFRHDI RIPDNMLPSV WMNRKE