Gene Information |
Locus tag | PHATRDRAFT_44438 |
Symbol | |
ID | 7197678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 547950 |
End bp | 549331 |
Gene Length | 1382 bp |
Protein Length | 458 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178250 |
Protein GI | 219114909 |
COG category | |
COG ID | |
TIGRFAM ID | |
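The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are inclusive coordinates and that the coding region ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 547950
end_bp = 549331
protein_length_aa = 458

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the listed gene span that are not part of the coding region.
extra_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 1382, matching the Gene Length field
print(coding_bp)       # 1377
print(extra_bp)        # 5
```

The 5 bp difference is consistent with the listed gene sequence, which begins with CTAAC before the first ATG.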
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.426284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTAACATGTC CGTCGTGCAG CCTACAAAAC GAGCCAAAGC TGCCGTTGGC CTCATCCTTG TGCTTGTTAT CGGAAGCGAT GGCTTAGATC TGTCTTTAAG ACCGCACTCG AAGCCTTCCA GCGTATCCTA CCCTTGCCGC TTTGTCGTCG GTCCGACTCG CCTCGCGAAG CCCCATCGGC ATCCGCGATT ACCGATGAGA GGATATCGAT ATCCGACAGT GCGGGGGGCA GGCGACTCTC TTAGTAGTGA GCCTTCGCAG AATCGCCCTT CTTGGGCTTG GGTTTGGATG CCGACGTGGT TGTTTACCAT GAATCCGCTA GCTCAGTTTT TGACTACCAT GGGATTTTAT TTACTGCATA TTCTCGTTCT GTCGCAGCGG CAGCTGGTCT TTCCGATCCA GTTGATACCC AACGAGAAAG GTCAATTTGC TTCGATTGGT TATGACTCTA TTGCTGGGAT TCTTGTCGCC GTGTGTTACA CAATATTGCG GAAAGCATCT ACGCAATCCT CATTGCAATC ATCGTCGGAG GCCCCTTTTC CCGCACTCTT CAAGAGTCCG ACGTCTGATG CTCCCTGGAA GCTGCCAAGC AACCACATAA GACACCGAGT CTCAAGCTTC CTGACAATCA TTCTGCTAGT ACAGGCCTAC TTCTTTACGG GTCGCTTTAG TTTGTTTTGG GAAGATACAC TGTACACAAT GTCGGGACTC GGATGGCCCT TGACGGCTCC CATGCACCGC AGTCTTTGCG TATTGTTCGG GCACTTGAGT TGGCTAATAA CGGGAACCTT GCTTTTGCGA TTTGTACCTC GACCACCGCG ATTTTTTGGG CCCAAAGCCG TCTACAACAC CGACGATGAT GATGCGGTTT CAAGCAAGAG TCCAGCAAAA CCGGCCTACC GGTGGTTTCG GTCCAGTATT CGCCGCAACT GGGTATGGTG GGTCGTAGGT GGCTACTTTG TCAGCAGTTG GCTTTTTAAC ATTACTGACG TCATTAATCA GTTTGTCTTG CCGACGGCTG TCTTGGAAGA TGCGCAAGAA TCGGTAGTAT CCCAGTTGGT CAATCCAGAA CACAACGATA TCGCGGCTAG TGTCGCTGGG TACATAGCGC CATGCCTGAC TGCCCCTTGG TGGGAAGAAG TTTTGTATCG TGGGTTTCTC CTTGCGGGAC TGTCACAGCT ACTGGGATAT CCCTGGGCAG TCTTTGTACA GGGTCTTATC TTTTCGGCCC ACCACATGTC TTTGACAGCC GCTCTGCCGC TAGCTGTCCT AGGATGGACC TGGGCGATTC TATACACCAA ATGCCGCAAT CTTTTTACAG TAATTTTTGT ACACGCCCTC TGGAACTCAC GGGTATTCCT GGGATCATGG CTCGGATTAT AG
|
Protein sequence | MSVVQPTKRA KAAVGLILVL VIGSDGLDLS LRPHSKPSSV SYPCRFVVGP TRLAKPHRHP RLPMRGYRYP TVRGAGDSLS SEPSQNRPSW AWVWMPTWLF TMNPLAQFLT TMGFYLLHIL VLSQRQLVFP IQLIPNEKGQ FASIGYDSIA GILVAVCYTI LRKASTQSSL QSSSEAPFPA LFKSPTSDAP WKLPSNHIRH RVSSFLTIIL LVQAYFFTGR FSLFWEDTLY TMSGLGWPLT APMHRSLCVL FGHLSWLITG TLLLRFVPRP PRFFGPKAVY NTDDDDAVSS KSPAKPAYRW FRSSIRRNWV WWVVGGYFVS SWLFNITDVI NQFVLPTAVL EDAQESVVSQ LVNPEHNDIA ASVAGYIAPC LTAPWWEEVL YRGFLLAGLS QLLGYPWAVF VQGLIFSAHH MSLTAALPLA VLGWTWAILY TKCRNLFTVI FVHALWNSRV FLGSWLGL
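The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch that assumes the standard genetic code (the Translation table field is empty in this record) and a reading frame starting at the first ATG, 5 bases into the listed sequence. The `translate` helper and the `head` and `tail` names are illustrative only.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not state a table).
# Codons are enumerated with bases in T, C, A, G order, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, offset: int = 0) -> str:
    """Translate DNA from `offset`, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 35 bases of the gene sequence, spaces kept as in the record.
head = "CTAACATGTC CGTCGTGCAG CCTACAAAAC GAGCC"
# Last 21 bases of the gene sequence (in frame with the ATG above).
tail = "GGATCATGGCTCGGATTATAG"

print(translate(head, offset=5))  # MSVVQPTKRA
print(translate(tail))            # GSWLGL
```

Both fragments agree with the start and end of the listed protein sequence. Passing the full gene sequence with `offset=5` should allow the same comparison over all 458 residues.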