Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_26970 |
Symbol | |
ID | 7199997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 980366 |
End bp | 982778 |
Gene Length | 2413 bp |
Protein Length | 654 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179552 |
Protein GI | 219117515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACCTTCTTT TACCATTCGT TAGTCACAAT AGGAATAGCA AAGGGTAGCA CCGGTTCGGT TTCCAGTCCA CTCGAATTTG CGAAAAAGCG AATACGCCTT ATTCACCACA TAAAGTCTCG CACAAGAGTA TAAAAGTCTC ACTCACAAGT CACTGCATTG GCTACCAAAC CAACAGGCAG GTATCCAGTA GAGCAGACGT TCGGACCCAA ATCTCTTTCA TCGCTTGCTC ATACAGCATC TAATCAGCAA CCAACTGGAT CGATAAACAT CACCGAAGAA CAAAAACGCG AGGGATAGCA AGTTCTGAAC TCTCAAGTGT TTACCTTTTC GCATTCCAAC GAACTCAACA CCACTTTCTA GACTCTTTAC TTCTCCACGA CAACTTCCGC ATCAAGATGA AGTGGGAATC TACCGCATTC GTGCTTCTAC AGCTCATTGT CACTACTCAC GCCTACGCCT TCCCCAAGCC AACACAGAGT TCTCGTTGGG CTACCCCTGC CACGACTGGA AAGTCCTCAG CCGCCAAACA CTTTGTGTCG CGCTCGAGGA GCACTACCAG TCTCGCCGCG TCCACCCAGA AAAAGCCAGA AGAACTCCGT CGGGAGATTG CGGAGCGGAA TTCTCTCGTA GAGGATGAAG CACAGTATGC CGTAGCGGAC GGAGAATTGT TGGAGCGCAT GGGCACTTCG GATGCGGTAG CAGCCGAGGA GACCGACGTC AAGACGGACT ACACCGACAT GTACTCGCGC ATGAAGCGCA TGACCAAACC GAGGGCGTAC CCGCTCTTTT TGGCAGAAAA AGGCGTTGAA TTCTTGGAAG GAACCGTGCA CGATATTGCC AAATCCTTCC AACGCACCGC CGAAACGGGA GCGGCCACTT CTACCAGTGA CGTCAACGGA AGCGTGGGAC AAAAGGAGCG AGTCGTGGTT CTCGGGACAG GCTGGGGCTC GGCTTCTCTA TTGAAAGAAA TCGACACCGA CCTGTACGAT GTTACCGTCA TTTCTCCCCG AAATTACTTT CTTTTCACCC CGATGCTCGC TGGTGCCAGT GTCGGTACAG TGGAATACCG TTCCATTACT GAACCCATCC GGGCGATCAA TCCGCAAGCC AATTTCTTGG AGGCCACCGC CACGAACATT GATACGAAAA CAAACACAGT CACCTGCGAG TCCGTCATTT GCGAAGGCAA TAGTTGTGAT ATCCAAGATT TCAGCGTTCA ATACGATCGT CTCGTAGTGG CGGTGGGAGC TCAAACCAAC ACGTTTGGCA TTCCTGGAGT CAAGGAATAC TGCAACTATT TGCGACAGGT TGAAGACGCA CGTCGCGTAC GAACCTCCAT CATCAACTGC TTTGAACGAG CTAACTTACC GGGTCTTTCT GACGAAGAGA GAATTCGCAA CCTTACTTTT GCGGTGATTG GTGCTGGTCC TACCGGGATC GAGTTTGCCG CCGAGCTGCG TGATTTTGTT GAGGAAGACG GCCCCAAGTA CTATCCGAAG CTTCTCCAGT ACGTGCGCAT CAAGGTCATT GAAGCGTCGC CGATGGTTTT GGCGCCTTTC GACAAAGAGC TCCAGCAAGA AGCCATTGCC CAGCTGAAGC GTCCTACCAT GATTTCGGAC CCCAAAGTAG CGAAGTTACT GCCGCCCAAT TTTCAAATGA CAGAACTCTT GTTGGAAGCT TCCGTCAAGG AAGTCAAGGA GGATCGTATT TTACTGAACA ATGGCCAAGA AATTCCGTAC GGTATCGCTG TTTGGGCAGC TGGCAATGGT CCGATTCCTC TGACACTGCA GTTGATTGAA AGTCTCGGCG ATGAACAAGC GTCGGCACAA GCCGTTGCAC GGGGACGTGT CGCTGTGGAT TGCTGGATGC GGGCCATTGG CGGTCAAGGC AAAGTACTGT CCTTTGGTGA TTGCTCATGC ATGTTCCAGC AGCAGCTTCC AGCGACGGCG CAAGTAGCCT CACAGCAGGG GGAATATTTG GCCAAGCTTT TGAACAAAAA GTTTGAGTTC ACGCCGGCTC TGACTGAAGA TGGCATCTTC CCGCCACCGC GGAAAGACCC CGCCCGGACA CAAACCAGCT TTTCCGACGC GATTGCTGCA TTTGCGTCGA ATAACTACGA ATACGCCAAA CCGTTCCAAT TCTTGAATTT GGGCATTTTA GCTTATACTG GTGGGGGTTC TGCTTTGGCG CAGGTGACAC CCGTGCCGGA TGGTGCTTCG GTCCAGGGCA AGGGCAAACT CGGCAACGCG TTGTGGCGCA GTGTCTACTT GACCAAGCAA GTGAGTTGGC GCAACCGACT GCTCGTGATG AATGACTGGA CCAAGCGTCG ATTGTTTGGA CGAGACATTA CGCGACTTTA GAAATAACAA CAGACTGATA TAATTTGCAA AACAATTACA CTTTACTCTT TTC
|
Protein sequence | MKWESTAFVL LQLIVTTHAY AFPKPTQSSR WATPATTGKS SAAKHFVSRS RSTTSLAAST QKKPEELRRE IAERNSLVED EAQYAVADGE LLERMGTSDA VAAEETDVKT DYTDMYSRMK RMTKPRAYPL FLAEKGVEFL EGTVHDIAKS FQRTAETGAA TSTSDVNGSV GQKERVVVLG TGWGSASLLK EIDTDLYDVT VISPRNYFLF TPMLAGASVG TVEYRSITEP IRAINPQANF LEATATNIDT KTNTVTCESV ICEGNSCDIQ DFSVQYDRLV VAVGAQTNTF GIPGVKEYCN YLRQVEDARR VRTSIINCFE RANLPGLSDE ERIRNLTFAV IGAGPTGIEF AAELRDFVEE DGPKYYPKLL QYVRIKVIEA SPMVLAPFDK ELQQEAIAQL KRPTMISDPK VAKLLPPNFQ MTELLLEASV KEVKEDRILL NNGQEIPYGI AVWAAGNGPI PLTLQLIESL GDEQASAQAV ARGRVAVDCW MRAIGGQGKV LSFGDCSCMF QQQLPATAQV ASQQGEYLAK LLNKKFEFTP ALTEDGIFPP PRKDPARTQT SFSDAIAAFA SNNYEYAKPF QFLNLGILAY TGGGSALAQV TPVPDGASVQ GKGKLGNALW RSVYLTKQVS WRNRLLVMND WTKRRLFGRD ITRL
|
| |