Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29887 |
Symbol | |
ID | 7195127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 366536 |
End bp | 368589 |
Gene Length | 2054 bp |
Protein Length | 564 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183349 |
Protein GI | 219126197 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGCACGCTC GTCAGAACGA ATAGGCCTTG TCCGATTTAG GGTGACGACA ACTGAAGCTC CTTTGCCTTC CGAACGAACT GCAGCACTCT AACTCTACTC GCATCTACCT CTACTCGCCT ACGCAGCCTT CTTGACAATG AGCGGTATCG ACAATGAGGG TTCGACGCCG TCTCCCACAA ACACTCTCGT GACGGCCTTG TTGACCGACA TGTATCAAAT ATCCATGACC TATGCGCACT GGAAAAATGA TCGAGCCGAT GATGAATCTG TGTTTGAGCT CTTCTTCCGC AGGAATCCCT TCGGTGGAGA ATACACGATC TTTGCCGGTC TTGACGAATG TCTCAAATTC ATGGCGCACT TCAAATTTAC ACAGTCCGAT GTTGACTATC TCAAGACAAT ACCGTCAATG AAAGGTTGTG ACGATGCTTT CTTCGACTGG CTCTTGCAAG TCGACACATC CAAGGTCAAA GTTTACGCCA TGCGGGACGG ATCCGTCGCC TTTCCCCGTA TGCCACTTCT CATGGTCCAA GGTCCACTGG GCATCGCCCA ACTACTCGAA ACAACGCTAT TGACACTCGT CAATTACCCT AGTTTGATTG CCACCAACGC ATCCCGGATG GTCTTGGCCG CGACGGAACG TCGTGAAGTG TCCGAAAGCC TGCCCGCGCA ATGTCGTCAA ACCCCCGTCT GTGTCGAGTT CGGTTTGCGG CGAGCTCAGG GTCCCGACGG GGGATTTAGT GCGAGCAAGT ACGCGGCCAT GGGCGGATTC GTCGCCACCT CCAACGTACA AGCCGGAAAG CTGCTGGGTC TGAATGTGGC GGGCACACAC GCGCACGCTT TTGTCCAAAC CTACACCGGT CTCGAAGAAG TAAAGGGTCG CACCGTGACC GACAACACTG AGCAAGGCAC TGGGGAAAAT GTCGAAATTC TCCCCAAGGT TTTGGAATAC CGCCACAAAT TGGCGACAGA CAATCCAAAT TTCGGCACCA CAAACGACGG CGAACTGGCT GCGTTTATTG CCTACGCCGT CGCATTTCCA CACAATTTTT TGTGTCTTGT GGATACGTAC GAAACACTAA CGTCGGGCTT ACTAAATTTT GTCGTGGTCA CGTTGGTCTT GGACGATCTT GGTTATGTTC CTAAGGGGAT ACGACTTGAC TCGGGTGATC TAGCCTACCT CTCTCTGGAG TGTGCCAAGT GTTTCGCCAG CTTTGCAGAA AAACGTCCCT ATTTTCACAA TCTTTCGATT GTGGCAAGCA ACGATATAAA CGAGGATGTC CTACATGCGC TCAACAAGCA AGACCATGCC ATAACCGTCT ATGGAATCGG CACGAACCTG GTCACGTGCC AAGCTCAACC TGCCCTGGGC TGTGTGTATA AGCTCGTCGA AGTCGAAGGT AGACCGCGGA TGAAACTCTC ACAAGAAATC GAGAAAGTTT TGATTCCAGG CAGAAAGAAG CCTTACCGAC TGGTTGGTAG GGATGGCCGT CCCATTATGG ACTTACTGAC GGGCTATGAC GAATCTGAGC CTCAGACCAA CGTACGCATT CTATGCCGCC ACGCCTTTAT CGAACGCAAA CGAGCCGCCG TCACACCCTC CCGTGTTGAA GCTCTCCACG TCCTTGTCTT TGACCATGGC AAAGTTGTTC CGGATGCCAA CCGAAATCTG GATCAAGCCC GTGTGGCGGC GGCCGAGGAA TTGCAACGAC TCCGCCCCGA TGTGCGACGG TATTTTAACC CGACACCATA CAAGATTGCC GTAACCGACA GTTTGTTTCA CTTTTTACAT GAGCTGTGGC AATCGGAAAC ACCCGTACCA GAGTTATCAT AGTGACGAGA GCGGAATAGC AACGGCATTG TTTCGATGGT ATTGGAGTTT CTTGTACATG CTCCATGCTT TATATTGTCT GGTTCCAGTA AAACTGGAGG AAGGGAGACT ATGGCACAAC ATGATCCTTG TTTCCTGTGT CAGATCCGCA CAAGGCTACC GGACTTGCAT AAACTTTTTA CAGTTAATCA ATGTTTACTC TCCTGTCTTG CAGAATGAGA CCCG
|
Protein sequence | MSGIDNEGST PSPTNTLVTA LLTDMYQISM TYAHWKNDRA DDESVFELFF RRNPFGGEYT IFAGLDECLK FMAHFKFTQS DVDYLKTIPS MKGCDDAFFD WLLQVDTSKV KVYAMRDGSV AFPRMPLLMV QGPLGIAQLL ETTLLTLVNY PSLIATNASR MVLAATERRE VSESLPAQCR QTPVCVEFGL RRAQGPDGGF SASKYAAMGG FVATSNVQAG KLLGLNVAGT HAHAFVQTYT GLEEVKGRTV TDNTEQGTGE NVEILPKVLE YRHKLATDNP NFGTTNDGEL AAFIAYAVAF PHNFLCLVDT YETLTSGLLN FVVVTLVLDD LGYVPKGIRL DSGDLAYLSL ECAKCFASFA EKRPYFHNLS IVASNDINED VLHALNKQDH AITVYGIGTN LVTCQAQPAL GCVYKLVEVE GRPRMKLSQE IEKVLIPGRK KPYRLVGRDG RPIMDLLTGY DESEPQTNVR ILCRHAFIER KRAAVTPSRV EALHVLVFDH GKVVPDANRN LDQARVAAAE ELQRLRPDVR RYFNPTPYKI AVTDSLFHFL HELWQSETPV PELS
|
| |