Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47390 |
Symbol | |
ID | 7202437 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 509092 |
End bp | 511604 |
Gene Length | 2513 bp |
Protein Length | 668 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181740 |
Protein GI | 219122828 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTGAGCAG CATACAGTAA AAAGCTGCAC AACCAAGAAG TGTATGTATG AAAGGAAAGC TTTCGACTCC TGATCGTCAC TCATCTTCTT TCTGTGTATG GCTTCGGAGA ATCCGTGCTC GTAATGCGAA TCGTCCGTGC GATTTCGAGC TGTATTCCAA ATAGGATTCG AGATGCCCGA TAGGAGGGAA TCTCGCCGTG CCGTCGACGC TATGGCGTGG ATTTTAGTTT TGGCTGGATT GTATTGGTTC GCGGCGTCTT TCTTTCTCGC CAAACGAAGC CTGTCTCAAG TCTCATCCTG TGACGAGGCT CGTGTTTTAC TGTACGAAAC GCTTAAACTC GGACAGACCT CCGGGACGGG TTCTGTTGTT CAAAGGCGCC TTGATAGTAT TCTGTCGTCA TCAGCATCGA GACGGGGTGG ATGTTGGATG GATCGTGTCG TTGACTCCAT GGTTTTTATT GTTGTCGACG CATTGCGATT CGATTTTGCC CTCTACAATC TACCCAAATC AATCGGCAAA CGTTTGGACA AGGCCAAACA TCATGCGTCG ATTTCAAAGG TGGCAGACCA TACCATACTG ACAGCGTCAG CAACCTCTTA CAGTAAGTTG TACCAGTTCG TGGCCGATCC TCCAACTGTC ACAATGCAAC GCCTTAAAGC CCTCACGACG GGAGGGCTAC CAACTTTTGC CGATATCTCC AGTAATTTTG GAGGTGGAAC GGTAGACGAT GATTCCTGGA TAAGGCAAGT ACTACTCGAA CGAAATTGGG AACGGCGGGG CCTGTCGAGG AACACTAAGG CCGCATTTGT TGGCGACGAT ACCTGGGATG ACTTATTTCC GAACATTTTT ACCGAATCGT ATCCGTATCC ATCATTCAAT ACCCGAGACT TGGATACAGT AGACAATGGA TGTCTTTCAC ATGTTCCTGA TCTGATTTCG CGTTTGAGAC ACTCGGAAAA TTCAAACGAG ACGACTACAA CGGGTAACGA CGATGATTTA GAACTAGTTG TCGTCCATTT TCTTGGTGTC GATCATGTCG GCCACACATA TGGTCCTCAT AATCAGCACA TGGATGCCAA ATTGCGCCAA ATGGACGACG CTTTGGCATC AGTCTTAGAC TTTTTGGACT CGTCTGATTC TTGTCATGCT GCAATGATTT TGGGTGATCA TGGCATGACA GAAGACGGCA ATCATGGAGG TGGAACCGAA GAAGAAGTCA ACGCAGCCCT TTTCGTCCAC ATGTCCGCTA CTTGCCATGT CAGGGAATCT GACAAGAACG ACGGACCATC GCGCATGGAT GCTGGCTTTT TGGATACATC TCGACATTCA GATCTCGTGC AAGCCGCATT TTCTTCGATT CATCAAATCG ATCTAGTTCC TACCCTTTCG ATCATGCTTG GCATTCCGAT TCCTTACGCA AACCTTGGCA GCCTGGTACC GTCACTCATA CCGTCCCAGT CAATAGCAGC CATCACGACT GCACTAGCAC TTAACGCTGC TCAGGTTTGG AGGTACTTTA CCTTCTATTC CAATACAGCC AACAAGTTAC CAGGTCTACC TGAGCTAGAA AGCAAACTGG CTATGGGCAT TAAAATGTTT GAGGATGCTT TGCACCAAGA CGATCCAGAA GCGGATGAGT ATTTCCAAGC CGCGACACTT TTCAAGTTTT ATCTGAGAGA AGCGCTGGAA CTTGGACAAC GTGTGTGGAC TCGATTTGAT GATACTGGAA TGACGATAGG TATCATCATC ATAACCATTG GGTTGATATG TTATGCAGCC CCTCTGGTAC GACCTGAAAA GGTACGATTG CTTCCATCGT CACAATTTTG GGAAGTAAGT GTTACGACGG TGTTCATGGT CTTTCAGTGT GGGCTCTTGA CGTTCAGCAA CAGTTATATT CTCGAAGAAC AGCACACCAT TATGTTTTGT TTGACTATTG TGTCGGTGAT TTTGGCATTG CGAATACGCA GAGAATCGGC ACACGACATC ATGTGGCGCG CTGTATTGCT ACTTCCCATG GCCTCTCGTT TGAACGAATT TTTTGTCTTT GGACACGGTA TGGATCCTTC AACAGGATTG TACGCAGCGC ATAGTGCGGC CTGCTTTCTA TCGAGTCTCT TCGTTTTTGG TGCATTTCGA TGGTATCTTT TCCAGAAACG GATCACAGTG TCCCTTTGGA CTACTTCGGA GGATTATGTG GTACTGCTTT GTCTTGCGAA GAGTTGGTGG GAAAAGAGTA GTGTCGAACC AAATCACAAT GGTTATACCG CTTCCACTGT CGCTTTAGCC ATCATTTTTG TCAGTACACC AGGGATGATC CTCCGGGCGC TTCTGGCAAA CAGAAATGCC AAAAAAGGGA AAAGAAAAGC GCTCGACGTC ATACCATTTG AAGGTGTTAG TATAGAAATA TTGTTGAAGC TTCTTCTTGC AATCATGGCT GTGACGGGGC CCTCTTCGGT GACATCTCTT GTTCTCTACA CCTTTCAAGC AGTTGTTGTG TTTCACTTGT CTG
|
Protein sequence | MPDRRESRRA VDAMAWILVL AGLYWFAASF FLAKRSLSQV SSCDEARVLL YETLKLGQTS GTGSVVQRRL DSILSSSASR RGGCWMDRVV DSMVFIVVDA LRFDFALYNL PKSIGKRLDK AKHHASISKV ADHTILTASA TSYSKLYQFV ADPPTVTMQR LKALTTGGLP TFADISSNFG GGTVDDDSWI RQVLLERNWE RRGLSRNTKA AFVGDDTWDD LFPNIFTESY PYPSFNTRDL DTVDNGCLSH VPDLISRLRH SENSNETTTT GNDDDLELVV VHFLGVDHVG HTYGPHNQHM DAKLRQMDDA LASVLDFLDS SDSCHAAMIL GDHGMTEDGN HGGGTEEEVN AALFVHMSAT CHVRESDKND GPSRMDAGFL DTSRHSDLVQ AAFSSIHQID LVPTLSIMLG IPIPYANLGS LVPSLIPSQS IAAITTALAL NAAQVWRYFT FYSNTANKLP GLPELESKLA MGIKMFEDAL HQDDPEADEY FQAATLFKFY LREALELGQR VWTRFDDTGM TIGIIIITIG LICYAAPLVR PEKVRLLPSS QFWEVSVTTV FMVFQCGLLT FSNSYILEEQ HTIMFCLTIV SVILALRIRR ESAHDIMWRA VLLLPMASRL NEFFVFGHGM DPSTGLYAAH KTDHSVPLDY FGGLCGTALS CEELVGKE
|
| |