Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44486 |
Symbol | |
ID | 7197762 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 708686 |
End bp | 711746 |
Gene Length | 3061 bp |
Protein Length | 834 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178286 |
Protein GI | 219114981 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.330794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCCTCTTC AACCAACGAA CGGTCTTGGT CGCGAGCGAT ATCGAAACAC AGTGAGAGAA AAGGGATCCA ACTGAAGACG AGAGGAAACT CATCCACCCG TTCAACACAA GTTATCTTTC TTTTGAAAAA GGAGTAACTC TTAGTATTCT CCCCCGCACT CGTGTTTCAA GGATGATGAT TTCTCGTATT TCGGTCGTGG CGCTGTTCAA CCTTCTGGTA GCGACATACC ATACGGAGGC TTTTGTACCG TCGCATCACC AGCGTCGAGC AGTCATTCGA ACAACAGCAG GCACATATCC TTCCGCTTAC TCCCGCCTCT ACTTTTTTGG ACGCAACAAG GACGACGACG AGGCGTCGGA AGCCGATAAT ACGGACAAAA AAAGTCCATT CTTCGCACGT TTTGGTATCG GAAAGAACGA GAACGAAGAC GACTCTGAAG ATAAAGCATC GGAACAAAAT TCTAAAGTGC ACGAGACTGT TGCAGCGGTG GCTACCGTAG CCAAAGATGA GGTCGAAAAG GAGCAAGGGT TCACTAGTGG ACCTCTTCCT CGCACATCGC GGGCGCCCAA GCGAGTGGTC GATGCAAACT TGTCGCCCCT CGAGCAGGCG GCCGCGTTGC AAGCGCAAGC CAAGAAAATC CGTCTCGAAG CTGAAAAGCA AGACGCGGAA CTTACTTTGG CGAAAATCAC AAAGTTGGAA AAAGAAATTG CACACGCCAA GAGGCACAAG GAGTCGCATG ATGACGCCGT GGTGGAATTA TTGCAAAGGG AACTGCAGGC TTTGGAAGCC AAAATGAGGG GAGAAGCGCC AGCACCAGTG ATACGGTCTA AGCCATTGTC TAATAAAAGC GAAAGCGGCG CGGTCCCTGG CACGACAGAT ATCGCCGCAA AAGTGGCAAC CCGAGCAATG CCTGAGAACG GCTTTTCAGC CACCGTGAGC ATGGCCGAGG ACCCTGCTGA AGCACTACAA TCGCTCGACG AACTTACCAA ATTTATAGAA AATTCACCCA AATTTATGAA GAAAGCACTC GCGGCACAGG TCGAGCTAGA TTATGCCGAC GTGGAAAACC TTAACACGAC TGAAATGGCG CTTCGCGTCG ACAAAATGCG TCGTCTCGAC TTTTCCTTCT CGGCGCGTCC TAAACCCAGA TTTACCTTTG AGGAAGTCAA TGCGAAGAAG AAGGAACTCA GCAAGGGATG GGCTAAATCT CTGTTGGATC CCCGTCTCAC GTCGGCGGCT AGTGGCAACG AAACAGAGCT TGCCTTGCTC GCTCTGGAAT ACGAGTATTA CAACAAGGGA GGCATGGTGC TTTCGCAGGA TCAACTAAGT GAAATGTCCC AGGACGACGA GTTTATCGCG CAAATAGTTT CTGCGGTCAA CAAGTCAGCT GTGGATAGTT CGATTGAAAC ACTTTTTCCG AAGTGTACCC GTAAAGAAGG ACAGGCTCCC ACCATGGCAC AAGTCCAGCG ACTAATTACC GATGTTTTGC CAAAAGCAAA GTTTTCGTCA ACGTCTAAAC CAGAACCTGT TGCCGGTGGC TTTGTCATTC GGGGAATTAA CAAGGCAGAA AACGGCGACG AGCTGATCGA AGCAATTGAC AAGCAGCTTA CTAAGTACCC AAATTTGGCC GACAAACTAT CCGTCTTGTA CACCAACGAC TTTACTGTCT TTGCCAGTAC CGAGGTTGAA AGCGACGAAT TCAACTTCGA CGATGTTGAG CCCATCCTAT ACGTGACTGG TCCGAATATC GTTCGAGAAC CCCGGAGGGT GCTCTTGTCG TTGACGTCTG CTTTAGGAAT CGCTTCATCT TGGTATCTAT CCCTTTACCC GTTCCTTCTA AATCCAGCGT TGTCTAAACG GGTCGACGAG CAGTTGGCTT TGGCCGACGC CAATATGACA CCGGATCTGA CGTGGTTGAC GGATCTGTCG CTTCCCTTGT TTACGACGTT CGTCTCGATC CAGTTAATCC ACGAAATCGC CCACCGGGCT GTCGCTGCCT TTTATGATGT GAGTGGCGTG CTTTGTGGAA GGAAGCGATA AGATCGTAGG AGTTCGGCTC ACACTTTCTT ATTCCTGTCC CACAGATCAA AGTTTCTGCT CCAACGTGGG TACCTTCCAT CTTTACTGGT ATAACGAGCA GTGTAACCAC GTTCCGGACC TTACCAAAGA ATAAGCAAGC GATGTTTGAC TTCTCGGTAG CCGGACCGCT GGCAGGAATG ATTGCTTCTG CCATCGCGAT TTTTATCGGG TCACAGATCA CTGCAAACCA GGATGCTTCA TTGTATCCTG CTCTTCCTTT GGAGATCTTA CGACAGAGTA CTCTGGGTGG TGGAATTATT GAAAGCATGC TAGGTAGTGG AGCACTGAGT GTTCCTGGTG GTGCTCTCGG GACCCAGGCC GTTGCTCAGA TGATGATTCC TTTGCATCCA GTGGCCGTTG CCGGCTATAT TAGTCTTGTT CTGAATGCCC TTGCGATGCT ACCCGTTGGA AGTAAGTAAT GAGCGAGGAT TCCGGTTTCA TGCGTCAAAT TTTGACTTAC GAATCCATTT GTTTATATTT GGCTAACTTG CAGCCACCGA CGGGGGTCGT ATTGCCTTAT CAGTTTTTGG ACGTGGCGCC AAGCTTCTTG TCGGAAACGC GTTTCTGTTT GCAATGCTTG CAATTGGCCT TTTAGGATCT GATCTTTTCC TGTTCTATTT CGCTTTCTGT ATTGCGTTTC AACCCGGCAA CGAAATACCG TCACGGAACG AGGTTGATCG TGTTGACTTT TCTCGTGTTG TGGTTGCCAC TAGTGCATAT ATAGTTGCTA TATTGGCGCT AATACCTTTC CAGTGAGGGC TTCATCTCAT GTTTTTTCAC AGTCAGTATC TGCTATTTGT CAACCATCAC CGGTATTTTA CTAGGACCCG CCTAGTCGAA TGCAGCAGCA TTTAAATTAC TCCTCAAGCG CTCAATCCCA TTCTAGAACT TGCAGGAACA CTGCCCTGCC TCACCATTGA ACTGTTGTAG ATGCCGTCCT ACTAGAAATT TAATGCAATA GGAATGGATT TAAAAATACT A
|
Protein sequence | MMISRISVVA LFNLLVATYH TEAFVPSHHQ RRAVIRTTAG TYPSAYSRLY FFGRNKDDDE ASEADNTDKK SPFFARFGIG KNENEDDSED KASEQNSKVH ETVAAVATVA KDEVEKEQGF TSGPLPRTSR APKRVVDANL SPLEQAAALQ AQAKKIRLEA EKQDAELTLA KITKLEKEIA HAKRHKESHD DAVVELLQRE LQALEAKMRG EAPAPVIRSK PLSNKSESGA VPGTTDIAAK VATRAMPENG FSATVSMAED PAEALQSLDE LTKFIENSPK FMKKALAAQV ELDYADVENL NTTEMALRVD KMRRLDFSFS ARPKPRFTFE EVNAKKKELS KGWAKSLLDP RLTSAASGNE TELALLALEY EYYNKGGMVL SQDQLSEMSQ DDEFIAQIVS AVNKSAVDSS IETLFPKCTR KEGQAPTMAQ VQRLITDVLP KAKFSSTSKP EPVAGGFVIR GINKAENGDE LIEAIDKQLT KYPNLADKLS VLYTNDFTVF ASTEVESDEF NFDDVEPILY VTGPNIVREP RRVLLSLTSA LGIASSWYLS LYPFLLNPAL SKRVDEQLAL ADANMTPDLT WLTDLSLPLF TTFVSIQLIH EIAHRAVAAF YDIKVSAPTW VPSIFTGITS SVTTFRTLPK NKQAMFDFSV AGPLAGMIAS AIAIFIGSQI TANQDASLYP ALPLEILRQS TLGGGIIESM LGSGALSVPG GALGTQAVAQ MMIPLHPVAV AGYISLVLNA LAMLPVGTTD GGRIALSVFG RGAKLLVGNA FLFAMLAIGL LGSDLFLFYF AFCIAFQPGN EIPSRNEVDR VDFSRVVVAT SAYIVAILAL IPFQ
|
| |