Gene Information |
Locus tag | PHATRDRAFT_49291 |
Symbol | |
ID | 7195467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 484856 |
End bp | 486573 |
Gene Length | 1718 bp |
Protein Length | 531 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184003 |
Protein GI | 219127564 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.955571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGAGCGACAG CGACAGCTTA AACGAGAACA GCAGTTGTGG TAGTAGTGCA CGCGGACGAC AGTATGCTCC ACTGAATCCA ATCCGAGACG AACGTGCAGT ACTTTTCAAA CTCGAAAGAA TCATGAGACG GCCTCTGCGG CTCTGGTACA AGGAACCTCC GCCGCCTTCA CACGATCCTC CATCCCCTCA ACTACATCAT CACCACCATT CACGATTTTA TCATCCACAC TGGGGATTTC TCCTCATCTT ATCGGGAGTT GCCTTGCAAA TGTGGGTTTC CGTTACGCAA TTATCGGCCT TGGAGGTACC CGAAAACAAC CTCGACGTGC CCTTGCTCCC AGCCTTTCGG AGCACCCGCG AAGATTCGTC ACGAAAGACG CCGGCGTCCA CCAGTGCGTG GAGGCTCTAC GTCGGTCGTG TACGGTACGG CGTGCCCGTG TCCGCCGCCT CCTACACGTG GGAAGCGCAA CCCCCGCGAG TCGTTCGTTT GGCTCACGAC GGATCCTACC AACATCCGAT TCCCTCCTCC CGGAATCGCG TGGCCCGACG GAATGTGGCG GAACTAGGGA ATGCTTCCTG TACCTACTGT CAATACGAGT CGGATCAACA TCTATGGGAC TTTGAAAAAC CCTTTTACGA GGACTGTACC CCCATGGCGA AATGGCAAAC CATCTTTTAT CCAACCTGCA ATGTCTTGCA CGAAATTACC ATGATGCACT CGGAGGAAGA CTACGATCCA AAAGGACACG ACTTGGCCGA CGAGAGTATC GATGAAATTG AGGATAACGA CGATGACACC GTCCCTTCCA TCGAAGAAAC CACCTTGCTG AGTATGCAGG GCAGTTGGCG GAGTGTTTGG AAGTACCAGG ATGCGGTCAA CGATACGGCC GTACTCAAAG TGCTGAAACA CAGTCGAGAA TTCGATCACG AATCTTTCGC CTACCACATT ACCGATGCCA TAGTCATGGA GCGCTTGACC TCGTCCCCAC ACATTATCAA CGCTTACGGA TTCTGTGGAC AATCTGTCTT AACTGAATTC GCTTCGGGCT CGGCGCGCAA GCTCATTAAG GACCCCAAAT TTAACAGCAA GGAACGGCTG AAAATGGGAA GAGATTTGGC TCGGGCCTTG ACAGCAATGC ATTCAATCGA CTTTCCCAAC AGCACCAACC CCACGCTGGC TCACAATGAC ATTAACATTG CGAATGCCGT TGAAGTGGAT GGTCGCATCA AGCTTAACGA CTTTAATCTG GCCGTCTTGA TGCGATGGAA TGATACACAA CCGTGTGGCT ATCCGGTACG GTTCGATCGA CCCATGTGGG AATCACCGGA AGACGTCCGC AACCTCACCT ACGTAGATCC GGCACTCGGG GACGTCTACA GTCTCGGCAA TCTACTCTTT AGCGTATTGA CAACCCGACA ACCGTGGCTA CATCTGGAAC CGAACGGCCC CTACAACAAA ACAGAGGTTG CACAAATGAA AACGCAAGGC ATCATGCCAG CTATTCCCGA TAAATATTTA GAATCGCGCA AGATGGCGCA TCACGCGTTG TATTTCGCCA TCCAAGCCGC CTACCGAGAT GATCCGGCCG AGCGGCTCAG TTCCCACGAA CTGGCGGAAG CTTTGGGGAT CGCTCTAAAT TGGGGTCGGG ATGGAAGACG CACCTCCCGC ACCGACCTTG CCATGCTGTT TGTCAAGCCG AGACCCGACA TGTACTAA
|
Protein sequence | MRRPLRLWYK EPPPPSHDPP SPQLHHHHHS RFYHPHWGFL LILSGVALQM WVSVTQLSAL EVPENNLDVP LLPAFRSTRE DSSRKTPAST SAWRLYVGRV RYGVPVSAAS YTWEAQPPRV VRLAHDGSYQ HPIPSSRNRV ARRNVAELGN ASCTYCQYES DQHLWDFEKP FYEDCTPMAK WQTIFYPTCN VLHEITMMHS EEDYDPKGHD LADESIDEIE DNDDDTVPSI EETTLLSMQG SWRSVWKYQD AVNDTAVLKV LKHSREFDHE SFAYHITDAI VMERLTSSPH IINAYGFCGQ SVLTEFASGS ARKLIKDPKF NSKERLKMGR DLARALTAMH SIDFPNSTNP TLAHNDINIA NAVEVDGRIK LNDFNLAVLM RWNDTQPCGY PVRFDRPMWE SPEDVRNLTY VDPALGDVYS LGNLLFSVLT TRQPWLHLEP NGPYNKTEVA QMKTQGIMPA IPDKYLESRK MAHHALYFAI QAAYRDDPAE RLSSHELAEA LGIALNWGRD GRRTSRTDLA MLFVKPRPDM Y
|
| |
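The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this), and the helper names `gene_length` and `coding_length` are hypothetical, introduced only for illustration.

```python
# Values copied from the Gene Information table in this record.
START_BP = 484856
END_BP = 486573
PROTEIN_LENGTH_AA = 531


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


length = gene_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)

print(length)        # 1718, matching the listed Gene Length
print(cds)           # 1596
print(length - cds)  # 122 bp not accounted for by the 531 aa protein
```

Under these assumptions the listed Gene Length of 1718 bp agrees with the coordinates, and the 531 aa protein plus a stop codon accounts for 1596 bp of it. The remaining 122 bp may correspond to sequence flanking the coding region.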