Gene PHATRDRAFT_45697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45697 
Symbol 
ID7200749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp965815 
End bp969018 
Gene Length3204 bp 
Protein Length1067 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179755 
Protein GI219117940 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.920354 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGGGG ATCCACGCCA GCCACCGCAG CCGTCACCGG AATCGGCGCA TATCGGAGAC 
AAAAAACCCG AGACGGCACG ATCCATTCTA TGGAGTGCTG ACGATCATCC CGACGAGGAA
CTGGAAGATG TGGATTTGCC CTTGCCAGTC GTCGATCATC CGTTGACCGT GCCGTTGTCG
ACGAACGCGT CGCACCCCGT ACCGGCGGCG GATCACGAAC TGATATCCCG GATTCTGGTA
CCGTCTCCGT TTCCTCATCA CCACGCGGGC CCCACCAGCG CTCGTTTCAG GAATGCCTTG
AATCGGGTAC AGACTCAACC CACTGCGGAC GTCGAAGCCT GGCAAGCCCT CTTGACGGAA
ACGCAAACGT GCTACAAACA GATTGTGGCC AACAACACAT TGACAAACTC CGTTACGCAC
CCGTACACAA CCGACGCCGA GACGGATGTT GTTGCTCGCG TCAAACAACA ACAATTGGAT
TGGGTGGAAT CCTGCTACGG TGCCGTCTTA CGCTACTTCC CCTACGCCAG TAGCCACGTA
CACACCGTGG CAGAAATTCT GTGGACACTG TCGTCGCACG GCGTCGCGGA AGAACAGCAG
TTACTTGTAT TCGACGGCAC CAACCATAGT CACCACAGTA CCATCAACGC CTCGTCTGTG
TCTCCCCAAC GAACTCAACT GTACGAAGCC AAACTGGAAC GACTCTTGTC TCGCTATCTC
GGTGTCACGT TCGACCATTC CTCGTCCGGT AGCAATTATC CCAACGGTCG GGACGCCGAT
ACGCCACCGT CCCCACCCGA AAACACGGCA CTACCCGGAA TGTGCGATTG GATGGTAGAA
TTGTGGTTGT TGTACGCTCG GAAAAAACGG CGTGACGCTC TACGGCACAG TAACCTCGCA
CAACAACAAC AACTACCCGA CGCGCGTGTA TCTTACGTTC GGGACCAAAC GTTACAAGCC
TACGAGCAGG CACAACCCTT TGTCGGACAC GGTGAAAATA ACGTCATCTT TTGGAAAGCA
TATTTGGACT TTGTCCGTTC CTGGACGGCC ATGGCCAATG AGGACGCGAA AAACCATCAC
GCGGTGGCGC AACAGCAAAT GGTGCGACTC CGGACCATTT ATCAGGCCTT GATCAAGTAT
CCCATGACGG GATTGGATCA GCTGTGGCAA GAGTACGAAG CCTTTGAGCG GGGTCAGAAC
GAAACACTCG CGCAGGCGTT GACGCAGGAA CTGTTGCCCA CGTACCAACA CGCCCGGACC
GTGTATCTCG AACGACACCG TGTGTACGAT ACCAACGATC TACAACTAGG TCGACTCGCC
ACGCCACCGG CGGACAATGC CGTCACTCAA GAAGAAGATT ACGAAACGAA ACGAGCCGAA
GAGCAAGCAT TACTGCGTGC GTGGAAAGTG CGGGTTGCCT ACGAACGGAC CAATCCGGAA
CGTTTGAATT CGAGTGAGTT TGCACGACGC GTCCGACAGG TCTACCAGGC GATGGTTTCG
GTGCTGACGC GGTACCCAGA AGCTTGGCAC ATGTGGAGTA CCTGGGAATT GTCCGTGGCC
ACCGGTACTA CCACGACGTC TGATGTCACT GCTGATGGAC GACACCACGA ATCAACCATC
ACGTTGGCCC GTGCCGTCTT ACAACTCGGA CAAAGTCATA TTCCAGACTG CACATTACTC
GCGCATACCG AAGCCATCCT AGTCGAACTT CATGCAGTGG ATCCCAAATC ATGCTTAAAT
GTCATGGAAC GGTTTGTGGA TCGTAGCCCC AATACTCTCG GCTTTGTTCT ATATCAGCAA
CTGACGCGAC GATATCAAGG TATGGAAGCT GCGAGAAAGG TGTTTGCACG TGCACGACGC
GTATTGGTAA ATCCGGCCGA AGCCGCGGCC GCTGCAGCGA AACAAGATGT CCGGACTGAG
GACGGGGTTG ATGCAGAGAA TCATCCACAC GACGAGGGAA GCGGCGGCAA ACGCTGGGTA
GTGACAAATA GATTAGATCC TAACATTGGA CCGACAAATG GGCAACAGGT GCAAGGTGCT
ACCGAAACGA CGACAGGACA AGAAGGGGTT GTAGACGGAA GCGAAAAACA TCCACCGGGT
GTTATTACTT GGCACCTATA CGCCTCGCAC GCAAACATTG AGCACAGGGT CAACAAGGCA
CCGGAAGTTG CAGCGCGAAT TTATGAGTTG GGTTTGCGGA AGCACGCGGC CTTTTTGACC
GTACCGTCCT ACGTAATGCG ATACGCCCAA CTACTCCTGG AATTAAACGA TACAATGAAT
CTACGGGCCT TGTTAACTCG CGCTGTCGCT GCTTGTGAGG CCCAAGAAAA GGAAAACTCC
CTGGCGTTGC TCTGGAACAT GACTTTGCAT TTTGAGTCGG TCATGGGAGG GTCCGATCCA
ACTAGTGCCG TAACGATGCA GAAAATAGAA CGACAACGTC GTGCAGCCCT GATGGGTGCT
AACGTGGAAG AGGTAGCCAC TGGAGGGTTT GTTGGAATTA ATGAACCAGC CTTGATTGGT
GCTCAAAAAT CCACTATTGC AGACCAGTTG GTGCGAACGG AAAGCTATGA TACGAGTTCC
TCTATTGTAA ATGGAATGAA CCGTGCGGTG GATGTTTTGG AAATAATGGG GTTGTGGGGA
AGCGGGGAGT CATCAGTGGA TCAAGCTCGT CGTCGGATCA AGCAAAGCAA AAACCGGGAA
AGCGAAGTTG ATATTTCCGG TGGGAAGAGC GACACAAGTT TTCAAAAGCG ACTCGAGTAC
CAAAACGCAG TATCGGCAGG GTTCTCACCA GAGGCAGGAA CGACTGATGG CACCGCCATA
GGCAACAAGA TTATGTCAGC TCGTGAGCGC TATCAACAGG GAGCTATAGC GGTCGCCTCC
GGTGGTGCGG TTGGCTCGAG TGCAATTATT ATGGCAATTC AGCAAATGCC AGATTGGCTG
CGACCACTTC TCATGACATT GCCAGCGACA CGACTACGTG TTCCAGTTGT GCCAAAACCT
CCGCCGCATA TGGTTGAAAT CGCTTTGGCC GCACTGAAGG CGAATTCGCT TCCTGCCGAG
CGGCCTGAGG GAGAAGTATC TACGAGTGGC AGCAAGCGTA AATTGGCTGC GATTGACTCC
TCGGACGAAG AGAGTGATGT GCAAGGCGGA GGGTATGGCA GCCAGTTTCG AAACAGACAG
CGTGCTAGAA TGAACGCGTC ATAA
 
Protein sequence
MAGDPRQPPQ PSPESAHIGD KKPETARSIL WSADDHPDEE LEDVDLPLPV VDHPLTVPLS 
TNASHPVPAA DHELISRILV PSPFPHHHAG PTSARFRNAL NRVQTQPTAD VEAWQALLTE
TQTCYKQIVA NNTLTNSVTH PYTTDAETDV VARVKQQQLD WVESCYGAVL RYFPYASSHV
HTVAEILWTL SSHGVAEEQQ LLVFDGTNHS HHSTINASSV SPQRTQLYEA KLERLLSRYL
GVTFDHSSSG SNYPNGRDAD TPPSPPENTA LPGMCDWMVE LWLLYARKKR RDALRHSNLA
QQQQLPDARV SYVRDQTLQA YEQAQPFVGH GENNVIFWKA YLDFVRSWTA MANEDAKNHH
AVAQQQMVRL RTIYQALIKY PMTGLDQLWQ EYEAFERGQN ETLAQALTQE LLPTYQHART
VYLERHRVYD TNDLQLGRLA TPPADNAVTQ EEDYETKRAE EQALLRAWKV RVAYERTNPE
RLNSSEFARR VRQVYQAMVS VLTRYPEAWH MWSTWELSVA TGTTTTSDVT ADGRHHESTI
TLARAVLQLG QSHIPDCTLL AHTEAILVEL HAVDPKSCLN VMERFVDRSP NTLGFVLYQQ
LTRRYQGMEA ARKVFARARR VLVNPAEAAA AAAKQDVRTE DGVDAENHPH DEGSGGKRWV
VTNRLDPNIG PTNGQQVQGA TETTTGQEGV VDGSEKHPPG VITWHLYASH ANIEHRVNKA
PEVAARIYEL GLRKHAAFLT VPSYVMRYAQ LLLELNDTMN LRALLTRAVA ACEAQEKENS
LALLWNMTLH FESVMGGSDP TSAVTMQKIE RQRRAALMGA NVEEVATGGF VGINEPALIG
AQKSTIADQL VRTESYDTSS SIVNGMNRAV DVLEIMGLWG SGESSVDQAR RRIKQSKNRE
SEVDISGGKS DTSFQKRLEY QNAVSAGFSP EAGTTDGTAI GNKIMSARER YQQGAIAVAS
GGAVGSSAII MAIQQMPDWL RPLLMTLPAT RLRVPVVPKP PPHMVEIALA ALKANSLPAE
RPEGEVSTSG SKRKLAAIDS SDEESDVQGG GYGSQFRNRQ RARMNAS