Gene PHATRDRAFT_54983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54983 
Symbol 
ID7195287 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp363350 
End bp366343 
Gene Length2994 bp 
Protein Length891 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183599 
Protein GI219126721 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAAAAAACG TTCTACTTTG TATTTTGGTC GGGTTTCGGA TCCTTCCAGC AACCATTCTC 
ATTCAAAGTC ACCACTTGTG CGAACGATGG TACCGAAACC TGAAGATCCC ACAGTCAAGG
CAGAGAATAA TGCGGCGATG GATCAACTTA GTCTCCTCGA CAAAGATGAT ATATCGTCGG
CTTCTCGCTC GTGCCGAGAA CTCTACGGTA GGTCGTCATC GCAGTGCCGG ACTGGACTAG
GCTGGACTCC CTGCAAGATT CAATCGGAAC CGACGAAACA CGTGACTCAC AGTTATTGCA
TCATCCTTGA CGCTCCGTAG GACCTTACCC CAAAGCTATT CCTGTGCCGT TCTTGAATTC
TCGTAACGAA GCTCGCGAAG GTGACACTCC CGCCGCCAGC GTCATCGCGC AAGCCAAAAC
CATCTTTGAC GTACCGGCGG ACTATCGTGA CGTGGGAACA CCGGATGAAT GGGTTCCCCG
CGATGGACGC CTCGTGCGTC TGACGGGTAA GCATCCCTTC AACGTCGAAC CACCGCTGGC
GATTCTGAAG CAGCATCGAT TTATTACGCC GTCCTCGTTG CATTACGTAC GCAACCACGG
AGCGTGCCCG AAGCTGTCTT GGAAAGAACA CACTGTTTGT GTGGGAGGAA AACTGGTACC
GAATGCCTTG GAGCTCTCGA TGGACGAAAT CGTAGCGATG GAACCGCGAG AGCTGCCCGT
CACGTTGGTC TGTGCCGGAA ATCGTCGGAA GGAACAAAAC ATGATCCGTC AAACAATCGG
CTTCAACTGG GGCCCGAGCG GCGTCTCAAC CAGCGTTTGG AAGGGAGTGC TCCTACGCGA
TTTGTTGCTC CGCGCAGGGG TTTCGGAAAA GAACATGGCA GGGAAGCACG TCGAATTTAT
TGGTGTCGAA GACTTGCCGA ACAAGGTGGG ACCCGGGCCG TTCCAGGAGG AACCATGGGG
CAAACTTGTC AAGTACGGAA CCAGTGTCCC GCTCGCTCGG GCTATGAATC CAGCGTACGA
CATCCTCATT GCCTATGAGC AGAACGGCGA AGTCTTGCAG CCCGATCACG GCTACCCCGT
CCGTCTCATC ATTCCTGGTT ATATTGGAGG ACGGATGATT AAATGGCTTA AATACATCAA
CGTGATTCCG CACGAAACCA AGAATCACTA TCATTACCAC GACAATCGCA TTTTACCGCC
CCACATCACT GCAGAGGAAT CCTTACAGGG AGGTTGGTGG TACAAACCGG AGTACATTTT
CAATGAACTC AACATCAATT CGGCCATCGC TGCTCCTGAT CACAATGAAA CGCTTTCGAT
CGCCAAGAAT ATTGCCAAGA CGTATGACGT TACGGGTTAC GCATATACTG GTGGTGGTCG
TCTCATCACC AGGGTCGAAA TTTCAGTTGA TGGCGGTATC CATTGGGAAC TTGCCAAACT
TGAACGCAAG GAGCAGCCAA CGGACTACGG AATGTACTGG TGCTGGACTT GGTGGAACTA
CGAAGTAAAG GTGGCCGACT TGGTGGGAGC CAAGGAAATT ATATGCCGCG CCTGGGATGA
GTCCAACAAC CCTCAGCCAG TTGTTCCAAC ATGGAATCTG ATGGGTATGG GAAATAATCA
AGCCTTTCGT GTCAAGGTAC ACATGGACAA GACAGCTAGC GGCGAGCATG TGTTTCGGTT
TGAGCATCCA ACTCAGCCTG GTCAACAAAC TGGTGGGTGG ATGACAAAGG TCGCCACCAA
GCCTGAGTCG GCTGGGTTCG GACGGTTGCT GGAAGTGCAG GGTGAGTCCA AAGAAGACGC
GGCCCCGGCT CCACCTCCGA AGGAAAATAC CAAAATTTTC ACGATGGAAG AGATTGAAAA
GCACAACACT GAAGAAGACT GTTGGATTGT GGTGAAGGAT CGTGTCTACG ACTGTACCGA
GTATCTAGAG CTGCACCCTG GCGGCATTGA CTCGATTGTT ATCAACGGCG GCGCAGATTC
CACGGAAGAC TTTGTGGCAA TCCACTCTAC CAAGGCTACA AAGATGCTCG AGAAGTACTA
CATTGGCCAG CTCGACAAAA GTAGTGTGGC CGAGGAGAAA AAACAAGAAG ACGAACCTCT
CGTCGATGCC GATGGCAATG CTCTTGCCTT GAACCCAAAG AAGAAGACGC CATTTCGTCT
ACAAAACAAA ATCACACTTA GTCGAGACAG CTACCTATTG GATTTTGCTT TGCCAAGCCC
AAAGCATGTT TTGGGGCTAC CCACGGGAAA GCACATGTTT ATTTCGGCCC TCATTAATGG
AGAGATGGTA CTCCGCCGCT ACACTCCTAT CTCATCCAAT TACGACATTG GATGTGTAAA
GTTTGTTGTC AAGGCATACC GTCCGTGTGA ACGCTTTCCA GACGGTGGCA AGATGAGCCA
ATACCTAGAC CAGATCAATG TTGGCGACTA TGTTGATATG CGCGGACCAG TTGGGGAATT
TGAGTACTCG GCCAACGGCA GTTTTACAAT CGACGCCGAA CCTTGTTTTG CCACCAGGTT
CAACATGCTT GCTGGGGGGA CCGGCATAAC GCCCGTAATG CAGATTGCTG CGGAAATTTT
GCGAAACCCA CAAGACCCTA CACAAATGTC CCTTATTTTT GCATGCCGCG AGGAAGGCGA
TCTCTTGATG CGAAGCACTT TGGACGAATG GGCTGCTAAC TTTCCTCACA AGTTCAAGAT
TCACTACATC CTATCTGACA GCTGGTCTTC CGACTGGAAG TATTCCACAG GATTCGTAGA
CAAAGCGCTA TTTTCCGAGT ACTTGTACGA AGCAGGCGAT GATGTTTACA GCCTCATGTG
CGGCCCACCA ATTATGTTAG AGAAAGGCTG CCGTCCAAAC TTGGAGAGCC TTGGTCACAA
AAAGGACAAA ATTTTTTCCT TTTAAAAGTT CTTGACTGAT TGTCATATCA ATTTTGCACT
TTACAATACA TTTTCAATAG CAATTTACTT TAAGACTAGC GCAATTTTTT TCTT
 
Protein sequence
MVPKPEDPTV KAENNAAMDQ LSLLDKDDIS SASRSCRELY GPYPKAIPVP FLNSRNEARE 
GDTPAASVIA QAKTIFDVPA DYRDVGTPDE WVPRDGRLVR LTGKHPFNVE PPLAILKQHR
FITPSSLHYV RNHGACPKLS WKEHTVCVGG KLVPNALELS MDEIVAMEPR ELPVTLVCAG
NRRKEQNMIR QTIGFNWGPS GVSTSVWKGV LLRDLLLRAG VSEKNMAGKH VEFIGVEDLP
NKVGPGPFQE EPWGKLVKYG TSVPLARAMN PAYDILIAYE QNGEVLQPDH GYPVRLIIPG
YIGGRMIKWL KYINVIPHET KNHYHYHDNR ILPGGWWYKP EYIFNELNIN SAIAAPDHNE
TLSIAKNIAK TYDVTGYAYT GGGRLITRVE ISVDGGIHWE LAKLERKEQP TDYGMYWCWT
WWNYEVKVAD LVGAKEIICR AWDESNNPQP VVPTWNLMGM GNNQAFRVKV HMDKTASGEH
VFRFEHPTQP GQQTGGWMTK VATKPESAGF GRLLEVQGES KEDAAPAPPP KENTKIFTME
EIEKHNTEED CWIVVKDRVY DCTEYLELHP GGIDSIVING GADSTEDFVA IHSTKATKML
EKYYIGQLDK SSVAEEKKQE DEPLVDADGN ALALNPKKKT PFRLQNKITL SRDSYLLDFA
LPSPKHVLGL PTGKHMFISA LINGEMVLRR YTPISSNYDI GCVKFVVKAY RPCERFPDGG
KMSQYLDQIN VGDYVDMRGP VGEFEYSANG SFTIDAEPCF ATRFNMLAGG TGITPVMQIA
AEILRNPQDP TQMSLIFACR EEGDLLMRST LDEWAANFPH KFKIHYILSD SWSSDWKYST
GFVDKALFSE YLYEAGDDVY SLMCGPPIML EKGCRPNLES LGHKKDKIFS F