Gene PHATRDRAFT_49054 details


Gene Information

Locus tag: PHATRDRAFT_49054
Symbol:
ID: 7195424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 440854
End bp: 443415
Gene Length: 2562 bp
Protein Length: 853 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002183749
Protein GI: 219127034
COG category:
COG ID:
TIGRFAM ID:
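
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the last codon is an untranslated stop codon (the record states neither, though both assumptions agree with the listed values):

```python
# Values copied from the record above.
start_bp = 440854
end_bp = 443415

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result is 2562 bp with no leftover bases and 853 aa, matching the Gene Length and Protein Length fields.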


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCGCC CATATGATGG CCAGACTCCA GCAGAGCAGG CAACTTTCGC TCGTGAAGAG 
CTTATGCATC TGGATGAGGA ATCTATGATG GACGACTACA ACGACGACGA GACGGAGAAT
AACCAAAGTT TTCTCAATTT GCTCCCGGAA GACGACGAAG CCGGAAACGA CGACCCGGCA
CAGCTCTTTG CCATTCTTGA AGATGCCATC AATCAAACCG CCAAACAAAC AGGCGATACA
GGGCGAGCTT TAACGGCGGT AGAACAACAG CAAAAGCGCG AAAACGCGGA ACACACGTGG
GACCGCGTGC GGCGCTGGAT GTGGGCGCAT CCGCAAGTGG AACAACGCCA AGCCGCCGCG
TTTATTCGCG GGCAGGCCGA CGCCACAGCG TTGCATCTCA TGTGCAAACT GCATAATCCT
CCCGACGATA TTATCCAAGC CATCGTCGAC TCCGCCGTCG ATGTTGTGTC CTGGACGGAC
GCGCACGGTT GGTTGCCTTT GCACCATGCC TGCGCTAACG GAGTGTCTCC GGAAACCATG
AAAGTCCTCG TGGATGCCTA TCCCGCCGGT AAGCTGCAAC AGGACAATAT GAGGCGAACA
CCGCTCCACT TTTATGCCAC GCGGAATTCC GATAGTACGG TTATCATGAC AACTAACACC
GAACTCCTGG CTGACGATGG CGCGGCCGAA CTCACCGATC GTGGCGGCAT GTTACCCATG
CATTACGCGT GTGCGTACGG GACCGACCCG GCCGTACTCG AAGTCTTGGC TCAGGTCTAC
CCCGCCTCTC TCACGGCCAA AGAAAACAAG GGTCGAACGC CGATGCATTT GGCCATGGTA
AATGCTCACC GCGACGCCAG TCCGAACGTG ATTCATTTCT TATTGGAGTA CGCTGAATCC
CGAGCGACCG TCAACGTACG CGATCAAGAC GGTTACCTAC CCTTGCATTT GTTGGCTCTC
GGTCTCAAAG GATACACGGC GGATGAATCC ACCAAACGCA GTAACGTGAG CGCTTGCTTG
AGTATGTACA TGGACGCGGA GCCTCAAGCC GCAGCCGACT TTTTGACGGC GATTCAGGAT
TTGCCGGATT GGCTTCAAGA TACTGCAGTC GTATCGAAAC ACGTGCGGAA CGTTTTGAAC
TACAAAATTG TTCAGCCCTT TCCCACGTCT ATTCTCATGT TGGACGGATA CTTTCTGATT
GTCCTGATTG TATGTTTTGC GTTCTCGACG AAATATTATA TTGATTTCCT CTTCGAGCCG
GAGAGCAACA AAACAGACAT TGACATTTAC ATTGTGTTCC TTTTCATTGG AGCGTCGTAC
TTTTTGATGC GCGAGCTGGT ACAAGTGGCT TCGTTGGTTA GTCTCGGCAG TTTCAGTAGT
TGGTGGTCAG ACGCCGCCAA TTGGTTGGAT GTTGGTGTAA TTACAATGGT TTACTACTAC
GCCATCTGTA TGAGCATGGA CAGCCCAGTG GATAGAAATC TTTTTCGAAG TGGCGCCGCC
TTTGGACAAG GAATTCTGTA CGTGGCAGTC ATCGTTTATC TCAAAAGCAC ATACGTCGAC
TTTGCCGTAT TCGTGGGTGG CGTACTGTAC GTCGTACAAC GGCTAACGGC ATTTCTGACT
GCCGTGGGTG TTATCTTGTT GGCCTTTGCC CAAATGTTCT ACTTTATATA TGTGGATGGC
GAAATGTGCA CCAACAGGCT GCAGCCCGAT GATCTAAACT ACGACGCAGA CCTTGACTGT
CGCTTTTCTC ATTGCTCCTT TGAAGAATCC CTTCTGCGAG TGTACAGCAT GATGATGGGA
GAGATCGGAG ATGAAAAACG CTATTCCGCG GGCCCGACGA GTCTAGTGGC ACAGATATTG
TACGTCGGCT ACGCCTTTTT GGTTGTCATT CTGCTGTCGA ATGTTCTTAT TGCCATTGTC
ACCGATAGCT ACGAAATTAT TCAGAACGAC CGCGCTGCGA TTGTATTTTG GACGAATCGG
TTGGATTTCG TGGCCGAGAT GGATGCCATC ACGTACGGAG CACAGCGTCG CCTTGGTTGT
TTTCGAAAAA AGAAGGCGCC CATGCAGGTA GAGGAAATTC CCAATGCCCC GGGATTAGTC
TCGGATGACT ACAATGACAG GAATGGCTCT TCCGACTTCT TTCGATACGG TTGGCAACAA
GTCATGCTCT TGTTTGACGC AAACCTGTAC GAAGAGATCG ATCCTATCGA GTCGCTGGTG
TACAATATTT TTCGAATCGC TTGTATTGTT TTCGTTATAC CGTTGTGGTT GATCATGGGC
GCGGCAACTG CTGGGTGGTT ATGGCCTCCA CAAGTACGAG AGGCTCTATT TTGTCAGAAA
GGCACGACCA CCTCTCGTTC GGAGATTGAG CGCAAAAAAC TCGAGCAGCT GCGAACTATT
CAAACCGATC TGACATCATT GAAGAGTGAT ATAGTGCGAG AAATGACTAG CGATCGCGAC
GAAATGATAC GAATGAAGCT GGAAGTGGAA GGAGTCCAGG CAGAAGTCTT ATCCGATCTA
ATACAAGTTC GAGAGCTTAT CACGTCGCTT CTTGGAACTT AA
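
The sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any per-base statistic such as GC content can be computed. A minimal sketch of that step follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the source database:

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used as a small example.
example = clean_sequence("ATGTCTCGCC CATATGATGG")
print(len(example), gc_fraction(example))
```

Applied to the full listing, `gc_fraction` should land near the 51% reported in the GC content field, assuming the record computes that value over the same sequence.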
 
Protein sequence
MSRPYDGQTP AEQATFAREE LMHLDEESMM DDYNDDETEN NQSFLNLLPE DDEAGNDDPA 
QLFAILEDAI NQTAKQTGDT GRALTAVEQQ QKRENAEHTW DRVRRWMWAH PQVEQRQAAA
FIRGQADATA LHLMCKLHNP PDDIIQAIVD SAVDVVSWTD AHGWLPLHHA CANGVSPETM
KVLVDAYPAG KLQQDNMRRT PLHFYATRNS DSTVIMTTNT ELLADDGAAE LTDRGGMLPM
HYACAYGTDP AVLEVLAQVY PASLTAKENK GRTPMHLAMV NAHRDASPNV IHFLLEYAES
RATVNVRDQD GYLPLHLLAL GLKGYTADES TKRSNVSACL SMYMDAEPQA AADFLTAIQD
LPDWLQDTAV VSKHVRNVLN YKIVQPFPTS ILMLDGYFLI VLIVCFAFST KYYIDFLFEP
ESNKTDIDIY IVFLFIGASY FLMRELVQVA SLVSLGSFSS WWSDAANWLD VGVITMVYYY
AICMSMDSPV DRNLFRSGAA FGQGILYVAV IVYLKSTYVD FAVFVGGVLY VVQRLTAFLT
AVGVILLAFA QMFYFIYVDG EMCTNRLQPD DLNYDADLDC RFSHCSFEES LLRVYSMMMG
EIGDEKRYSA GPTSLVAQIL YVGYAFLVVI LLSNVLIAIV TDSYEIIQND RAAIVFWTNR
LDFVAEMDAI TYGAQRRLGC FRKKKAPMQV EEIPNAPGLV SDDYNDRNGS SDFFRYGWQQ
VMLLFDANLY EEIDPIESLV YNIFRIACIV FVIPLWLIMG AATAGWLWPP QVREALFCQK
GTTTSRSEIE RKKLEQLRTI QTDLTSLKSD IVREMTSDRD EMIRMKLEVE GVQAEVLSDL
IQVRELITSL LGT
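
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty; the function name `translate` is hypothetical:

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons ordered by T, C, A, G
# at each position. Assumption: this table applies to the gene above.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGTCTCGCC CATATGATGG CCAGACTCCA"))
```

The first three blocks translate to MSRPYDGQTP, the opening of the protein sequence, and the final line of the gene sequence translates to IQVRELITSLLGT followed by a TAA stop codon.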