Gene PHATR_25666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_25666 
Symbol 
ID7204368 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp466862 
End bp469929 
Gene Length3068 bp 
Protein Length885 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186352 
Protein GI219113537 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGATGATCTT GCTGATGAAA TGCCTTGAGC ATTACTTTTC GGCATGAAAT TTAACTACAA 
ACTGAAACGT CTCTGTGGCG CATACTACGG CCATCCTTTG ACTAACGAAA CTACTGGAGG
GACACGATGG AGTGGATCCA ACGTTGTCTA CGATTCCAGT GGTGACCTCC TTATATCCTC
TGTTTCCAAC CGCATCCAAG TTTTGGATTT GAGGACACAC ACAGTCCGTA CATTGCCTGT
CGAATCCCGA TCGAATGTTC GCTGTCTCGC CCTTTCACCC GACGATGCTA TTTTGATCGT
CGTTGATGTC AAGAACTACG CACTCATTGT CAACTTTCGA AGAGGGATTG TTCTCCACCG
ATTTCAGTTC AAACGTAAGG TAAGGCAAGT TCTGTTCAGC CCAGATGGAA ACTATATCGC
AGTTACACAC GGCAAACAGA TTCAAGTTTG GTGCGCACCG TCGCAACTTC AAAAAGAGTT
TGCCCCCCTT GTACTGCACC GCACATACAC TGGACAGGCC GATGACGTTA TGTGCCTGAG
CTGGTCCTCA GATTGCTCCG TGATAGCAGC TGGTAGCAGG GACTGCTCCG TTCGAATTTG
GAGCTTGCAC ACAACGCGAA ATTTCGAACC CGTGACACTC TCTGGACACA AACGCGCGAT
AGTAGGCGTA TACTTGATAG GGAACCATAG CGGCCGGGTG GAAACCTGTT ACAGTGTGAG
CGAAGACGGA ACGCTTGTAT CGTGGGAATG CAAATTGAAG GAAGGGGAAT GGGATGTTGA
TCACCAGAAT CGAGAAACAC CCCCGGAGGA TTCAACGGAC GATGCAGTCG ATTTTTTCAC
AGGCGCATTT CCTAGGCGGG CGTCGGAACG TACTGGGAAG TCTCAGGCTC ATGATTTAGT
ACAGTCTTTT TGGTCTGTCA AGTCGAGACA TTATTTCCAC CAAGATGGTG CAGATGTTAG
CTCTGTCACA TACTGTGAAC GTGGTCAGCT ACTGGCGGTC GGATTTTCTT CGGGTCTCTT
CGGGCTTTAC GAAATGCCTT CCGTTTCCAA TATTCATACC CTTTCTGTTG GAAACAACCA
ACTAGTAAAA ACGTGTGTCC TTAACAAGAC CGGCGACTGG CTCGCTTTAG CCTGCCCTCA
TTCACAGCAA TTGTTCGTTT GGGAATGGCG TTCCGAGACA TACGTCTTGA AACAGCGAGG
TCATGCGTAC GGCATGCGAT GCATGGCCTA CTCGCCAGAT GGTGTAATTG TTTGTACGGG
AGGCGACGAT GGCAAGCTTA AGCTATGGAA TGCTACATCT GGATTTTGTT ATGTGACAAT
GGAGAAAAGT CATACAGCCC CGATAACGGC CGTCGCATTT GCCAATGCTA GCGTTGTTCT
GTCCTCCAGT TTGGATGGCA CCGTTCGAGC CCATGATCTC TATCGATACC GCACTTTCAA
GACTTTTACC ACACCAACTC CCGTACAGTT CTTGAGTCTC GCAGTAGATC CAAGCGGTGA
GATTGTAACT GCGGGTAGTA CAGATCCTTT TCATATTTAT GCTTGGAATC TCCAAACCGG
TAAGCTTCTC GATATATTGA CTGGTCATAG GGGACCGGTC TCGGATTTGT CTTTTCAAGG
GAATGGCGGT ATTCTAGTTA GCGGTTCTTG GGATGGTACT GTGAAACTCT GGGATCTTTA
CAAGGGGAAT GTTCCAACTG AAAGTTTGCA ACACACGGCA GATGTGGTAT GCGTGACCTT
TCGGCCGGAC AGTAGAGAAG TGTGCACGGG TACAATGGGC GGTATTCTAA GTTTTTGGGA
TGTCGACAGT GCCAAGTTAA AATTTGAAAT CGACGGTCGG AGAGACATAG CTGGTGGTCG
TAAGATTAAT GACCGAATGA CAGCCGACAA CAACGCTTCT TCTCGATATT TCACATCTGT
TTGTTACTCA GCAGACGGAT CCTGCATCCT TGCCGGGGGC AACTCCAAAT ATGTCTGCAT
CTACGAGGTC TCGCAGCAAA TGCTATTGAA AAAATTTCAA GTCACTTTTA ATCGAAGTTT
GGACGGAGTT TTGGACGAGG TACGTTCAAT GATCTCGGCC ATAGTCAGCC TTGGCTCTTT
AACGCTATGA TGATGAACTT CCCATAAACA TTCGCAGTAT GCAGCTGACG GGTTAGACCC
ATTTGGAGCT TTTAACTTCC TTGGATGTCT GAGCGTTGAT TTTGCTCGTT GTCTCTGAAC
ATGTTCGTTA TGCAAATCTA ACATATCCCT TTGGTACTGT CCTTGCAGCT CAACTCCAAA
AATCTGGGTC CCGGAGGACC GATTGATGAT CACGCCGATT CGGGGGATGA TACGATGTAC
AATGCCTTGC AATTGCCGGG TGCTCGTCGA GGTGACGATG GCTCGCGTAG TTCTAGGGTA
GAAGTACTGA CGCTACAAGT CGCGTTTTCA TCTACTGGCC GAGAATGGGC CACTATTTCG
GGAGAGGGGC TTCATGTCTA CTCGTTGGAC GAAGATATGA TATTTGACCC GATCTCTCTC
ACCGAAACCA TCACCCCCGC TGCGGTGGAA TCTAAGCTAA GCACGGGAGA CTACATCATG
GCGCTGCGTA TGGCCCTTCA TTTGAATGAG TTTGCCCTTG TAAAGAACGT GCTAGAGTCG
ACACCGTTTG ACTCGATTGC TCATGTTGTT CGATCCATCG GTCCTGAACA TTTGGAACGA
GTCCTGCAAT ATGTGGCAAA AGTGATGGCG GACTCTCCGC ACATTGAGTT CTATCTGCAT
TGGTGCCTGG AACTGCTACG TACACACGGG ATTCACATGG ATAGGAATCG TGGCAATTTT
ATGCGAGCCT TCCGTGCAAC GCACAAGTGT GTTCAAACGA AATATGACGA ACTCCGGGCC
ATATGCCAAG AAAACAAGTA TAATCTCGAC TTCCTTCAAG ATCACGCTCG TCTACTTCTC
ACTCATGAGG AAAGTGAGAA GATGAAGGTG GAAGACTATC GGGAGTAACT AGTGCTGTTG
CCGATTGTTT GTTCTGGGCA CTCCGTTAAT GTAAAGCAGA TATAATACCG AGAATTTTGT
TTACAGTT
 
Protein sequence
MKFNYKLKRL CGAYYGHPLT NETTGGTRWS GSNVVYDSSG DLLISSVSNR IQVLDLRTHT 
VRTLPVESRS NVRCLALSPD DAILIVVDVK NYALIVNFRR GIVLHRFQFK RKVRQVLFSP
DGNYIAVTHG KQIQVWCAPS QLQKEFAPLV LHRTYTGQAD DVMCLSWSSD CSVIAAGSRD
CSVRIWSLHT TRNFEPVTLS GHKRAIVGVY LIGNHSGRVE TCYSVSEDGT LVSWECKLKE
GEWDVDHQNR ETPPEDSTDD ASFWSVKSRH YFHQDGADVS SVTYCERGQL LAVGFSSGLF
GLYEMPSVSN IHTLSVGNNQ LVKTCVLNKT GDWLALACPH SQQLFVWEWR SETYVLKQRG
HAYGMRCMAY SPDGVIVCTG GDDGKLKLWN ATSGFCYVTM EKSHTAPITA VAFANASVVL
SSSLDGTVRA HDLYRYRTFK TFTTPTPVQF LSLAVDPSGE IVTAGSTDPF HIYAWNLQTG
KLLDILTGHR GPVSDLSFQG NGGILVSGSW DGTVKLWDLY KGNVPTESLQ HTADVVCVTF
RPDSREVCTG TMGGILSFWD VDSAKLKFEI DGRRDIAGGR KINDRMTADN NASSRYFTSV
CYSADGSCIL AGGNSKYVCI YEVSQQMLLK KFQVTFNRSL DGVLDELNSK NLGPGGPIDD
HADSGDDTMY NALQLPGARR GDDGSRSSRV EVLTLQVAFS STGREWATIS GEGLHVYSLD
EDMIFDPISL TETITPAAVE SKLSTGDYIM ALRMALHLNE FALVKNVLES TPFDSIAHVV
RSIGPEHLER VLQYVAKVMA DSPHIEFYLH WCLELLRTHG IHMDRNRGNF MRAFRATHKC
VQTKYDELRA ICQENKYNLD FLQDHARLLL THEESEKMKV EDYRE