Gene PHATR_46802 details

Gene Information

Locus tag: PHATR_46802
Symbol:
ID: 7204550
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011679
Strand:
Start bp: 494763
End bp: 496882
Gene Length: 2120 bp
Protein Length: 579 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002185717
Protein GI: 219120971
COG category:
COG ID:
TIGRFAM ID:
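
The gene length listed above follows from the start and end positions if both are counted as part of the gene. The sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which matches the reported 2120 bp. The function name `feature_length` is hypothetical and used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both end points belong to the feature, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record.
gene_length = feature_length(494763, 496882)
print(gene_length)  # 2120

# Nucleotides needed to encode the 579 aa protein, plus one stop codon.
coding_nt = 579 * 3 + 3
print(coding_nt)  # 1740
```

The 1740 coding nucleotides are fewer than the 2120 bp gene length, which fits a record marked as spliced: the remainder would be non-coding sequence, although the record does not break it down further.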


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAGGATTTCG TAAGTAGGGT CAAGGAAGAC CATGGAAACG TAGACGGAAA CACTTTTCCT 
TCGATTTTTT AGATAGTTCC ACAAGACCAT AGAAAACTTC TAGCATCCAG GAACTTATAT
TGACCTGACG CAGACGAAAA GCGGGATCGC AATCGGTTTT GCATTTACCG CAATCAGGAG
GAGTCCTGTG GCGTTCGGAT CAATCGCACG GCGCCAAGCT TACCGTTGTC GTTTGGGTGC
TGGTGGCGCA CGAGGCCCCT CGTCGAATTG CACATTGACG AAACACAATG GCGAGGCAAT
TTCGAAGAAC AAATGGTTTG CACGGTCCCC ATACGACGGG TCAGATCAGC GCATGGATTG
CCCTGTTGGC AACCTTGGTG CAATTTTTGC TCGTCGTTTC TCCAATACTG CCGTTGGAAG
CTTCCATCCC TGTTACAGTT GTCTTTGTGG CCCTCGTGAG TGGTTCCTTT TACTACGGAT
ACCTTGCGCA GTTCATTGAT CCAATGGACA AGCACTTGCG TGTACATCTG CAGGAAACGG
AACCCGAAAA CGTGGCACCG GCGGTGGCGT GTTGTGGTTG TTGTACTGTA CCGCAATTCC
CTTCGCATCA ACACGATACC GAGCAGCCCA TGGCGAACGA AGACATGAAG CAGTGCTGGA
TCTGCGATAC GCAGGTATCG ACGCACGCTA TGCACTGTAA ATTTTGCAAT AAATGCGTTG
GTCGCTTTGA TCATCACTGC ATGTGTAAGT ACCACCGTGT ATCGATTTGT TTGTACCAGT
ACATGGGCGA AGCTCGGAAG CTCACTCGAT CAACTACGAT ACTTGCTGGT CTTACAGGGC
TCAATACGTG TATTGGGGAA GCGAATTATC TCTACTTCTT TCGGACAATG GTTTTTGTTT
TTGTCATGGA AGTCTACCAC TTGATTGTGC AGCTTGGGCT CTTGATTGAT TCGTTCACCG
ACGGTGCGAC GAATCAAAGG GCCACGGATT GGTTTCAAAC CGGAACGGAC ATTCCGGTGC
ACGTGTTGCT GATTTTATTT ATTCTGTTCA ATCTGCTGTC GCTCTTTTTG ATCACGCAAT
TGCTCCACTT CCATATCGGG CTGCGGCGCA AGCAACTAAC GACCTACCAA TTTATTGTCG
AGGATCACAA AGGGCGACGC GAACGTGCCA AACGCGAAGG TCAATTGGAT TCCAACCGAA
TTGTTGCCGT GACGGAGGCA CAGGAAAACG GTCAAACCTG TACCGCGTGG AAGTTGCAGT
TGGGCGGATT GTGTCGGCAA GCGGGTTGCA CGCAGTGCGA TCCACTGGCT CTGTCACCTC
CAGACAAGCC GGAATCGGAA TCATCCGAAG TAAACGCGCC AGAGAATTTC AGTTCCGCTT
TGGGAGAAAG GGAAAGCGAG TCTCAGTCGG TTGTGGCAGA AACCCCTTCG ACGGAACAGC
CTCCAAGAAT GGAGAATCGC ACCGAAAACG AGGGCGTGGC GTTTTTGAAA ATGAACGGCG
TGGAGGATCC CGAGGAAGCA TCGTCGTCTC GGGCTTTGGA GAACGAGTCG AATTTCAGTT
CCGCCTTGGG AGAAATGGAA AGCGAGTCTC AGTCGGATGT GGCAGAAGTC CCTTCGACGG
AACAGCCACC AAGAATGGAG AATCGCACCG AAAACGAGGG CGTGGCGTTT TTGAAAATGA
ACGGCGTGGA GGATCCCGAG GAAGCATCGT CGTCTCGGGC TTTGGAGAAC GAGTCGAATT
TCAGTTCCGC GTTGGGAGAA ATGGAAAGCG AGTCTCAGTC GGATGTGGCA GAAGTCCCTT
CGACGGAACA GCCTCCAAGA ATGGAGAATC GCACCGAAAG CGCGGGCATG GCGTTTTTGA
AAATGAACGG CGTGGAGGAT CCCGAGGAGG CATCGTCGTC GGGGGCTCTG CCAGATGAAA
CGATGACCGA CGGCAGCATT GAGCGAACGG AAGCTAATGT CTCAAAAGAC GTCGATGTCG
CACCAACCAT TTCTGTTGCG GAGGAATCAG ATTTGGGTAC AAATCCTGCA ACCGGACCAA
CCACCGATGT CGACCCGTCA GAGGACGAGG GCATCCGACA AGCTCAACAA AAAGAGCGCG
CACAAAGGTA TTTTGCGTAG
 
Protein sequence
MARQFRRTNG LHGPHTTGQI SAWIALLATL VQFLLVVSPI LPLEASIPVT VVFVALVSGS 
FYYGYLAQFI DPMDKHLRVH LQETEPENVA PAVACCGCCT VPQFPSHQHD TEQPMANEDM
KQCWICDTQV STHAMHCKFC NKCVGRFDHH CMWLNTCIGE ANYLYFFRTM VFVFVMEVYH
LIVQLGLLID SFTDGATNQR ATDWFQTGTD IPVHVLLILF ILFNLLSLFL ITQLLHFHIG
LRRKQLTTYQ FIVEDHKGRR ERAKREGQLD SNRIVAVTEA QENGQTCTAW KLQLGGLCRQ
AGCTQCDPLA LSPPDKPESE SSEVNAPENF SSALGERESE SQSVVAETPS TEQPPRMENR
TENEGVAFLK MNGVEDPEEA SSSRALENES NFSSALGEME SESQSDVAEV PSTEQPPRME
NRTENEGVAF LKMNGVEDPE EASSSRALEN ESNFSSALGE MESESQSDVA EVPSTEQPPR
MENRTESAGM AFLKMNGVED PEEASSSGAL PDETMTDGSI ERTEANVSKD VDVAPTISVA
EESDLGTNPA TGPTTDVDPS EDEGIRQAQQ KERAQRYFA
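
Both sequences above are printed in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up, together with a plain GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its own GC content figure was computed, so this may differ from the method behind the listed value.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups over several lines."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "CAGGATTTCG TAAGTAGGGT CAAGGAAGAC CATGGAAACG TAGACGGAAA CACTTTTCCT "
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(round(gc_fraction("ATGC"), 2)) # 0.5
```

Pasting the full gene sequence block into `clean_sequence` and taking `len()` of the result should give a value that can be compared with the Gene Length field; the same applies to the protein block and the Protein Length field.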