Gene PHATRDRAFT_47695 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47695 
Symbol 
ID7202703 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp554174 
End bp555638 
Gene Length1465 bp 
Protein Length418 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181932 
Protein GI219123231 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00130842 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTCAGCTGCA GCTGGAGCTT ATCTTGGTGC AGAATAAACA AAAAAGGACA GGCAAGCGTT 
TGTGAATTCT GGCATTGTCG AGATCGCTGA CCAAGAAATT TCCATTCACG CTGAAAAATG
AACGTTTCGC TCATAGGCAA AATAAAAAGA GGTATTAAGG AAATTTGCTC GGCAACGAGC
AATAATTCAT TTCCAGCCGT TGTCGACAAT CCATGTTGGG GTCTGCCGAG CCAATGTTCC
TTGTCTGACT TGCTGCGATG CTTTGAAAAG GATAAAGTGG TCGTGTACGA AACTGATCAT
TTTCTAGCCT TGAACAAGCC ACCGGATCTT CGAATGGATG GACACCACCC TTCGACGGTA
CTCAAGCTCC TAACCTATTG GTATCCTCCT CCTGCCTTCC AAAATCTTCA GAGTAAGGGA
CTTCTGGAAA AGGTTAGCGA GTTTGAAAAC TACAGGAATA TTGAAGGCAA TGAGCTTAGA
CCCTGTCACC AATTGGATTA CGCAACGTCA GGCATCCTTC TCGTCGCTCG GAATCGGCAA
GCAGCGGATC AAGCTCGAGT TTCATTTGAA GAGCGGTCCA CTCGAAAAAC TTATTTGGCA
CTCGTCCACG GCCACCTATC GGTACCAAGC AATATTCCGG TCATGAGGAG AACGGACATT
GATGAGCGAA TGGGTCGGCT GGAGGAAATA TACCGACAGA GTCGTCGCAA ACACCGGAAG
GACACATATA GAGGATATCA GCCTGCCCAT GGCTTGTTTC AGCAGCTCCA GCAACAGCAC
AGTAAAAGAG CAAAAAAAGA AAAAAAATCG ATTATCAGAT TCCGAGTGGA AAACTGTTTG
GAATGAGCTA CAGTTTACTA AAACAGAAAC AGACCATATC CTTAATTTAA GTTGGAAGGA
GGTGAAGGCG TCCGGAAAAA CTGAACCTTT TGATCGCGCA GCAGAAGTAT TCAACAAACT
TCAATACAAA ACGTTGTTCC CCGAGGATAA AAGTGCCGAG TTGTCGCTCC CGACTTTCTT
TCGCGATGAA AGTGAGGACC CAAACACACT CTTCATTTAT GCTTCTGTCG CACAGGTTCC
TCACGATTTT GCCATGCGAA TAAATCCAAG CATGTCGAAT GCCTCTGCAT ATTTGAAGGT
TGGGGATTCA TCCCTGGACT ACAAGCCGTC TCTCACACGA TGCGTGATTC TTAAGCATGC
TGCCATTAGA GGGCAACCGG TTACAAAAGT TCGTCTTGAG CCTCGAACTG GGCGACGTCA
TCAGCTACGA GTACATTCCG CATTGTTAGG GCACGCCATA GTTGGAGATC AAACCTACAA
AGCCCCAGGC TCTCCCGACC TAACGGATCG CATGTGTTTG CATTCTCAAT GTCTCGAAAT
TCCACTTTTT GAGGAGTTAA TCAAAGTCGA AGCACCAGAT CCCTTCCTTG TAAAGAACCG
TGAGATATTG GTACAGCATC TATGA
 
Protein sequence
MNVSLIGKIK RGIKEICSAT SNNSFPAVVD NPCWGLPSQC SLSDLLRCFE KDKVVVYETD 
HFLALNKPPD LRMDGHHPST VLKLLTYWYP PPAFQNLQSK GLLEKVSEFE NYRNIEGNEL
RPCHQLDYAT SGILLVARNR QAADQARVSF EERSTRKTYL ALVHGHLSVP SNIPVMRRTD
IDERMGRLEE IYRQSRRKHR KDTYRGYQPA HGLFQQLQQQ HNHILNLSWK EVKASGKTEP
FDRAAEVFNK LQYKTLFPED KSAELSLPTF FRDESEDPNT LFIYASVAQV PHDFAMRINP
SMSNASAYLK VGDSSLDYKP SLTRCVILKH AAIRGQPVTK VRLEPRTGRR HQLRVHSALL
GHAIVGDQTY KAPGSPDLTD RMCLHSQCLE IPLFEELIKV EAPDPFLVKN REILVQHL