Gene PHATR_33478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_33478 
Symbol 
ID7203875 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1031761 
End bp1033047 
Gene Length1287 bp 
Protein Length428 aa 
Translation table 
GC content44% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186172 
Protein GI219113177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00955149 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCTGA CAATAACAGA TGGAGATCCA AAAGAGTACA GGACCTTTGT AGATGCAATA 
CCAACTTTCT ACCCTCTTTG CGAACACAAG CTTTGTCACT GGCATCTACT GTATCGCAGT
AATCTCATGA AGGTGCAGAC TGGAAAATGT GGAGTTAAAG CTACTATTCT ATTCCGCGTA
GTTGTTCTTT GGATTGAGAG CTGGATGACC AAAATTGAGA CACAAGAGGA ATACGAACTT
TCTAAAAGGC TCTTGGCTGA TTGGCTTGCA ACCCCCGAAG CTATTGATGT CAAATTGGGT
GGTATGGGGC AAACTATTGT ATCGCAAATT AATGCGTACA TGACACTGTC ACTTTTTCCT
CATGAACAGC GCTGGGCTAG ATATCGCTAT TTATACACAC AAGCATTCAA CACATCTGCA
AGCTTGTATG CCGAGGCAGA AAATAGTGCT TTAAAACGAT GGGGCGACGG GGTCAGGCCA
AGCTTTTCCG TACCAAAAGC AACTCAGGTT ATAAACGAAG GGACACAACC TAGGTCAAAG
AAGAGGCATC AAAAAGCTGT TTACAATTTA AATGCTGCCA AGACGAGAAA GCCTGCCTAC
TATGCAAACA TTGGGGATTT AGTGGATTAC ATTCAAGATT CTCTTTCCAA AGATTTTGAA
GCAGCTGCTT CATTTGTGCT CTTCCGTCCA AATGCAGACC AGTTTTGGGT CAAGCAAGCC
ACTTGCAAAA GCAAAAACAC GGACATTCGG AAAATCAACA ACAGTAGCTA TTACAAGTAC
ATGATTCCGC AGTTTGAACG CACACGAATT GTGGAGCTTG TTAATATTGA TGGTACATTC
TATTTGGTGT GTAGCTGCGG AAAATTTCAG CGACAAGCTT CCCCATGTGC CCATCTTTAC
AAGGTTCTTG GTCAATCACC CACGTCAACC GATGTCTCTG TACGCTGGAC AAAGCACTGG
GATGTGTATT TGCACCGAAG TGGCCACAGT GACCTGTCAA AGCATTTGGA AGACCTGTAC
AAACAGGAGC GACCAGGTCC AGTATTTGTT GATAGTGGTC AGTGGGTGAT CGGAAAAGGT
GAAAAAGGGT CAAATTTTTT CGAAACTTCG CTTCCGTACA AGCCCCCTGT CATACGAGAT
TTTAATCGAT GGGCAGTGTC TTCGCAAACA ACTGGAGCTG ATTTGAGTGG GACCAAAAAT
ACCACAAATA TGTATTTTTC GAGTGGAATG GTGCAAGAAT CAACAAGCCT GTCCAGAGAG
CATGCATTCC AGGATTCATT GCATTAA
 
Protein sequence
MHLTITDGDP KEYRTFVDAI PTFYPLCEHK LCHWHLLYRS NLMKVQTGKC GVKATILFRV 
VVLWIESWMT KIETQEEYEL SKRLLADWLA TPEAIDVKLG GMGQTIVSQI NAYMTLSLFP
HEQRWARYRY LYTQAFNTSA SLYAEAENSA LKRWGDGVRP SFSVPKATQV INEGTQPRSK
KRHQKAVYNL NAAKTRKPAY YANIGDLVDY IQDSLSKDFE AAASFVLFRP NADQFWVKQA
TCKSKNTDIR KINNSSYYKY MIPQFERTRI VELVNIDGTF YLVCSCGKFQ RQASPCAHLY
KVLGQSPTST DVSVRWTKHW DVYLHRSGHS DLSKHLEDLY KQERPGPVFV DSGQWVIGKG
EKGSNFFETS LPYKPPVIRD FNRWAVSSQT TGADLSGTKN TTNMYFSSGM VQESTSLSRE
HAFQDSLH