Gene PHATR_21030 details


Gene Information

Locus tag: PHATR_21030
Symbol:
ID: 7204611
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011679
Strand:
Start bp: 223865
End bp: 225267
Gene Length: 1403 bp
Protein Length: 390 aa
Translation table:
GC content: 45%
IMG OID:
Product: predicted protein
Protein accession: XP_002185659
Protein GI: 219120855
COG category:
COG ID:
TIGRFAM ID:
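
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding region consists of one codon per residue plus a single stop codon. The function and variable names are hypothetical, not part of the record.

```python
# Values copied from the record above.
start_bp = 223865
end_bp = 225267
protein_length_aa = 390


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


gene_length = span_length(start_bp, end_bp)

# Assumption: coding bases = one codon per residue plus one stop codon.
coding_bases = 3 * (protein_length_aa + 1)

# Whatever is left over would be intron and untranslated flanking sequence.
noncoding_bases = gene_length - coding_bases

print(gene_length, coding_bases, noncoding_bases)  # 1403 1173 230
```

Under these assumptions the span reproduces the listed gene length of 1403 bp, and the surplus of 230 bp over the coding bases is consistent with the gene being marked as spliced.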


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCAAATATAG TTGTCCTTGT CGTTGACCAT GGTGACTCCC AGCCCTAAAG CCTCTGCTGG 
CGATATTGCT ACCTTCAACG TACTCCATGG ATTCCCGGAA GCGCTTGTTC GTGGTATGCG
TTCCAGCTTT CTATCAGATG CTGATTATCA CCATCTTACG CAGTGCGAAA CGTTAGACGA
TGTGCGTTTG AACCTAACAG AGTCTGATTA TTCGGATGCA CTTGCGGATT CTGCGACGAT
GACCCCCGCT TCTCTTCAGA AGGCAGCTAT TGAAAAGGTA CAGAATTAGA TTGAAAATTG
TAAAACTAAT GAGAAGCAAT GGAATAGGCA TGACGGAAGA CACCTCTCTT TGTTGTCTGT
TCTTTAACCC TATGCTTCCT GTTTCTACTT CAGCTCGTTA CAGAGTTTCA GTACCTGCGC
TCCCAGTCTG TCGAGCCGCT ATCGACATTT TTGGACTTTA TTACGTTCGA ATATATGATT
GAAAATGTTA TGCTTTTGTT AAAGGGTGCC TTGAGCGGCC GTGATATTAA CGAGCTAATT
GAACAGTGCC ACCCGCTCGG TATGTTCAAG GAAAGTACTA TGCGTAGTAT TCCAACCTTT
GAAAACAGTT CGAGAGGTTA CGCTGATTTG TACCAGACGG TGCTTGTTGA TACCCCCGTT
GGTCCCTACT TTGCCATGTT TTTGCAGGAA AGTTCGGAAC ACCGTGATGG AGACTCTCGA
AACGTTCTGG AAGAAGTTGA GATCGAGATT ATAAAGTCTT CGCTAATCAA GTATTGGTTA
GAAGACTTTT ATCAGTTTGC CATGAAAATC GGAGGAGATA CATCGCAAAT TATGGGCGAG
CTGTTGAAAG TTCGTGCTGA TACCAATGCC ATTAACATAA CGTTGAACTC CTTTGGGACT
CCTCTAAACG AGCCTTCGAT GCGCTCTTCG GACCGCAAAA GGTTGTATCC TTCTGTTGGT
CATCTGTACC CCGCTGGCAC TACCATGCTG ATTGATGTTC AGGATGAGGA TGAACTTGGT
CGTGTTCTTG AATTGTTTCC GCAGTACTCT GCTATATGGA GCATTCACGC GTCTGGCAAC
GGCGATAAGA GCATTGATGA TGCCTTTTAC GAGCGCGATG TCCAGCAGCT AGAGCTCGCC
TTTGAAAGTC AGTTTCACTA CGCCGTCTTC TACGCGTACG TGAAGCTGAA GGAGCAAGAA
ATCCGAAACA TGGTGTGGGT CTCTGAATGC ATTCTTCAGC AGCAGAAGGA TGAGATTAAT
AAGTTCGTCC CAGTATTTTC TTATCATTCT CCCTGGCGCG ATGGCAAGAA GCGATAATTC
TGTTGCTTTC AGCTGATACA TTTTTATTGA TAGTGAATGA TAACGAATTG CAGGTATAAG
ACTCAAACCT TTTTACGACG GTT
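
The sequence is printed in space-separated blocks of ten bases, so it needs to be joined before its length or GC content can be compared with the Gene Length and GC content fields. A minimal sketch, assuming the text contains only base letters and whitespace; the helper names are hypothetical:

```python
def clean_sequence(text):
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in the cleaned sequence (0.0 if empty)."""
    seq = clean_sequence(text)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example input: the last line of the gene sequence above.
last_line = "ACTCAAACCT TTTTACGACG GTT"
print(len(clean_sequence(last_line)))
print(round(100 * gc_fraction(last_line), 1))
```

Passing the full gene sequence text to the same two functions gives values that can be compared against the 1403 bp and 45% listed in the record.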
 
Protein sequence
MVTPSPKASA GDIATFNVLH GFPEALVRGM RSSFLSDADY HHLTQCETLD DVRLNLTESD 
YSDALADSAT MTPASLQKAA IEKLVTEFQY LRSQSVEPLS TFLDFITFEY MIENVMLLLK
GALSGRDINE LIEQCHPLGM FKESTMRSIP TFENSSRGYA DLYQTVLVDT PVGPYFAMFL
QESSEHRDGD SRNVLEEVEI EIIKSSLIKY WLEDFYQFAM KIGGDTSQIM GELLKVRADT
NAINITLNSF GTPLNEPSMR SSDRKRLYPS VGHLYPAGTT MLIDVQDEDE LGRVLELFPQ
YSAIWSIHAS GNGDKSIDDA FYERDVQQLE LAFESQFHYA VFYAYVKLKE QEIRNMVWVS
ECILQQQKDE INKFVPVFSY HSPWRDGKKR
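
As a rough consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein. The sketch below assumes the standard genetic code (NCBI translation table 1), which is an assumption because the Translation table field in the record is empty. It translates only the first line of the gene sequence, since the gene is spliced and a straight read-through would run into an intron further along. All names are hypothetical.

```python
# Standard genetic code (assumed; the record does not state a translation table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons from the first base, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "TCAAATATAG TTGTCCTTGT CGTTGACCAT GGTGACTCCC AGCCCTAAAG CCTCTGCTGG"
cleaned = "".join(first_line.split())

# Locate the first ATG and translate from there.
first_atg = cleaned.find("ATG")
print(first_atg, translate(cleaned[first_atg:]))  # 28 MVTPSPKASA
```

The ten residues obtained this way match the first block of the protein sequence, which suggests the listed gene sequence carries a short untranslated stretch before the start codon.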