Gene PHATRDRAFT_47592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47592 
Symbol 
ID7202647 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp195191 
End bp196390 
Gene Length1200 bp 
Protein Length389 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182023 
Protein GI219123420 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.823619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATCA ATCAGAAATA CTTCAACGCC TCACTGTTGC TGGTATGTTC AACTGTATGT 
ACCCTGTTCC AGTTTCGTAT TCTCTATCAA ATGGGCAGAA GGGCAGTCTT CCCCGGATCC
AACTCAGGCG ATGAAGGACA GCAAGATCTG GAGTCGTGGC AGTTGCGTCC AAAATCACTT
CGCACCGTGA TTGTGCCAGA GTCACCGACT AGGACGTGCG CCATCAATCT CTACGGGTTG
CCAAGAGCCT TTGAATCCCT TGCTTTACCG ACACTCATAA AGAACGTGAT TCGTCCCAAT
GCTATCAATG GTTGCGACTA CTTTGTGCAC TACTATTACT TGACAGAAGA AATGCCGGGG
CGGTCCGGCG AAGGAGGTCG CATAAATCCA AATGAGGTAG TGAAGTTGAA GCAAGCAGTT
CAAGAGTTCT CTCCAAAGTC AGTTATTCAA TTTCGCTACG ATAAGGAACA GGCCTTTTGG
GATCAATATC AACCGCTTAT TGACAAAATC CGAGCAAGCA ACGATACGGA TGGAAAATTC
TTATACTTTC CGTGGCGCAG CCCATCGTAT GTATACCCAA TCACTACTGA TAACATTGTT
AAGATGTGGC ACAGCATTCA GTCAGCGTGG AGTTTAATGA CGCTGTACGA GACTCTGACA
ACCCGAAAAT TTGACAGAGT TGCGATGCTA CGCTTGGATG TCGTCTACAT TACACCAATC
AACGTATTTC AAGTGAATCG ACGAGAGGTT GGCAAGAATG AAAAAGTGGC CTTAATACCT
GGCTTTGGAC GTCATCCCGT CAGCGATCGT TTGATTATTG GACCACGAGA GGCGATCGAA
ATTTGGGCGT CGCAGCGATT TGATCGTCTC GAGGAGCACG TGAAGTTCGT ACACGAGAAG
CATCCGGGCT GGGGTATGCA TTCCGAAATG TTTCTCCATT GGACAATCTT CCCGGCCATT
CGAGACACTG GGACGAGCAT CGTGGAAGAC GACAGTTTGT GCTTTTTTCG GGCGCGCGCC
GACGAGTCGG TATGGGTTAG TGATTGTGGT GGGAAACCGG AGTATGCCAA AACCTCCATT
TTGAAGAATC TTGGTGGGGA CAAAGTTGAA GTATTGGAAT CGGTACTCGG ACGGAAATGC
CGAGGCGAAG CTCAGAATCT TTCTTGGTCC TTTGTAGCCC TGGGCTGTCC AGCAGGGTAG
 
Protein sequence
MPINQKYFNA SLLLFRILYQ MGRRAVFPGS NSGDEGQQDL ESWQLRPKSL RTVIVPESPT 
RTCAINLYGL PRAFESLALP TLIKNVIRPN AINGCDYFVH YYYLTEEMPG RSGEGGRINP
NEVVKLKQAV QEFSPKSVIQ FRYDKEQAFW DQYQPLIDKI RASNDTDGKF LYFPWRSPSY
VYPITTDNIV KMWHSIQSAW SLMTLYETLT TRKFDRVAML RLDVVYITPI NVFQVNRREV
GKNEKVALIP GFGRHPVSDR LIIGPREAIE IWASQRFDRL EEHVKFVHEK HPGWGMHSEM
FLHWTIFPAI RDTGTSIVED DSLCFFRARA DESVWVSDCG GKPEYAKTSI LKNLGGDKVE
VLESVLGRKC RGEAQNLSWS FVALGCPAG