Gene PHATRDRAFT_20120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_20120 
Symbol 
ID7200453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp874406 
End bp875534 
Gene Length1129 bp 
Protein Length339 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179931 
Protein GI219118307 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AATCCCCGCG ACCGCGTCAT CAAGTACGCC GCGCTCTTTC TCCTCGTCGC CCAAATGGTA 
GGGCTCGTCC TACTCATGCG CTACTCCCGC ACCCAACATG ACGATACTCA ACCGCTCTAC
TTGGCATCCA CTGCGGTCTT TCTTATGGAA GTTATGAAAC TCGTTATTTG TGTCGGTGTC
ATTGCCGTCC AGACTAAATC GGGGGTGCTG CACGAACTCT ACACTCACAC CATCGGATCC
CCTTTGGAAC TGCTCAAACT GACCGTGCCC TCCTTGCTGT ATACCGTACA GAATAATCTA
CTATATCTGG CGCTGACGAA CTTGGACGCG GCTACGTACC AAGTGTGCTA CCAACTCAAA
ATTCTTACCA CGGCTCTCTT CAGTGCGCTT CTCTTGCAAC GCAAGTTCTC CACCATGAAG
TGGTTGTCGC TGGTTGTTCT TACGATTGGA GTTGCTATCG TTCAGCTTTC CGGCAGCGGT
GACCAACATT CGGAACAAGA CAGCAAGGCC GCGACTGACG CTGTCGATGA TACTAATGGA
ACCGCGGCGG CCCACACGCG TTGGGTGGGA CTCGTGGCCG TACTGTGCGC GGCATGTACC
TCAGGCTTTT CTGGCGTCTA CTTTGAAAAA ATCCTTAAAG GATCCCGGAC GTCTCTCTGG
ATCCGCAACG TCCAAATGGG ATTGTCCTCC ATCGTAATTG CGTACTTGAC GGTTTACGTC
AAGGATGCCG AGGCCATTCG GACGCAAGGT TTCTGGGGCG GCTACAACAC TCTCGTGTGG
ACCGTCGTCA CGGTCCAAGC CGTCGGCGGC CTCATCGTGG CTACCGTCGT CAAGTACGCC
GACAACGTAC TCAAAGTCTT TGCTACCAGC TTTAGTATCG TCGTGAGCTG CATCGTGTCG
GCGTTCCTGT TCGACTTTCA TCCGTCCGTA TCCTTTCTCG TCGGCGCGAG CCTGGTGGTC
ACGGCCACCG TTATGTACAG CTCACCCGAG ACCCGGACAC GCAAAACGCG GCGAAGACCC
GTTTTACCTA TTCACCATCG TAATAATACT GCAACCACCA AAAGTCGGGT CTAATCATTG
CTGCTTAATG AAAAATTGAA GTTAAGTGGC ATTGCGTGTT AAATGACAA
 
Protein sequence
MVGLVLLMRY SRTQHDDTQP LYLASTAVFL MEVMKLVICV GVIAVQTKSG VLHELYTHTI 
GSPLELLKLT VPSLLYTVQN NLLYLALTNL DAATYQVCYQ LKILTTALFS ALLLQRKFST
MKWLSLVVLT IGVAIVQLSG SGDQHSEQDS KAATDAVDDT NGTAAAHTRW VGLVAVLCAA
CTSGFSGVYF EKILKGSRTS LWIRNVQMGL SSIVIAYLTV YVKDAEAIRT QGFWGGYNTL
VWTVVTVQAV GGLIVATVVK YADNVLKVFA TSFSIVVSCI VSAFLFDFHP SVSFLVGASL
VVTATVMYSS PETRTRKTRR RPVLPIHHRN NTATTKSRV