Gene PHATRDRAFT_40861 details


Gene Information

Locus tag: PHATRDRAFT_40861
Symbol:
ID: 7198786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 129230
End bp: 130537
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002184895
Protein GI: 219129436
COG category:
COG ID:
TIGRFAM ID:
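
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 129230
end_bp = 130537

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced (as the record states) and its span
# includes the stop codon, so one codon does not contribute a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.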


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTCAAA AAAGTAATGC TTTAAGCTTA CGAGTGGCTT GGGCCGTCAT ATTGGCGCTC 
TCGTGTTCTT CGGCATTCTT TGGCTACTAT AACAACGTTC CTTGTCCGCA GGTGTTCCCT
CCGGATTGGC AAGTCACCGA AAAGCATCGC TTCAGCAAGG CTGTCACCAC GACAGATGCA
GCAAAGGATC TTGATGCACG TTTGAAACGA GCGTTTGATG CCAGTGTATT GCGAAAACAT
CGTGCTGCCA CCCTAGAACG GGCCAGCGTG GCTCCATACA AAAACATGGA TTTACAGTTA
TACCACAAAC CACATCCAGT TCTTAATCCG CTGGATCCAA AGTTGCGACC TAAGCCAGAA
TGGGGTAACA CCACCTTTCC TGACATCAGC GTCGTGGGCT TTCCAAAGGC CGGAACCACT
CAGCTGTACA ATATTCTCGT TTCACACAGT GAGGCAGAAG CCTTCAACAA GCGCGACAAG
GAATTCTGTT TTGCGGGGAA TGAATCATCC TTGTCCCGGA ATAACAACTG GGAAGACTTC
GTTCCAGGAT CACGACCGAC CACCATGCAA ATCGAACTCC AGGAAGCCTT GCATACAGCC
TTACAGAAGC ACCGAAACCT TAGAACATCT TCTCAAAAAA AAACTGTAAA CGGATGCTTA
AGCCAGCGTA TCGTCTCGGT CGTTTACGAC TACTTCAATC AACCATCGAA TAAAAAGTTT
ATCATTGCAT TGAGAGACCC TGCGGATTGG TTGTGGGCCG TTTACAATTT CTGGGCTCTC
CCGGATATAG ATACCGTTGT TCCCCGCCCA GATTGGGCTG CACCAGAGCA ACACTATCGG
TCGCCCGAAA TGTTCCACGA TCTTGTAGCC TCCAGTCATG AGATGCTTTT TTTTGAAAAA
ATGTTGGGAT CAAGGGGGAA GCATGCTATG GATTACGTTT GGCAGTTCGA AGCAATGGCG
GGACGGGAAA ATATTCTCTA CATTCGCAAC GAAGATCTTC TACCAGGGGT TGTCGCGCGG
CCGGGAGGAG TCCTCGACCA GCTGGCTGCT TTTACGGGCC TAGATCGTAA AGGTTTTGAC
TCGCAGACGT TCGGCGAGAT ATCCAACTGC AACGACCAGA AAGGGTTTGT GAAAAAATGT
GGAACAGCCA AGAGTAACGC GTACGAAATC ACTGGAGGAA GATCCATGCT TCCAGAAACG
CGCACTCTGA TATATTTACT CTATTACGAA GAATGCAAAC TGTGGTCGCA AAGATACGAT
GTTGTCTACG AGGACTGTTT GAATGTGTTG GAGGCAACTA AATCTTAG
 
Protein sequence
MGQKSNALSL RVAWAVILAL SCSSAFFGYY NNVPCPQVFP PDWQVTEKHR FSKAVTTTDA 
AKDLDARLKR AFDASVLRKH RAATLERASV APYKNMDLQL YHKPHPVLNP LDPKLRPKPE
WGNTTFPDIS VVGFPKAGTT QLYNILVSHS EAEAFNKRDK EFCFAGNESS LSRNNNWEDF
VPGSRPTTMQ IELQEALHTA LQKHRNLRTS SQKKTVNGCL SQRIVSVVYD YFNQPSNKKF
IIALRDPADW LWAVYNFWAL PDIDTVVPRP DWAAPEQHYR SPEMFHDLVA SSHEMLFFEK
MLGSRGKHAM DYVWQFEAMA GRENILYIRN EDLLPGVVAR PGGVLDQLAA FTGLDRKGFD
SQTFGEISNC NDQKGFVKKC GTAKSNAYEI TGGRSMLPET RTLIYLLYYE ECKLWSQRYD
VVYEDCLNVL EATKS
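
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how one might strip that layout, compute GC content, and translate the start of the gene sequence. The Translation table field in this record is blank, so the sketch assumes the standard genetic code; the function names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Assumption: standard genetic code, written in the usual compact TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
first_blocks = clean_sequence("ATGGGTCAAA AAAGTAATGC TTTAAGCTTA")
print(translate(first_blocks))
print(gc_percent(first_blocks))
```

Translating the first 30 bases this way reproduces the first ten residues of the listed protein sequence. Running `gc_percent` on the full cleaned gene sequence gives a figure to compare against the GC content field.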