Gene PHATRDRAFT_3787 details


Gene Information

Locus tag: PHATRDRAFT_3787
Symbol:
ID: 7201345
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 300606
End bp: 301871
Gene Length: 1266 bp
Protein Length: 333 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002180408
Protein GI: 219119290
COG category:
COG ID:
TIGRFAM ID:
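
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption, though it agrees with the listed gene length) and one 3-base codon per amino acid; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 300606
END_BP = 301871
PROTEIN_LENGTH_AA = 333

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = END_BP - START_BP + 1

# Each amino acid is encoded by one 3-base codon.
coding_bp = PROTEIN_LENGTH_AA * 3

# Bases in the gene span not accounted for by the listed protein.
# For a spliced gene these would be expected to lie in introns.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 1266
print(coding_bp)       # 999
print(non_coding_bp)   # 267
```

The 267 bp difference is consistent with the record marking the gene as spliced, although the table itself does not list intron positions.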


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAGGTCCTGT TCGCCACGGA TGAATGGTTT GCGGCGGCGG ATAACCTGTT GAAGGACACG 
GACCCCGTGT TTGATCCGGA AGCCTTTTGC GAGCAAGGCA AAGTCATGGT ACGTCAACAT
GTTCTGACGA GTTCTCGGAG ACTTGTGTGT CGACCTACTT AGCTCTACCT GTGCTCATAC
TACACTTTCT CTCACTACGT GTAGGATGGT TGGGAAACCC GCCGCCGCCG TGAATCCGGC
CACGATTGGT CGGTCCTCAA GCTTGCCTCC CGCGGTATCA TTCACGCCGT CGAGATCGAT
ACCGCACACT TTACCGGTAA CAACGTTCCG CAAATCTCGA TTGAGATCGC GGATCTCTCC
TGCACGGAAG AAAGCCGCAT GGTAATGCAG TTCCCTGGCG CTCTGGATCG TCTGTTGCAC
GGATGTGTGC AAGGCACCGG TGCCAGTCCT GAAGAAGTTC AACGCGCGGA AGAGGCCTGC
CAAGGCGTGC AGTGGAAGAC TCTCCTCGAC AAGACAGCTT TGCGTCCCGG CTACGAACCA
ACCCGTCTAC ACTACTTTAG TGTGGATGCG GTGGAAGGGA CTCACATTCG CGTCAACTAC
TTCCCTGACG GAGGCGTTGC TCGCATCCGT CTTTGGGGCC AACCCATCGA CGAGGGTGGC
CCTTTGCCCC GCCCTGCTTA CGTTCCCATC AAGACCGGCC GCACTTGCTC CGTCATTTGT
CATGGCGAAG AAGTTGAATT GCCGTCACGC ATGCCGTACA TTTTTCCGGA GATCTCCGGA
CAGGAGCACG GAGGTGTTGG ATTTTCCTGC TCCAACAAGC ACTACGGAGA TCCCTGGAAC
CTTATCCAGC CCACTCTGGG CCGAGATATG GGTGATGGCT GGGAAACCGC CCGTCATCCG
GAGCGCCCCG CTGTTCTCCA GCGCAACCCC GTCACCAAGC TCGTGGACAG TGACTTGATG
GACTACTGTG TTATCAAGTT GGGTGCCATT GCTGGCGACG GCATTGCGCG CATCATTCTA
GATACGAAAC ACTTTCGCGG TAACTACCCC GAATCTGTTC AAGTCCAAGG ATGCTGTGCT
CCGGACGATA AAGTCACGGA AAACGAGGCC ACTTGGTTCA CCTTGATCCC CCGTGGTCGC
ATGGCACCCG ATGCTGAGCA CGTCTACGAA TGCGACAAAG GTCAGATCGA AAATGTCCAC
AAGGCTGTGA CCCACATCAA GGTCAGCATC TACCCCGACG GCGGCTTGAG CCGTGTGCGT
GTCTAC
 
Protein sequence
KVLFATDEWF AAADNLLKDT DPVFDPEAFC EQGKVMDGWE TRRRRESGHD WSVLKLASRG 
IIHAVEIDTA HFTGNNVPQI SIEIADLSCT EESRMACQGV QWKTLLDKTA LRPGYEPTRL
HYFSVDAVEG THIRVNYFPD GGVARIRLWG QPIDEVELPS RMPYIFPEIS GQEHGGVGFS
CSNKHYGDPW NLIQPTLGRD MGDGWETARH PERPAVLQRN PVTKLVDSDL MDYCVIKLGA
IAGDGIARII LDTKHFRGNY PESVQVQGCC APDDKVTENE ATWFTLIPRG RMAPDAEHVY
ECDKGQIENV HKAVTHIKVS IYPDGGLSRV RVY
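
The GC content reported in the Gene Information table can in principle be recomputed from the gene sequence block. The following is a minimal sketch, assuming the sequence is pasted as whitespace-separated groups of ten bases as shown on this page; `gc_content` is a hypothetical helper, and only the first line of the gene sequence is used here for brevity.

```python
def gc_content(grouped_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence; paste the full block to check the whole gene.
first_line = "AAGGTCCTGT TCGCCACGGA TGAATGGTTT GCGGCGGCGG ATAACCTGTT GAAGGACACG"
print(f"{gc_content(first_line):.0%}")
```

Because `str.split()` with no arguments splits on any whitespace, the same function accepts the whole multi-line block without further cleanup.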