Gene PHATRDRAFT_39660 details

Gene Information

Locus tag: PHATRDRAFT_39660
Symbol:
ID: 7195288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 367968
End bp: 369158
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002183735
Protein GI: 219127005
COG category:
COG ID:
TIGRFAM ID:
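
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon; both assumptions agree with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    If the CDS includes a terminal stop codon (assumed here), that codon
    does not contribute a residue.
    """
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the table above.
length = gene_length(367968, 369158)
print(length)                  # 1191
print(protein_length(length))  # 396
```

Both results match the Gene Length and Protein Length fields, which is consistent with the "Is gene spliced: No" entry.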


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGTCT TTCGAATTTG TTTCTTGCTC GCCTTGAGCA GCTCATCCTT ACTGTGTCAA 
GGCACAGATT GTGCGGCTAC TTCTTTCCAG TCAGTTCGCC GAGACTACCA CGATGACCAA
ATGTGCGATC GCGGTACCTT CAACAAGTTC AAGGAAAGTT TAAACCGCAA TTTTCAGTAC
CTTCAACTCA AGCAACTTGG GAACGCCTTG GTTCCCGACC AGAAGACAGT TGTTTGCCCT
AGCCACGTGC ACAATCTTCA CAACGAGAAC GACATCATCG GCTTTGATTC CGTCTGGATA
GTGGAAAACA CCGCCTCGTC CCCAATTGTG CTGAGCTTCG TGCACGGCGA TGGCCGAGAA
AGCTCAGCAC TTAACCCCAA GATCAGTCCA GCGCAAGTAG ACCCCCGCGC AGTCGTCATG
CCGGGGGACT ACCGCGCTGT CAATACATTT GAGGGTCACG TTTTTCATGC CCGCGAAATG
CTTCCAGATG GTGGTGCCGG TAGAGTGTTG TTGCAACATC GTGTGGGGTA TATCCCTATA
GGATTGAATC AATCAAACGC GGCGTGCTCC GGCCAAGATC TAGAACCCGT GATCACCGAC
GCATTAACCG ATGAGACCAG AATAGCTCCT GAATATGCCC GAACACCTCC TAAGCCTTTC
CTTGACTGCA ACGCCCTCCA TGTCGGTTTC CGGAACAAGG TAGGTTGTCC GGTGCACGGC
TTTTTCGTAG AAGCCACAGA AAATGACGAC TGTCATGAAA ATTTTAAGTT TCATTTGGGA
GTCAATCCGA TGACGGATGA TTTCATGTGG AGCTGGGATT CTCCTACCAA GTTTGAAACT
TCTTACATTG GACACACGTT TGCCTTTCGC CTCGCCGATC GTCCTGGTGT TCTGGTCGAC
AAGGTGACGC TCGGACCCAC ACAAATTTCC GATTGTCCGG GCCTGGCCCA AAGCTTCGCC
ATCCCAATTG GAGCAGATGG TCAGCTCCTG CCAGTTGCTC GCATGCTTTG GGATGCCAGC
CATAAGACCT CTGTCTACCA CGTGAACAAT CCTGGGCTTT ACCGCCACTC TAACTCCACA
GCGAGTTCTG TACTCGTCAA CACGAACGCA TCGCTACCCT CGGCTGCTCG GTGCGCCGGT
GCCAGCTCGG TTGCGAGGGA ACGTAGTCCT CTATTCACGC TCACAATCTA G
 
Protein sequence
MMVFRICFLL ALSSSSLLCQ GTDCAATSFQ SVRRDYHDDQ MCDRGTFNKF KESLNRNFQY 
LQLKQLGNAL VPDQKTVVCP SHVHNLHNEN DIIGFDSVWI VENTASSPIV LSFVHGDGRE
SSALNPKISP AQVDPRAVVM PGDYRAVNTF EGHVFHAREM LPDGGAGRVL LQHRVGYIPI
GLNQSNAACS GQDLEPVITD ALTDETRIAP EYARTPPKPF LDCNALHVGF RNKVGCPVHG
FFVEATENDD CHENFKFHLG VNPMTDDFMW SWDSPTKFET SYIGHTFAFR LADRPGVLVD
KVTLGPTQIS DCPGLAQSFA IPIGADGQLL PVARMLWDAS HKTSVYHVNN PGLYRHSNST
ASSVLVNTNA SLPSAARCAG ASSVARERSP LFTLTI
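
The sequences above are printed in blocks of ten residues separated by spaces. The following is a minimal sketch of how that layout could be stripped and the record checked against itself. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of the source database. The standard genetic code is an assumption, because the Translation table field is empty in this record.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 0, stopping at the first stop codon.

    Codons not in the table (for example, ones containing N) become 'X'.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGATGGTCT TTCGAATTTG TTTCTTGCTC GCCTTGAGCA GCTCATCCTT ACTGTGTCAA"
print(translate(first_line))  # MMVFRICFLLALSSSSLLCQ
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare with the 52% reported in the Gene Information table.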