Gene PHATRDRAFT_47941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47941 
Symbol 
ID7203130 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp508097 
End bp509817 
Gene Length1721 bp 
Protein Length381 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182406 
Protein GI219124218 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GGAGATATAC TACCGCTGAG AGTGGTGATG TTCCCTTTAC GCTCCACCTT AACTGTAAAG 
CATTTGTCGC ATCGCGTACA CACTAGACAG GAAGTTGCTT GTGTACAAAA CCTTTGTTGT
TCGACCCATT TACGGAGGCC TTTTCAGTTG TTCATGGTAG TTGCCGCTCC ATAGTGCGCA
GAGCGCACGT TGAGGAGATG TCACGATAAC GCCCCGTTGG TGCAGGTTGA CTCCTTGATC
TATAGTACTT ATCCGTGACG AGTTGCGTGT TTTTTGCGGA GTTGCTCTCT TTTTTGGGCT
GTTCAAAGGT TCACCGTGTG CGGGTCTCCA AACTTTGCTT TACACGCCAT GATGAGTAAA
CACTTTTACT TGTTTTCATT TTCGTTGTTG AGTGACATTC TTTGTTTGGA TCTCGACAAC
AAGAATGGAA AGGTCTGGTT CAGTCGAAGA CTTTCCCCCT ACTTTCGTGG TAAGTATCCA
ACGCGCGTGG CGTGGCTTCC GCAAATCAAC ATATTGTATA TTGTTGGCGT GGTTTGGCTC
ATTGACATTT TGTTTCGTCC TGACTTGTGT TTCAGTCGTA TCGCTCTGAG TACCCTCGAG
AAGAATCTTA CCGCTGCATG ATGAACAACC GAACTCCCAA TTCAGTCTCA CCTCCAGACT
TTTTGAAGAT TCCTTGCTAT CGCCCGGTGC CGAGGAAAGG TTTCTCGTCG CATCATGATA
GCTCATGTCA TCCTGGTTTT TATGCCGGGA ATCATTCAGC TAATGTACCG CACTACGCAA
CCCAGGCAGC ACCATCATTC GATTCGACGG GTAGTAGTCA CGGAGGCCAC TATACTCCTC
CTCCTCCTCC TCCACCAGTA CTCAGCAACA TCAATCCACA TTCTCATAGC TATCCTCCAC
CGTACCACGG CGGTTACCAA TACTACGCTC CCTGGCCCAA CACGCCACCA CCCGAATACG
TGACAGACAT TCAACCGGAA GATGTCCTTT CGGGACGCGG CGGCGCCACC AATTCGCACT
CTGGTAACAG AGCCTTTCGT ACTCTCGTGA AAGATTTTCA GGAGCGATAT CTGAAAGCCA
AGAAACGAGA CAAGCCGTCG GTGGCTTCGC TTGTCGTGGA ACTGGTTCGT CAAAAGGGCG
GCCGCTTTCT CCGTAGGATG GGCACCGATT CCGATGGCCA GGTTTTGTGG ATAGACATTG
GCGATGAAAG AGCTCGTGAG AAAACATGTC AGGCCCTGCG GGAAGGTGCC CCTTTGTTGC
GTCGATCGAG GCACACACCG AGATCATTCG ACGACGTGGT GGATGCAAAA CTGCACGATT
CAATAAAAGA GAATGACAGT TTTGAGACGC CGTCGAGCAC GGTACGTACC ACTCCAGCAA
GAAATCTCGG CCACGGGATG GTACAGTCTT CCATCGTCCG TGTCGTCCAA GACAATGAAA
ACTGGATGAA AGGGAGTATT TTTTCATCCA GCAAAGACCA CGACATTAAT GACGGTCCCA
TTGTGATTCG ACCGATGCGT CGGCTATTGC ATCGTCGGTC AGTTGCTCCA ATCCCTTTGG
ATCAACTATC TCCACAAGAT CGGGATTTAT ATCTGCGAGA CTTTTTGCCG CCGTGTCCGT
CAATAGGCAA GCAGAGCAAT ATTGCTGCGG AGCCCACGGC TTCGCCTTCG CACCATCCCG
TGGAGTACGT GGAGAAACCA AACCCTCGGG CTACTATATA G
 
Protein sequence
MERSGSVEDF PPTFVSYRSE YPREESYRCM MNNRTPNSVS PPDFLKIPCY RPVPRKGFSS 
HHDSSCHPGF YAGNHSANVP HYATQAAPSF DSTGSSHGGH YTPPPPPPPV LSNINPHSHS
YPPPYHGGYQ YYAPWPNTPP PEYVTDIQPE DVLSGRGGAT NSHSGNRAFR TLVKDFQERY
LKAKKRDKPS VASLVVELVR QKGGRFLRRM GTDSDGQVLW IDIGDERARE KTCQALREGA
PLLRRSRHTP RSFDDVVDAK LHDSIKENDS FETPSSTSSI VRVVQDNENW MKGSIFSSSK
DHDINDGPIV IRPMRRLLHR RSVAPIPLDQ LSPQDRDLYL RDFLPPCPSI GKQSNIAAEP
TASPSHHPVE YVEKPNPRAT I