Gene PHATRDRAFT_47094 details

Gene Information

Locus tag: PHATRDRAFT_47094
Symbol:
ID: 7202010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 384943
End bp: 386508
Gene Length: 1566 bp
Protein Length: 503 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181198
Protein GI: 219121698
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTTTCTCTT GTAGAATGTC GGTTGTTACC AGCTAGCCCT GACGCACAAA GCAGATGAAT 
TCGTCGTCGG TATGCCACGA AATTCCGGAT TGTCCCAGGC TCTTTGTCGA TGCGTTTCGA
GGAGCCTACC TGCAAAAAAC CCGAAATCCG GACAACGTTT TCATACTTTC TCACTGGCAC
GGGGATCATT ATGGCTCGCT ACCCAGGGAT GGCAAATATC AGGGTCCGTC ATTGATTCAC
TGTACGCCTA CTACTGCAGC TCTTCTACGG GAGATACACC AAGTACCTGA AAAGTACGTC
GTAGAGCATG GGTACGGAGA GACCTGGCTT TTCCATCCTC TCGGTAATAC GAAGGGTCTC
TCGACTGTTC AAATAACGTT TTACGACGCC AATCATTGTC CCGGTGCTGC TATAATTGTC
GTCGAGATGG CCGACGGTAA GGTGCACCTC CACACAGGAG ATATGCGCTA TCACACAATG
ATGAACGTGT ATCCTATTCT TGAACGAGCA GCCAGCAGTC GTGCTATCGA CACAGTTCTT
TTGGACACAA CATACTCCGA CCCTAAACAC AATTTCCAAC CTCAGGAAGC TGCAATAGAT
GCTATCGCAG CGTACTCAGA AGGATTACTC GGCACTTCCA GAAAATGCTG TTCCAATGTC
CTCATTCTTC TGTCTTGTTA TAGTATCGGG AAAGAGAAGG TTCTATGGGA GGTCTCTTCT
CGGACGAATC AGCTAGTTTA TGTGAACGAT CGCAAAATGC GAATGATGCG CTGCATTCAG
AAACACCATG AGAGTTCTAG CCAGATTGTT CAGCGGTGCA CGACTGACCC AAACGCAACG
GATATCCACG TCATTCCCAT GGGTCTTGCT GGAGAACTTT GGCCCTATTT CCAACCAAAC
TATTGGGCAT GCGCTGAATA CGCAAAAGCA CTAGAAACGG AGTACACAAA GGTGGTTGCT
TTTATCCCGA CTGGATGGGC TGATGGATCG AAGTGGAACA AGAAAAACGC TACTTCAAAA
TTTGACTGTA AAGGAATTGA GGTCGAGATA CGACTAATAA GTTATTCCGA ACATTCAAGT
TTCTCAGAGT TAAAAACATT CGTGGAATTT CTTCGCCCCC GCAAGGTTGT ACCGACCGTT
TTCAAAGATG ACAGAGACCG AGTAAAAATC GAAGGTCGAT TTGCGATTGA TTCAGGTCGG
GCAAAGCAAT CTTTCTTCAG CACCATGACA TCAAAGTCAT CAAACATTGG CAAGCAGGTA
TTGCCTGGCG CAGCGACACG ACTAGACCCG TTTCCGAAAC GGCCGAAATT GAATATGAGT
CGCTCCTCGC CTCTGAAACA GAAAGAAGGG CACGTCGAAA AATTAGCATC GATGAGATTG
ATGGGATTCA GTGCGGATGC TGCAAACGGG GCACTAGTCG AAAGTAAGGG AAACATCGAG
GAAGCCGTCG GATTGCTCAT TACTGGAAAG ACAAAGCTTC CACCGACAGC CGAAGTAATC
GACTTGAGCG GGACAAATGA TGTTGTAGAA AGTAAGAGCG TCCCGGCCAC TCCATGTGAA
AAGTAA
 
Protein sequence
MNSSSVCHEI PDCPRLFVDA FRGAYLQKTR NPDNVFILSH WHGDHYGSLP RDGKYQGPSL 
IHCTPTTAAL LREIHQVPEK YVVEHGYGET WLFHPLGNTK GLSTVQITFY DANHCPGAAI
IVVEMADGKV HLHTGDMRYH TMMNVYPILE RAASSRAIDT VLLDTTYSDP KHNFQPQEAA
IDAIAAYSEG LLGTSRKCCS NVLILLSCYS IGKEKVLWEV SSRTNQLVYV NDRKMRMMRC
IQKHHESSSQ IVQRCTTDPN ATDIHVIPMG LAGELWPYFQ PNYWACAEYA KALETEYTKV
VAFIPTGWAD GSKWNKKNAT SKFDCKGIEV EIRLISYSEH SSFSELKTFV EFLRPRKVVP
TVFKDDRDRV KIEGRFAIDS GRAKQSFFST MTSKSSNIGK QVLPGAATRL DPFPKRPKLN
MSRSSPLKQK EGHVEKLASM RLMGFSADAA NGALVESKGN IEEAVGLLIT GKTKLPPTAE
VIDLSGTNDV VESKSVPATP CEK
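
Consistency check

The coordinates and lengths in the Gene Information fields can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based and inclusive, which the record does not state. The function names are hypothetical. The 54-base leader is read off the listed gene sequence, where the first codon matching the protein (ATG AAT TCG ..., i.e. M N S ...) begins at base 55.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(384943, 386508)

# Bases in the listed gene sequence that precede the ATG start codon
# (read off the sequence itself; not a field in the record).
leader_bp = 54

residues = protein_length(length - leader_bp)
print(length, residues)  # 1566 503
```

Both values agree with the Gene Length and Protein Length fields, and the listed gene sequence ends in the stop codon TAA.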