Gene PHATRDRAFT_47465 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47465 
Symbol 
ID7202479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp697340 
End bp699559 
Gene Length2220 bp 
Protein Length731 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181785 
Protein GI219122922 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATCGATGTA CGCAAGTTTA GAGTATGTTC TGGGATAGAG AAGCTACCAT CCAGGTCCGA 
GGTGAACCGA CCGACCGGAA GGTCGATGAC CTACATGGTT CCACTGTTAT ACGAGTAGAT
AAAGTTATTG AAAGGCGTCA GCACCTTCCT GTTGAGAGTG TTGAATTGGA GGAGCGTGAG
CAACCTTGTG TTGCAACACC ACTGACCGAA GGCGTTGGTC AGACCTGTCA TTTTAGCTCC
AGTATAAAAA ATCAAAAGAA GGTTCTATCT GATAACGGCG AGACTGGTAT GCCAGTTGAA
GAAGAGCCAA ACTGGTTAGA TATTTCGAAT TGGTTCACTG CCTTAGGGAA ACAAGGGCTA
GAAGACCACC ATCAACCGAA AGCTCGAATT GATTTCATTG AGATTGATAC GAAGGTTGAA
GACCACGAAC TGGTTCAATT ACAAGAAATG TATCTTCCAA CTTGTTTGAG ATGGCTTGAC
CCAACCTTCC AGCCTATCGA CGACCTCGAA GGTGCGACAA CAACAAACGA GCAATTGTCG
GCGGGCGTGT CAACCAGAAT AGGCTTTTCT GACGTACAAT CCGTTACTTT GGCCAATGGC
GAAGGGTCCA TTGCAGATTT GGTAGCCAGC AATTCTTTCG ATCTTGGAAA AAATGAAGAT
TTTCCCGAAA GAGTAAAACC TACCTTCGAC GTCGCAAACA AACTGGCCGA TACCAGCAAG
TCAGAGGCGA CTGCTACTAT CAGAACGAGC GACAATCGTC TTTGTTTGGC TAAAGGAAAA
AAAGCGATGT CGGAGGTGCA CAATGAGGAG CGCCGACAAA TTCTTGTCAA GGAATTGTTA
TCATCGATAT CCACTTACGG TCGCTACGAT CCTCGTGTCG CCGACGCCAG CGCTTCACTA
GGTGATCATC TCGATGAGTC AGGTGAGCAC AAGCAATCAC TCAAGCTTTA CCGAGACGCT
GTATCGATAT ATAGTTCAAA GCTCGGTGAT GACCACAAGA AGACAATGGA TGCCCGTGTC
AAGCTCGGTA GGATCCTTGA GCACGCTGGG GAATACAACG AAGCCATCAA CACGTACTAC
CTCGCCACAG TCATGCGCAA AGCGGTGCGC GGAGAAAAAG ACCCTGCAGC AGCGGATTCT
ATCGTCTGCA TTGCGCATAC TTTACGAAAA AAGGGCGACT ACCACCAAGC CATCAAGGAA
TTAAAGCGCT CCCTCAAAAT ATACCGTGAA TCCCTGGGAG ATGCCCATCC GAAAGTCTCT
AGTGCTGTGG ATGAGATTGC CTCATTGTAT GTTACACTAG GAGACTTCGA CAAATCTGCT
GCGATTCTCG AGGAGGTTGT CAAACTCAAG GCGGCGACGC TGGGTATGAA TACCAAGGAA
GTAGCTTCTA CGTTGATCAG CCTAGCGACG ACTTACGAAT GCTCCGAGCA AGTTGAAAAA
TCCTTGAAGA CATTGAAAAA GGCGTACAAG ATAGAATCTG AGATTGGCGG GTTTTCCTCC
GAGGGAGCCA TCAGTATTCT GAACCGTATC GCCATGCTAT ATGAAGGAAC GGGTGACTAC
AATCGTGCCT CAATAGCTTT CCTCGGTGTG CTTCGTGGAC AGAAGAGTAT CTATGGGGAG
GAACATCTCG TCGTCGGCGA AGCATACTAC AAGCTCGGAT ACTCCCTCCA TCAAATGGGT
CACATCCATA AAGCTCTTAA GTGCATGAAA GAAGCCCTCC CTATTTTTGT GCGTGAAGGC
ACCGAAACCA GTGACGTCGA GCGCATTGCC GAGATTTTGC ACGAGATGGG TCTCATGAAC
AAGGAAATGA AGAACTTTCA CGAATCTACC TGTATGTTCA AACAGGAGCT AGGAATTCGT
CGGAAGATTG GTCAGAGCGA GTTCCCTCTG ATAACCCGTG CATTGAACCA GTTGGGCGTG
GTTGAGTTTG AGATGAAAAA TAGCTCCCGT GCTCTCAAGT ACCTAGTAGA AGCTCTGAGC
ATAATGCAAA AGCACGGTGA TCCAGGTTTG GACTGTGCTG AAGTATTGTA TAACTCTGGG
TTGGTTTTTG AAGTGTGCAA CAACAAGGAC AGAGCTTTGG AGGCTTTTGA AGAATCTGTT
CGCATCTTGA TGAAACTTGG ATTCGAGGGC GTGCACCCAC AGGTGGTGAA GGCTCAAAAC
AAGATTGAGA TGCTTCAAGA TAAGAGAAAG CAGCGAGGAT ATTGGACGCC TGGACAGTGA
 
Protein sequence
MFWDREATIQ VRGEPTDRKV DDLHGSTVIR VDKVIERRQH LPVESVELEE REQPCVATPL 
TEGVGQTCHF SSSIKNQKKV LSDNGETGMP VEEEPNWLDI SNWFTALGKQ GLEDHHQPKA
RIDFIEIDTK VEDHELVQLQ EMYLPTCLRW LDPTFQPIDD LEGATTTNEQ LSAGVSTRIG
FSDVQSVTLA NGEGSIADLV ASNSFDLGKN EDFPERVKPT FDVANKLADT SKSEATATIR
TSDNRLCLAK GKKAMSEVHN EERRQILVKE LLSSISTYGR YDPRVADASA SLGDHLDESG
EHKQSLKLYR DAVSIYSSKL GDDHKKTMDA RVKLGRILEH AGEYNEAINT YYLATVMRKA
VRGEKDPAAA DSIVCIAHTL RKKGDYHQAI KELKRSLKIY RESLGDAHPK VSSAVDEIAS
LYVTLGDFDK SAAILEEVVK LKAATLGMNT KEVASTLISL ATTYECSEQV EKSLKTLKKA
YKIESEIGGF SSEGAISILN RIAMLYEGTG DYNRASIAFL GVLRGQKSIY GEEHLVVGEA
YYKLGYSLHQ MGHIHKALKC MKEALPIFVR EGTETSDVER IAEILHEMGL MNKEMKNFHE
STCMFKQELG IRRKIGQSEF PLITRALNQL GVVEFEMKNS SRALKYLVEA LSIMQKHGDP
GLDCAEVLYN SGLVFEVCNN KDRALEAFEE SVRILMKLGF EGVHPQVVKA QNKIEMLQDK
RKQRGYWTPG Q