Gene PHATRDRAFT_47461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47461 
Symbol 
ID7202578 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp678233 
End bp679629 
Gene Length1397 bp 
Protein Length332 aa 
Translation table 
GC content58% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181783 
Protein GI219122918 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGATACCTC CTCCCCGTGG TCTCTCTCAC CACCGTTTAC CGGTCCTGTG TTGGTGTGGT 
CACTCACTCA CTCACTCATT ACCTCATCGT CTGCTTACTT CCGCATTGTG CGATACACTT
TTAAACAAGT CACATCATGT CTGTCGAGTT CCCTGTCTAT CAGATTGACT TTTCGGCGGT
AGCTGCGGGA AAACGCGTGG CCGCTACGAA ACGGCGTATT CGATGGTACG TACCGTACCG
TACACTACCG TACCCGACCC CTACGGTATG GGACGGTGCG AATCTTGGTG GAAGGATTCC
GTGATGTGTC GTGGATTGGA TGTTTTCGTA GTAATCATCC CGCATTACCA TGTTGGTGTT
GTGGTTGTTG TTGTGGTCGA TACAGCGTCT CACGAAAACG GCATCTTGCG ACGCATTTCT
GACTTTCTCT TGTTCTTCTT CTTCTAGGCG TTTTGGTTTC CCCAATCAAG CCGCGCTCGC
CGACGGTAAG ACGGGGATCG AATGTCGAGG TGAAGAGCAC GAAATTGTGA TTGTTTGGTC
CGTGACTTCC GGCAAGCGGC AAATCATCAT GGACGGTCGC GAAGTGCACT TTTCCAACAC
CCGAACGTCG CTCATGGATC ACACTTGGTC CGGAAAGGGC AATCACGTCA TGAAGGTCCT
CTGCCACGCA TCGGCTCCCA TGTCGGCCAA CCCCGGATTC CGTCAGTACG ACTTTTTCAT
TGACGGACAA TCCTTCTTCC GCATGCCCAA GGTCTACGAG CTCGGTGTCC GACCAGGCTC
GAGCGCCAGT CCCCGCGGTG GAGGCTACGA CGGCGGATAC GGACCTCCTC CGCCCCGCGC
TCCCGCCGTG CGTTCGCCCT CGACGCTTTC GCAAGAAGAC GCCGAGCTAC AAGCCGCCAT
TAACGCCTCT CTGGAGGAAT CACGGCGTCG TCTGGGGCCA CGCGCCGGGA GTAGTGGAGG
CGGATCCCTG GCCCCGCCCG CGGCCGATCT GCTCGATATG GGAGCGGAAC CGTCGCCGGC
TCCTCCCGAG GCCTACTCGC AGTCCAGTTA CGACCAGGGT CAAGGGGGTC CGCCGCCGCC
GCCGAATTAC GGCGCACAGT CGACACAACC ATCCTACTCG TACGGAGGTA CCACGGTCGC
ACCCGGGGCG AACAGTAACC AACAGTTCTT GGCCTTGCCA TCGTCCAGTA ACTATCCGCC
CCCGCAACAG CAACAGTACA CTCAGGGACC AACATCACCT CCGCAACAGC AACAATACAA
CCAAGGCCCT CCACCTCCGC AACAGCAATA TTACGACCAG TCGTATGGAC AGCAATCCTT
CCAGTCTTCT CAGGGATACG GTTCGCCGGG GCCGTCCCCC AACGGAGGTG GAGATCCGTT
GGGTTTGCAT ACGGCGG
 
Protein sequence
MSVEFPVYQI DFSAVAAGKR VAATKRRIRW RFGFPNQAAL ADGKTGIECR GEEHEIVIVW 
SVTSGKRQII MDGREVHFSN TRTSLMDHTW SGKGNHVMKV LCHASAPMSA NPGFRQYDFF
IDGQSFFRMP KVYELGVRPG SSASPRAPAV RSPSTLSQED AELQAAINAS LEESRRRLGP
RAGSSGGGSL APPAADLLDM GAEPSPAPPE AYSQSSYDQG QGGPPPPPNY GAQSTQPSYS
YGGTTVAPGA NSNQQFLALP SSSNYPPPQQ QQYTQGPTSP PQQQQYNQGP PPPQQQYYDQ
SYGQQSFQSS QGYGSPGPSP NGGGDPLGLH TA