Gene PHATRDRAFT_47694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47694 
Symbol 
ID7202702 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp544795 
End bp545862 
Gene Length1068 bp 
Protein Length355 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181931 
Protein GI219123229 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0222477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTCAAC ACGTCGCTCG CTTTTCGCAA GCTGAACAAA ACAAAGACAC ACAGGGGACC 
GGAGGCGTCC GTCAGGAGGC GACTTCAATG GGTGAAGAAT GGGTCGACCG CGGCGTCGAA
AGTGCCAAAG ATGTTGCCGG AAAAGCGAAG GAGAAAGCCG AGCCTGCCCA AAAGAAGGCC
GAGTCCGCGA TGGAGACCAC AAAAGAACAA GGAGAGGCAG CGTATGAGAA GGCCAAGGAA
CGCGGTGATG AGGCTTATTC GAAGGCGAAA GAAAAGGGCG AGTCATTAGC CGAAGCAGCT
AAAGGAAAAG TAGAGCCAGC AAAAAACAAG GCTGAATCAG CGATGGAAAG TACAAAGGAA
CATGGAAAGG AAGCATACGA TAAGGCTAAA GAACATGCTG AGAACATGAA AGCAAAGGCC
AAAGAAACGG GTCAGTCGGT TGTCGAAACG GCAAACGACA CGGTCGACTC TGCACAAAGA
AAGGCGAAGT CGGCTATGGA GACTACAAAA GAGCACGGGG AGGAAGCGTA CGAGAAGGCC
AAGGACCGCG GTGGCGACGC TTACGAGAAG GCTAGGGAAT ATGGTGAGGA AGCATACATG
AAGGCACAGG AAAAGGGTGA GCCAGTTATG GAGATGGCTC AGGAAAACGC CGAGTGGATG
GCATCAAAGG CCAAGGAAAA GAGCCAAGAA GCCGTAGAAA AGGGAAAACC CTATGTAGCA
CAAGCAGCCG AAACGGTCAA GGAGAAAGGT CATGAAGCAA TGGAAAAGGG CAAGCCTATG
ATGGAGAATG CTACGAAACA GTTTCAACAA AAATCCAAAG AAGTTTACGA AAAGGGTAAG
GAAAAGGCTG CTCCAATGAT GGAATCTGCC CAGGAAAAGA TACATGAAAA AGGTCAAGAA
GCTGCTGCCA AGGCGAAGGA GCTGGGATAT GTAGCTGCCG ACAAGGCCAA AGACCTCGGG
AAGGAGGCTG CCCAGAAGAC AAAGGAAGGT GCTTCGGTCC TTTTCGAAAA AGCCAAGGAA
GCTGCAATCA GTGCAAAAGA CAAGATCAAG GATTCATTGT CCAGCTAA
 
Protein sequence
MVQHVARFSQ AEQNKDTQGT GGVRQEATSM GEEWVDRGVE SAKDVAGKAK EKAEPAQKKA 
ESAMETTKEQ GEAAYEKAKE RGDEAYSKAK EKGESLAEAA KGKVEPAKNK AESAMESTKE
HGKEAYDKAK EHAENMKAKA KETGQSVVET ANDTVDSAQR KAKSAMETTK EHGEEAYEKA
KDRGGDAYEK AREYGEEAYM KAQEKGEPVM EMAQENAEWM ASKAKEKSQE AVEKGKPYVA
QAAETVKEKG HEAMEKGKPM MENATKQFQQ KSKEVYEKGK EKAAPMMESA QEKIHEKGQE
AAAKAKELGY VAADKAKDLG KEAAQKTKEG ASVLFEKAKE AAISAKDKIK DSLSS