Gene PHATRDRAFT_47913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47913 
Symbol 
ID7203174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp417262 
End bp418453 
Gene Length1192 bp 
Protein Length315 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182389 
Protein GI219124182 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTTCTAGGCA TCTCTAGAAG GAGCCCTTTG TTCTTTAAAT TATTGTGTAC TATCCCCTCT 
AGTCGCCACC CCTATTATTC CTGCTAGCGC ACTTTTTCAG TTCACTTTAC TGTTAAAGCG
AGTGCCTTAG GCCAACAAGT AGAGCCCGCG CAACGCAGTG GTCGTATTCA ACTCTGTTTT
TGCTCCATTG ATTTGGTTCT ATACTGGCAT TTTGCTTATC GCTATGTCTC GTCTTGGATT
TTCGTGTAAT GAGCGTGATG TGCTAATGGG TCGTGGTAAG CGAGTTTCGG AATGGCCGGG
CAACATTTAT TTTCGTCAAG TTGTGAACAA ATACCGCGAG CGATACGCGA AATCTGCACG
GACTGTGAAA GTCACCATTG CGCAAACAGT AATTGACGAA ATCCACCAAG TGAACGGTCG
ATTCTTGAAA GAGGAAAGAG ACGGTAGCTG GAACGAAGTT GAAGCGGACA GGACAGTGGA
AAAAACCTGC CAGGCGCTCC GCGAGAAAGA GAAGTCGAAT CCGGCCCCGA TGAGCCCTTT
CGTGGACACT GCTGGTCATC CTCAGCGCAA AAGAGCCAAG CTGCAGCCAG CGCTCAAGAG
ACCTCCTTCG TCCGACGAAA ACTCGGTTGA AAGTATTGGG GACGACACGG CAACCGAGTC
AGACGAAAGC AGCGAAGACG AAACGGAGAC GGAAAGTGAG GAAGAAGAGG TCAATGTGTG
CCCCCCAAAA CGAGCTGCAT CGAAAGGTCC AACGCCTCTT TCAACTGTGA AGCAAAAACC
TAATGAAGAA TGGCTGGAAC AAGTCCAGAA GTATGCTATT AAATATAAAC ACCTAGCCGT
CCCTCCTGGA TGGTCGGAAA ACGTCAAGTT TGCCGATTGG TGTGTTGGTA TGAGGCTTCT
CAGACGTGAA CTTGACTTGG GATACAGGAG AGTAAGCACG GCCGAAAGAG ATATTTTGGG
GGATCTGGAG GAAAGGGGTT TTGTCTGGGA TTATGAAGCC TGGCACTGGG AGAAGCGATA
CAAAGAGCTG CAAGAAACTT TGAACATGGG GCCACACGAA AATCTGAAAG ACACTACTTT
GCATTGGTTG GACAACCAAA GGCGACTCGG CCGAGCAAGC ATCCCAACCG ATCGACTAGA
AAAATTGCAG AAGTTAGGAA TATACCTTTA AATGCGGCAA ACGTTATACG AC
 
Protein sequence
MSRLGFSCNE RDVLMGRGKR VSEWPGNIYF RQVVNKYRER YAKSARTVKV TIAQTVIDEI 
HQVNGRFLKE ERDGSWNEVE ADRTVEKTCQ ALREKEKSNP APMSPFVDTA GHPQRKRAKL
QPALKRPPSS DENSVESIGD DTATESDESS EDETETESEE EEVNVCPPKR AASKGPTPLS
TVKQKPNEEW LEQVQKYAIK YKHLAVPPGW SENVKFADWC VGMRLLRREL DLGYRRVSTA
ERDILGDLEE RGFVWDYEAW HWEKRYKELQ ETLNMGPHEN LKDTTLHWLD NQRRLGRASI
PTDRLEKLQK LGIYL