Gene PHATRDRAFT_50914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50914 
Symbol 
ID7200874 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp13894 
End bp15200 
Gene Length1307 bp 
Protein Length388 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179957 
Protein GI219118365 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATC TTCAGATACT TTCCAAGTTC ACCGTTGGTG GACAGGAGCT GCAGAACCGT 
GTCGTTCTGG CCCCTTTGAC CCGCGCTCGG TAAGCGGCAA GAACGAACTA TTTTGTGCCC
AGTGTCGAAC ACGATTACTC ACTGTACTCG TGTTTCAATT CTATAGCTGC ACACCTACCG
AAGATCCGCT CGATACCGTC TCCCGGACAC CGAACGACCT CATGGCGACT TACTATGAAC
AACGTGCGTC GGCGGGTCTC ATCATTACGG AAGCCACTGC CGTTTCTGAA GAGGGCTACG
GCTGGCTCAA CAGTCCAGAG CTTCGTACCG AAGCACAAAT GGAAGCATGG AAAAAGATCG
TGGATAAGGT ACACGCCAAG GGATCCAAGA TTTATGTCCA ATTTTGGAAT ATGGGTCGAC
AAGCCCATTC GTCTTTTCAC GTCGAATCCC AACGCGTAGT TTCGGCGTCC GACATTCCCA
TGGCCGACTC TTTCAAGGTC AAGTCATCAA CCTTTGAAGA TGTACCGCCC GAAACACCCG
TTCCCTTGAC GGTGGACGAG ATTCAAAGTG TGGTCGCAGA TTTCGTACAT GGTGCCAAAC
TCGCTCGTCA GGCAGGCTTT GACGGAATCG AGATCCACTC CGCCAACGGA TATTTGATTG
ATCAATTCTT GCAGTCCAAG ACCAACAAAC GCGCGGACCA ATACGGCGGA AGCATGGAAA
ACCGCTTTCG CTTTTTGAAG GAAATTGTGC AAGGTATCGT GGACAGCGGA GCCTACCCCT
CGAATCGGAT TGGCTTTCGA ATCTCGCCCA ACGGAGTCTT TGGAGACATG GGTAGTGAGG
ACAACGCCCA GATGTTTACC TTTGTGGCGG CCGAAATGAG TAAACTCAAG GTGGCCTACC
TGCATCTTAT GGATGGTCTC GGCTTTGGAT ACCATGGATT ATGTCCGGCA GTTACGGCTG
CCGATATCCG TAAAGTCTTT GACGGTCCCA TTATTTGCAA CGTTGGACTT ACGAAAGAAA
TTGCCGAAGG GATGATTCGC TCGGGTGCCG CTGATCTGGC CTGCTTTGGA CGTTTGTACA
TTAGCAATCC CGATCTGGTC GAACGTTTCG CCAATGACTG GCCTCTAGAA CCTGAAGCTG
CTTATCAGCA CTGGTGGCAA CACGTTGGCG CCAAAGGTTA CACCGATTGG CCAACGTACA
AGCCATCCGA GGAAGATAGC GACGACGCTC AGAACGACGA GTAGGCTGCT ACTACAAGAG
CTCGAAATTT CGTGAATGTG CCACGTTCCT GCACTTTGCG GTTGGGT
 
Protein sequence
MSNLQILSKF TVGGQELQNR VVLAPLTRAR CTPTEDPLDT VSRTPNDLMA TYYEQRASAG 
LIITEATAVS EEGYGWLNSP ELRTEAQMEA WKKIVDKVHA KGSKIYVQFW NMGRQAHSSF
HVESQRVVSA SDIPMADSFK VKSSTFEDVP PETPVPLTVD EIQSVVADFV HGAKLARQAG
FDGIEIHSAN GYLIDQFLQS KTNKRADQYG GSMENRFRFL KEIVQGIVDS GAYPSNRIGF
RISPNGVFGD MGSEDNAQMF TFVAAEMSKL KVAYLHLMDG LGFGYHGLCP AVTAADIRKV
FDGPIICNVG LTKEIAEGMI RSGAADLACF GRLYISNPDL VERFANDWPL EPEAAYQHWW
QHVGAKGYTD WPTYKPSEED SDDAQNDE