Gene PHATRDRAFT_50336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50336 
Symbol 
ID7199075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011696 
Strand
Start bp348430 
End bp349764 
Gene Length1335 bp 
Protein Length342 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185263 
Protein GI219130208 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAATCATCTA ATACACAATC GTAACAGGGG AAGGATCATT GACTTTGTCG GATTCGTTAG 
TTGTGAAATC GGCTCACTGT TAATCGTTAT GCGCAACCTT TTTTTCCTCA ACAGACCGCC
GCCCTTGAAC GCACCACCGC GTTTGTCCAG CGAACGCACA TGTTACCACA AGAAACCTTG
CTTCCGCGAA CGATATGACT AGTTACGACG CTGTGGGGGG CCCCGCAGCC TTGCCTTTTG
CGGCCCCCGC CGACGGAACG TCCAAGCCCA AGACTGCCAA GGGAAAGTCC TCGAAGACCA
AGAAACAGAA GCAGTATCGA GCCAAGAAAC CGAAAGATAT GCCTCGCCGT CCACTCAGCG
CGTACAACAT CTTTTTCAAA GAAGAGCGAG CTCGAATGCT CGCGAATGCC AGCGAAAAGG
CAGCGTCAGC TGAAATCGAA GAAAACGAAG GAAATGAACC GGACGCCGCC CCGTCTACAA
AAGGGAAAAT CGGATTCGAG GCCATGGCAA AAACTATTGG TAAACGATGG AAGGAGCTTG
AAGCAGAAAA CCTCGAACGG TACAAGAAGC TCGCCAAGGA AGATATGGAG CGCTACCGAG
TGGAAATGGA CAAGTATCAT CTGGAACTAG CAAAGAAGTC AAGAGTAGAG AGAGAAGAAG
CAGCCAAGCT TGGTCCGATG ATGGGTGCAA CGAATGACCA GATGATTGGT GGGATGCAGG
ACAACCAGAT GGCCATCGCA CAGCAGATGC AAGATCCTTC CATGATGGGA GCCGCACAAC
TCGATCAGTT TCTGCGTGCG CAAATGATGG CTGCCGGCAA CGGATCTCCG AACGCCCAGT
TCTCCGGAGC GCAGATGGGA ATGCCCCCCA ATTTTGGGGG CTTCTACCCT GGCTTCCAAG
GAGCCATGGG CGGAATGCCG CAGAGTTTCG GTGGCGGGGC AGATGGGGGC GGACAACAGC
TCGGTCAGCA GCAGAACTTT TTTCCAAATC CCATGATGCA AGGCATGCCT TTTCAGCAGA
ACCAATTTAT GGGTGGGCAG CAAGAAGCGC TCATGGGACA ATTTGAACAA CAGCAGCAGT
TCTTGATGCA GCAGATGGGG AATCAACAGT TCGGGGGAGC TGGCGCTTTC CCGAACGCCA
GTGGCGGTAA TCAAGGGTAC AATTTTGGAA ACCAAGGCGG CGCTCAATTT GGACAATTCC
AACAGGACGA TCAAAACAAG TGAGCAAAGG TGACCACTTC GGAAAAGTGT CTACGTACAA
GCCATCATCA CCACAGGCAC TCATGGACGC CGACCAACAA ATATGTGTTA AATTAACTAT
TAGTTATCTA TTTGG
 
Protein sequence
MTSYDAVGGP AALPFAAPAD GTSKPKTAKG KSSKTKKQKQ YRAKKPKDMP RRPLSAYNIF 
FKEERARMLA NASEKAASAE IEENEGNEPD AAPSTKGKIG FEAMAKTIGK RWKELEAENL
ERYKKLAKED MERYRVEMDK YHLELAKKSR VEREEAAKLG PMMGATNDQM IGGMQDNQMA
IAQQMQDPSM MGAAQLDQFL RAQMMAAGNG SPNAQFSGAQ MGMPPNFGGF YPGFQGAMGG
MPQSFGGGAD GGGQQLGQQQ NFFPNPMMQG MPFQQNQFMG GQQEALMGQF EQQQQFLMQQ
MGNQQFGGAG AFPNASGGNQ GYNFGNQGGA QFGQFQQDDQ NK