Gene PHATRDRAFT_49937 details

Gene Information

Locus tag: PHATRDRAFT_49937
Symbol:
ID: 7198541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011693
Strand:
Start bp: 350567
End bp: 352111
Gene Length: 1545 bp
Protein Length: 428 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002184787
Protein GI: 219129208
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAGTATTCAC TGGTGCTTGT TCTGCAAGGG AACGGCTTCC CAAATACTCT CACCTGTCCG 
ACTGTCTCGC AAAGGTAGGA CTAGATACTG TTGCTGCTAC TAGGACTGAT CCTGTGAATT
GTTTGTCGAC AATGCCGTCA TATATATTGT TTCTTTCGTT TTCATATATC TGTCAATCCC
TCTTTTTAAC ATTTTGCTCT GCTTCCCATG TCATTCTCCT TTCCGGACGG AAGTAGTTCC
TTCCATTCCC TGCGCAACAT GCAACCAAAC TATAACATAG GAGGTGGTGA TGCGGCTCAC
TGGAACGGCG GACACGGAGA CGAGAAGTCC GCAATGTCGA ACGGCTACAA CGGCAGCGGA
GGTCTGGCTT CTTTCCAACA ACAGCAGCAA CAAGCGTCCT TTCAACAACA CCAGGCCAAC
GCACTCGGAG GGTACGGTGC CGGAGCTGCC TCGCAGTTCC CCATGTGGTC ACAGATGCCG
TCGGGCAGCG ACGGCTATGC CGGAGGTGGA GCGGACCCCT TCCATCCCTA CATGCCGCAA
CAGCCCACGC CAGACATGAT GCAGCAAATG TCGGGCCTGA CGAACGGCGG TTACGGCGGC
GGGTATCCTG GACAGGACAT GCAACAGATG ATGTTCCAGC AAATGATGAA TTCGCAAGCG
TTTGCGGCCC AGCAAACCGG CGGCTACGGA GGAATGCCCG GATCGAATCC GTCCATGGCG
GCATACTATG GCCTGCCTAC GCAGAACGCG TCGTCCATGG CCCATGCTGC GGCTTCCTCG
TCCGCTTCGG CCGGATTGGA CCGCAAGTCC TCGCGACCTA AAAAGAATAA GGACAAACCG
AAGCGTCCCC TTTCGGCCTA CAATCTTTTC TTCAAGGATG AGCGCCTGCG AATGCTTTCC
GCGATTCCTG ACAAGAAGGT AGAAGACAAA GAGCAGACCT CGGATGACGA CACCAAGAAG
GAGGACGACG ACAACGACGA TCCCAAGGCA ACCAACGAAA AGGAAGACAA AATCAAAGCC
GAAAAAGGCA AGGCCGACAC GGAGAAGGCG ACCGATGCAA AGGAAGACGA AGACACATCC
GAGGATATCA AGGCTGATAC AGACAAGGCG ACCGATGTAA AGGAAGACGA AGGCAACAAG
CAAGACTCTG TCAACGAGGA AACCGACAGC AAGGCCTCTC TCGAAAAAAA TGAAGACAAG
GACAGTTCCA AGGAAACGAA AGGGAAAGAA GAAGAGAAGA GCGGTACCGC AGCGAAAGGG
GAGGACGAGT CCAAAAACGG TAAACGCAAG CGAGAACCTC ACGGCAAGAT CAGCTTCGAA
GCAATGGCCA AGGCCATCGG TGCCAGTTGG AAGGCGATAG ATCCGGAATT ACTCGACACT
TACAAGGCTA GAGCTGCAGT AGATATGCAG CGCTACAAAA AGGAAATGGA AGAATTTCTT
ATCAAACAAC GCCAAGGTTT GGAAGAAAGT CGCGACCAGC TAGAAACCTC TGTCGATCCT
ATGGCAAAGA TGCGCTACTT CTCCACTAGC AACACTGGTA TGTGA
 
Protein sequence
MQPNYNIGGG DAAHWNGGHG DEKSAMSNGY NGSGGLASFQ QQQQQASFQQ HQANALGGYG 
AGAASQFPMW SQMPSGSDGY AGGGADPFHP YMPQQPTPDM MQQMSGLTNG GYGGGYPGQD
MQQMMFQQMM NSQAFAAQQT GGYGGMPGSN PSMAAYYGLP TQNASSMAHA AASSSASAGL
DRKSSRPKKN KDKPKRPLSA YNLFFKDERL RMLSAIPDKK VEDKEQTSDD DTKKEDDDND
DPKATNEKED KIKAEKGKAD TEKATDAKED EDTSEDIKAD TDKATDVKED EGNKQDSVNE
ETDSKASLEK NEDKDSSKET KGKEEEKSGT AAKGEDESKN GKRKREPHGK ISFEAMAKAI
GASWKAIDPE LLDTYKARAA VDMQRYKKEM EEFLIKQRQG LEESRDQLET SVDPMAKMRY
FSTSNTGM
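
The lengths reported in the record can be cross-checked against its coordinates and sequence layout. The following is a minimal sketch in Python, not part of the original record; the helper names `span_length` and `join_blocks` are hypothetical, and it assumes that the start and end positions are 1-based and inclusive, which is consistent with the stated gene length.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that split a sequence into 10-letter blocks."""
    return "".join(text.split()).upper()


# Coordinates as listed in the record (Start bp, End bp).
gene_length = span_length(350567, 352111)

# The protein is printed as 7 full lines of 60 residues plus a final short line.
last_protein_line = join_blocks("FSTSNTGM")
protein_length = 7 * 60 + len(last_protein_line)

# Nucleotides needed to encode the protein: one codon per residue plus a stop codon.
coding_length = 3 * (protein_length + 1)

print(gene_length, protein_length, coding_length)
```

Under these assumptions the coordinate span matches the listed gene length of 1545 bp and the residue count matches the listed protein length of 428 aa. The coding length is shorter than the gene length, which fits the printed gene sequence containing bases ahead of the first ATG of the translated region.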