Gene PHATRDRAFT_43744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43744 
Symbol 
ID7197265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp1353587 
End bp1354741 
Gene Length1155 bp 
Protein Length269 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177812 
Protein GI219112121 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAGTGTTTT TTTCACAAAC GCCAACGAAC TCTCTCGTAC ATCACACTAT CAACATCTTA 
AATTCTCTCT TTTCGCAACC ATGAGTATGC AAGTTGGGCA CCTAATTGAC GCTGCCGCTG
ATGCGGCTGC TGGAGCTCTC ACGTCGCTCC CGAACCTTCT CAAGCGTAAA AGCGAAGAAG
GCGCAGAGCA ATCCGAAGCC AAAATGTCCA AGACAGAAGC CCCTTAAACT GCCGATGAAG
ACAAGGCGAA CAAACGATCC CCCATGCCAA AGGCTCGTGA AATCCGACTT GAACAGAATC
GCAAAGCTGC TCGGGAATCG CGTCGGCGGA AGAAGGTTAT GATTGAAGAA CTTCAACGCA
GCGTGATTTT CTTCTCGCGT GCCAACGGAA CCCTCAAACA ACAAAATGAT GAGCTGACAC
GACTTTTAAT GCAAGCTCAG ACTCAAGTCA CGGTCTCTAG CACTGCTTCG AACAGCACTC
TAAGTTCAGA CCAACCGAAT GATCAGTCGC ATCAAGTGAA TCGCAAGACC GAAAATGCTG
AAAAGACGAA TAGCGAGCAA GTGCAGGCTC AAGCTGTAGC GACTCAAGCC GTCTACGAAA
GCCAAGGATT CCCGGCTGCC GCTGCGCGTG CCGCGGCTTT AACTATGAGT GGCAATAACT
TGGCCCCCAA TACCGCACCC GCGCCTGTCA ACACGATCCA ACAAGCACTT CCCGCAATGC
AACCTGGTGC CACCATGCAA GCCATGGCCA ACTTTCAGCA AGCCGCTGCC GCTGCTATGC
AGTCCGCTAT GGGACAAATG CAATCCATCC CAGGTGTCAA TATGAGTCAG CTTGCGGCTG
CTCCCGTCGG TGCCAACGCT CAACAGGCAT ACACAGACAC CATGACGGCT TTGGCTATGC
AGCAAGCAGC AGCCGCGGCT GCGGCGTCCG GGCAGCAGTT TGTAATGGCG GGCGGTGTTC
CGTTTATGCA TCCCATGTTG GCTTGGCAGC AGCAAGTACA GAACCAAGCT TCGCCGCCTG
TCATTACACA ACAACAGATG GCCGCGAATT CAACTCCAAA GCAAGACAAC TGAATTTCAT
TCAGTGGAGG GGCCAAATGA GCTTGATCTG TGTTACTGTA TAACTACTTA TAATATCAAC
CAATGTAAAG GTTCC
 
Protein sequence
MPKAREIRLE QNRKAARESR RRKKVMIEEL QRSVIFFSRA NGTLKQQNDE LTRLLMQAQT 
QVTVSSTASN STLSSDQPND QSHQVNRKTE NAEKTNSEQV QAQAVATQAV YESQGFPAAA
ARAAALTMSG NNLAPNTAPA PVNTIQQALP AMQPGATMQA MANFQQAAAA AMQSAMGQMQ
SIPGVNMSQL AAAPVGANAQ QAYTDTMTAL AMQQAAAAAA ASGQQFVMAG GVPFMHPMLA
WQQQVQNQAS PPVITQQQMA ANSTPKQDN