Gene PHATRDRAFT_47392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47392 
SymbolMAT1 
ID7202438 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp514658 
End bp515788 
Gene Length1131 bp 
Protein Length294 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181741 
Protein GI219122830 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGAGAGTGTA GCGTCTTGGT GAATAAGCAT TCTTGCTAGT CATGGAAGAG GATCCTCAAG 
ACGATCTTTT TCGATGCGCC TTGTGCGGGA CCGCGGAAGG TGATTCATCC AATCTATCAT
CTCATACATC GCTACAAACG AATGCCACGG TCCGGTGTGG ACATCAATTG TAAGTAGAGT
TACTTTGGCT GTTCTTGGAC AATAGTGTCT ATACCATCAG ACAACGTCAT CATTCCCATT
GTGATGTATT GCCTGGCCAG CCGAAAACAA CAGCCGCAGG GGATTTCAGC ATATACGGAA
AAGACTTGCT CCACGAAAAT ATCCGACACA CTTCTAATCT CGTTGGGCTC TGCTTCCACT
GTCTATTCAC ACAGCTGTAA CTCCTGCATC GATCGAGAAC TCGTCCGCAA GCGTGAATTT
CCCTGCCCTG TATGTCAAAC CCCCGTCAAA CGTGTGACGC TAACCGTCCG GAGTCTAGAC
GATGTCCAGT GCGAAAAGGA CACATCTTGG CGTCGACGGG TACTGAAAGT CTTCAACAAA
ACTGAGCCAG ACTTCTCTTC TCTGCTGGAA TTTAACAATT ATCTGGAACA AGTTGAAGAC
ATGATCTACT CTATTGTCAA CGAAGAGCCA GATGCTGAAG CTTGTAAGGC GAAGATTAAG
GAGTATGAAA ACGCTCACAA GACAGAAATC GTCATTCGGC AATCCCAACG CGCCGATGAG
GAACGTTCCA TCCAGGATCG GATCGCCGCT GAGCAAAGAA GCACCGAGCG ACTCCGGCGT
GAAGCGTTTG ATGAGGAAAA GGCTGTCGCT AACGCTAAGA AGCGTCTAAA GCAGGAAAGT
ACGCAGGTGT TGCTAGGAGA ACGTGAAGAA GTATCGGCGG AGCTGCGACA AGCCCAGATG
CAAGGGTACC GCAATGAGTT AAAGAGGCAG TCGAGAGGTA AAAAAAGCAG CGACTTTGTC
TCGCCACGCG TTCGGGAACC AGCCGATGGT TGGAAAAAGG AAACACTGGA TCGGCAGCTG
TATTTAAAAC GGCAAGCAGC GGGTGGGGGA ATACCGACGG GAAGTATTGC ATCACTGGAA
CGCAACTGGA ACGAAACGGT ACAATCACTT TTTGCCAGAA TGAAAGCCTA A
 
Protein sequence
MEEDPQDDLF RCALCGTAEG DSSNLSSHTS LQTNATVRCG HQFCNSCIDR ELVRKREFPC 
PVCQTPVKRV TLTVRSLDDV QCEKDTSWRR RVLKVFNKTE PDFSSLLEFN NYLEQVEDMI
YSIVNEEPDA EACKAKIKEY ENAHKTEIVI RQSQRADEER SIQDRIAAEQ RSTERLRREA
FDEEKAVANA KKRLKQESTQ VLLGEREEVS AELRQAQMQG YRNELKRQSR GKKSSDFVSP
RVREPADGWK KETLDRQLYL KRQAAGGGIP TGSIASLERN WNETVQSLFA RMKA