Gene PHATRDRAFT_47097 details

Gene Information

Locus tag: PHATRDRAFT_47097
Symbol:
ID: 7202172
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 391708
End bp: 392892
Gene Length: 1185 bp
Protein Length: 287 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002181200
Protein GI: 219121702
COG category:
COG ID:
TIGRFAM ID:
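
The record does not say how the derived fields were computed. As a minimal sketch, assuming the start and end positions are 1-based and inclusive, and assuming GC content means the share of G and C among all bases, the values could be checked as follows. The function names gene_length and gc_percent are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks.

    Assumes GC content is defined over every base in the sequence.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates from the record: 392892 - 391708 + 1
print(gene_length(391708, 392892))  # 1185, matching the Gene Length field
```

Passing the gene sequence from the Sequence section (blocks of ten bases, spaces included) to gc_percent gives a value to compare against the reported GC content, which the record rounds to a whole percent.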


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGTCCA TGAAAGCATC ATTGGCTTTT CTGTGGCTTC TTCTTGCCTC TAATAGATTT 
TGTGTCCAAT CGTTTCGCCC GGATTGGTCG AATCACCAAA CGAGACTTTC TGGAAGCTGC
GTCTTCAGTG CGAAAAATAT TGACCGTCCA AATACGACCG ACAACAAGGC CATGGCGTTT
CTTCGAAGAA TGGGTCGCGT AGGGGGCAAT CGTGATTTTA CGCACGCAAT CGGAATCGAC
GAAGGTCCTT CAACCAAGTC CACCGGAAGT GGGAAAAAAG TGAGTGCAAT GGATAAGCAA
CGCCGCCGTG TGCCTGATCT GGTTGATTCA GATCCTATCT CAATACTTCG CTGGGTTGCC
ACCCTCTCTC TTCCAGCTGC AGAAAAAGAA AGCCGCCTTT CAGTCTTGTG CACTTACAGG
TGTCATTGAT GATCTCTCAG AACCGTTTCC AACCACCAGT TCCGGGTAAG CCGAAATCTA
GTAATGAGTG TGCCTAACGT TGGTCGTGTG CACGCCGCAT CTACTGACAT ACTAAAGAGA
CACCTACTAA CTTCAAATTG ATTCTACAAA CAGAACACAG TGGGCTGGAT ACACAGATCA
AGTAATGGGC GGTGTATCAA CTGGGCATCT TTGTCGGGAA GATTTTGACG GACGAACGTC
TAACGTTTTG CGTGGCAAAG TCAGTTTGCG CAATAACGGC GGCTTTATCC AAATGGCGAC
AAATCTGGCG CACGACGCAA AAGACTCTAG GCTCGTTGAT GCGTCCTCCT TTGACGGCAT
AGAAGTAGAT GTACAGTATC AAGGAGAACA AGAGGAAGAA ACATTCAACA TACAGTACGT
ACGAAGCGAT CGGTCATTTT TTCGTAAACA TTCCGATTTC CTCACGCCTA TCCATGGCTG
CTCTTATTTC TCCAGCTTGA AAAATGTTTG CTGCCCGCTC CCGTATAGCT CATACCGTGC
ACGGTTTTCC GTTCCGAAAG GATCCTGGAT GACAGCTCGG GTTCCATGGA CAGACTTTCG
CGGACACGGG CCGGGTGCTT CCGATATTCC GTTCTCTTCT AATTCGCTGA CAAGAGCCGG
GATTGTGGCT ATTGGTAAAG AAATGGAAGT GCTGCTAGCT GTTTCTGGCC TCCGTTTCTA
CAGAGAAAAA AATTAAACCC ATTGTAGAAA AAGTTTGTTT ATTTC
 
Protein sequence
MVSMKASLAF LWLLLASNRF CVQSFRPDWS NHQTRLSGSC VFSAKNIDRP NTTDNKAMAF 
LRRMGRVGGN RDFTHAIGID EGPSTKSTGS GKKKKKAAFQ SCALTGVIDD LSEPFPTTSS
GTQWAGYTDQ VMGGVSTGHL CREDFDGRTS NVLRGKVSLR NNGGFIQMAT NLAHDAKDSR
LVDASSFDGI EVDVQYQGEQ EEETFNIHLK NVCCPLPYSS YRARFSVPKG SWMTARVPWT
DFRGHGPGAS DIPFSSNSLT RAGIVAIGKE MEVLLAVSGL RFYREKN