Gene PHATRDRAFT_45947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45947 
Symbol 
ID7200829 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp749920 
End bp751656 
Gene Length1737 bp 
Protein Length403 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180306 
Protein GI219119079 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATTCATCAC CGGAACAAGT CCTGATCCTC TTCGGAAGGT AGGGAAGAAG AAGGTAGGCA 
GCGGGATAGA GACCGACAGT CCTTGTGCAA ATACACAACA CCCCTCAAAC ACAAGAGTCG
TTTCCCACAC TCACAACCAA TATCTTTGTC CAACAGTAAG CAAATCCAAC AACACTGTAG
TCGCTGTAGC AACCCGATCG AGGAATTCAT CATACCCTAC ACACTCCATG AGTCTTCCGT
TTGCCCAAAA GGTGTACATC GTGGCGGCCA AGCGTACGCC CTTCGGAGCC TTTGGCGGTG
CACTCAAGTC CGTCAGTGCG ACGGACTTGG GTGCGCACGC GACCAAGACA GCACTGGCGT
CCGGCAACGT GGATCCGTCC CTCGTCGACG CCGTCTACTT TGGCAACGTC ATTCAATCCA
GTCCGGATGC GGCCTACCTC GCCCGACACG TGGGACTGCA AGCCCAGTGT CCAATTGCCA
CACCCGCTCT CACTATCAAC CGACTCTGCG GATCCGGATT CGAAACCGTC GTGCAAGCCG
CCAACGGAAT ACGACTCGGC GAATCGCACG TTGCCGTAGC GGGAGGTACG GAAAACATGT
CCGCCGCGCC CCTTACCTTG GACGGAAACG TGGCGCGATG GGGTGGAGTC AAATTAGGAC
ACGGGATGAA GCTGGGAGAC GCTCTCTGGG ATGGACTCAC CGATAGTCTC GCGCAAACAC
CCATGGGACA GACGGCCGAA AATCTAGCCA CCCAGTACAA CATTTCCCGA GCCGTAAGTA
ATCGTGTAGG AGAATCCGGC ACCTCGTGAG AGCCTCTGGT TCCATACGGG ACGGACCGGC
ACGGTGACAA TGCATACATA CATACACGTT GAGTCTCACC GGCATGAATT TTCGTTCACG
GCAGGAATGC GACGAATACG CCATTCGCAG TCAACAAACG TGGGGTGCGG CACAACAGGC
CGGATTGTTC GACGCCGAAA TGGCTCCCAT GGAATTACCC GGCCGCAAAG GCACCACCAC
GGTCGTGGAC ACGGATGAGC ATCCCCGCGT CGATACCCTG CTCGAAAAGA TTGCCAGCCT
CCGTCCCGTC TTTTCCAAAA CTGGCGTCGT TACGGCCGCC AACGCCTCGG GCATTTGCGA
CGGAGCGGGG GCCGTCATTC TCGCTTCCGA ACAAGCCGTG CTCGAACACA ACCTCACGCC
GCTCTGCCGG GTCGTCTCGT ACGGAATTAC CGGATGTGAA CCTACCGTCA TGGGTATCGG
TCCCGTGGAG GCCATTCGGC AGGCCTTGCA CCGAGCCAAT CTCAAGTTGG CCGACATGGA
TCGGATCGAA ATCAACGAAG CCTTTGCCGC CCAAGTCCTG GCATGTGCCA AGGAACTAGG
CCTCGATTGG GACAAGACTA ATCTGCACGG TGGCGCTATT TCACTCGGAC ATCCCTTGGG
CGCGTCCGGT TCCCGCATCG TCGCTCACCT TGCCCACGAG TTTGCCACCA ATTCCGCGGC
ACAGTACCAC ATTGGCAGTG CCTGCATCGG AGGAGGGCAA GGTATCGCTG TTCTCATGGA
GCGAGTGTAG TTGGCGAATA CGACCGTGTA TGCAAAGTTG ATTGTCAGGA AATTTACCAG
GCATGTCACG GAGTATACAG ACGAACGGGC ACTGTACAGC CCATACTTGG CTAGTAGGTA
GGTAGGTGTT TAATCAAACA CAAAGCTGTA AGTAAAACTG AATGTGAACG TGGATCG
 
Protein sequence
MSLPFAQKVY IVAAKRTPFG AFGGALKSVS ATDLGAHATK TALASGNVDP SLVDAVYFGN 
VIQSSPDAAY LARHVGLQAQ CPIATPALTI NRLCGSGFET VVQAANGIRL GESHVAVAGG
TENMSAAPLT LDGNVARWGG VKLGHGMKLG DALWDGLTDS LAQTPMGQTA ENLATQYNIS
RAECDEYAIR SQQTWGAAQQ AGLFDAEMAP MELPGRKGTT TVVDTDEHPR VDTLLEKIAS
LRPVFSKTGV VTAANASGIC DGAGAVILAS EQAVLEHNLT PLCRVVSYGI TGCEPTVMGI
GPVEAIRQAL HRANLKLADM DRIEINEAFA AQVLACAKEL GLDWDKTNLH GGAISLGHPL
GASGSRIVAH LAHEFATNSA AQYHIGSACI GGGQGIAVLM ERV