Gene PHATRDRAFT_37463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37463 
Symbol 
ID7202372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp201712 
End bp203508 
Gene Length1797 bp 
Protein Length598 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181506 
Protein GI219122343 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT CCGTAGAAAA TCGTAGGAAG ATTCTTTTGA CAAAGACTGG CATTGCAGTC 
GCTGGAGAAC GTCTGATGGT GTCTCTGAGT AAGCTTTTGG TTGGCATCGA GCATCGAACA
GTCGAGAAAA TAAAAATAGA GCGAATTAAT CGATGCTTTG CTCGGGGCAA CCCCGAATCG
GAATTCGACC GTGAGGTCGG CGGCTCCCAA ACCTTTCGGC ACTTGGAAAT TATCCGAGAC
GCTCTCAAAC ACGATCCCAC CCTTCGTTCC TTCGTTGTCG ATAACAATAT GTTGGCAGCG
ACGAATCTTC CGACTAGAAC CGAGGATAGC ATTTGGCTGA ATCTCCCTAC TTGGCAAGCT
CAACTAGGAA TCCAGGATAT TCAATTTTCG TGCTGTACTG TTCAAACAAT CGAACCTACT
GTATTTCTGC AAGGCTTTCC GGAGTGGAGG ATCCATCACG GAAGCGGTCC TCGCGGATTC
GCGTTGGTGT CCAACATAAC AAAAGGAGTG TCCGCTTTCT TTTCTGATCA GCCTCGTCAC
ATTAATACGG AAGGATCGAG ACAGGGAGAG GCCTGTCCCG ATGCGATGAT TCTGCGAGTT
TTTCGATGTG CGCCTCGACC TTCGTTACGG GCCCCGATTG AACATGTGCT CAATCAATCC
CGTCCACCTA ACGTCTTTGA CCTGTGGATT CGTCACCACG AAACGCCAAC CGACCTGCCT
GGACTAGGAT GTTTCAAAAT TGACGCAAAT GCAAAAGCAA CGCTTTTAGA ACTATTGACG
GATGTCGACT TTGTCGAAGA GACCAATCGA GCAGAGATAG AAGTAGAGCC GCATCAGTGC
ATGCCATCAC CCCTGGTTCC AAATGATGTT ACGCCCGGCT CTCCTTTAAT TCGCACGGCC
CCTCCAAATA AATTTTCGAC ATCACTGGTA TTACAAACTG TTTCCAATCC TGGATCTTTG
TTAGCGCCTT CCATGCCCTC GCCAGTATCT ACTCAGCAAG CCATTGTCAA TTTGTTGTCA
AGCAGCGATA GCACGGAGTC TAACATTTCT CTTGAGGATG ATCATAACGT CGGTCATCGC
TGCAATGCTT TTGAAGTCAT AATTTCCCCC GCAGTTAGCT TTGCGGAGAT GGAAGCACTG
GTTGCGAAAG ATAGCGAAGA AAATAGTCAA TTCGGTTCCA TTTTTGTAAA GGGTGAATCC
GAAACACTGG GGACAGCGCC CTGCAATCTT GCTGAGAAAA CAAATTGCGA TCGACACAAG
TGGGCTTTTT GTTTTCCTGA AGAGCTTCAG TTTTCTGACG TACTTAGTCG AGATTGGCTG
CCAAGTGATT TGTCTAACAA GGATTCTCCA CACGACGAAA ACCAACCTGA AGATTTACTG
TTGGACTCTT TTTCAGGTCT CGCAACGTTT AACTCAATAT TCGGAGACCC ATGTACGGTA
GAGGATGCAG AGGATCCTTA CAGAGAGACC TTGAAGATTG ATACAGCTCT CGCCACTCCA
ACAGGGCAGT CTCAATTTGA CGATTATTCT CCGCTTCGGT CGTTACTTGC TGCACTTGCT
TTTGAGGGCG TCTATTGCGA CGAATGCAGC TCAGCTATCA CGGATACACG TCCCATACAA
ATTTATGAGA CTCGGAACCT ACCGGGTCCG GCAGAGACGT CTAGGATGTT GACAATGCAG
CGAGAGAGTT TTAGTATCAG CAACACAAAG GAACAAATAG GAGACGTATC ATACACAGCA
GACGGTAGAG AAAACGTAGA GCCCTGCATT GAGGAAAACA TGCGGTGCCA TTGGTAA
 
Protein sequence
MKKSVENRRK ILLTKTGIAV AGERLMVSLS KLLVGIEHRT VEKIKIERIN RCFARGNPES 
EFDREVGGSQ TFRHLEIIRD ALKHDPTLRS FVVDNNMLAA TNLPTRTEDS IWLNLPTWQA
QLGIQDIQFS CCTVQTIEPT VFLQGFPEWR IHHGSGPRGF ALVSNITKGV SAFFSDQPRH
INTEGSRQGE ACPDAMILRV FRCAPRPSLR APIEHVLNQS RPPNVFDLWI RHHETPTDLP
GLGCFKIDAN AKATLLELLT DVDFVEETNR AEIEVEPHQC MPSPLVPNDV TPGSPLIRTA
PPNKFSTSLV LQTVSNPGSL LAPSMPSPVS TQQAIVNLLS SSDSTESNIS LEDDHNVGHR
CNAFEVIISP AVSFAEMEAL VAKDSEENSQ FGSIFVKGES ETLGTAPCNL AEKTNCDRHK
WAFCFPEELQ FSDVLSRDWL PSDLSNKDSP HDENQPEDLL LDSFSGLATF NSIFGDPCTV
EDAEDPYRET LKIDTALATP TGQSQFDDYS PLRSLLAALA FEGVYCDECS SAITDTRPIQ
IYETRNLPGP AETSRMLTMQ RESFSISNTK EQIGDVSYTA DGRENVEPCI EENMRCHW