Gene PHATRDRAFT_23582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_23582 
Symbol 
ID7198515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp193925 
End bp196298 
Gene Length2374 bp 
Protein Length764 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184751 
Protein GI219129133 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCCGATGCA TTGCTCGGAG CTGTCCTCGA TACACTGTTT CAGCTCATGC AGGAGAGCCC 
CGAGTCCGCC GCTGGCGCCT TGTTCGATTC CAATCCCGCC TGGCGTCAAG ATCTGGAACA
AGACATGGAC GCTAACGATG ACGATGACGG CTTGGACAGT CCCACGGAAA CAAGTATGGC
GCAGGGTACG CTCGACATGA TTGCCTGCGA ATTACCCAAA AAGTACGTCT GGCCGGCCGC
ACTGTCTCGT TGTATTGATC GCATGAATGC ACACAACGAC GCCAACGCGC GCAAAGCCGG
GGTGGCTGGA CTTGGCGTCA TTGCCGAGGG CTGTTGCGAG CCCCTCACGG CCGCCCTGCC
CACCGTCATG CCCATGGTAT TTGCGGCCGC GCAAGACAGC TCGCCGCAAG TCCGCGAATG
CGCCTGCTTT TGCCTCGGGC AAATCAGTGA ACACTGTCAA CCGGAGATTC TGCAATACAG
CAACCAAATT TTGCCCATTG TCTTTGCCTT GTTGGACGAC CAAGCCGTGA CCGTCCAGGC
CACATCCTGT TACGTGCTGG AAATGTTTTG TGAACGCCTG GAACCGGACG CGGTGCGCCC
GTTGCTGGAT CCCTTGGTGC GCAAACTCGC GCACATGCTT GAGCAAACCA ACAAGCGATC
GGTGCAAGAA ATGGCTGTGG CTGCCTTGGC CGCTACGGCC GTCGCCGCCG AACAGGAATT
TTCCCCCTAC GTCGAAGGCG TAGCCAAACT CATGACGACA CTCATGAGTT TGCAGGATCC
GACTCTGTTC TCCTTGCGTG GTCGGGCTTT GGAATGTATG GGGCACATGG CTATTGCGGT
AGGCAAGGAA AACTTCCGCC CCTATTTTAC GGTGACCATG GAATGTGCCA TGCAAGGTTT
GACCTTGGAA AGCACCGATT TGCAAGAATT CGCCTACGCA GTTTTTGCGA ACTTGGCCAA
AGTCATGAAG GAAGAATTTG CCCCCGCCCT TTCGGATTTG GTCCCGCATT TGATTCAAGT
AGTGGACATG GACGAAGGTC AAGTAGAATC AGCGGGTCAA GATAGCAACG AGGCGTTTAC
CGGTTTGGAC GAATCGGACG ATGAAGGCGA CAACGAGCAG TACGTGTTGC ACGTTCGTAC
TGGCTTGATG GAAGTCAAAA AGGGCGCCAT CACGGCCTTG GGTGAGATGG GTGCCCACTG
CGGTACCGAC TTTTGCCCCT ATTTGGAAGT CTGTATGAAG TCCCTGGAAG AAGCTGCCAG
CAACTGGCAT CCCCTGATCA AGAGCGAAGC GGCCGATGCG ATGCCGTCAA TGATTGTCCC
ATCCATTGCC GCGTACCACA ACGGCGAAAT CTCATGGACG AAGGGTGATG TGACGGGAAG
TAGTCCTATG TCGCCCCACA CGGCAGCGTT GGTTCATTGT GTCTTGAAAC AAGAAATAGT
ATTGATGCAG GACGATGATA AGGGCACGGT GGGCAAAGCA TGCGAAGCGG TTCAATCGGT
GATTGAAATT TGTGGACCCC ACGCCTTGGT GCCGCACTTA AACGAGTGTC TCGGCAATGC
TCATCTACTC TTGACCAAGT CCGCCCCATG TCAGACGGTA GATGCTTTGT ACGGCGAATT
GCCGGACGAT GACGATGACC ACGACGGTAT CATGCAGGCT GTCTGCGATT TGGTAGGCGG
ATTTGGTCGC GTCCTGGGAT CGCAGTTTGC GCAGTATCTG GGCCAGTTCT TACCGGCCAT
TTGCGAATAC GGCAAATCAT CTCGCCCCGC AAGCGATCGG TCAATGGCGG TCGGTTGTTT
GAGTGAAATC GCGCAGGAAT TGGAAAGCTC AGTTCTAGAC TATTGGCCCA CGGTCTTTCT
ACCGGCCATT TTATCCGGCT TGGCCGATGA GGACGACAAC GTCAAGCGCA ACGCTGCTTT
TTGTGCGGGA GTGTGTTGTG AACATTTGAA GGAAGCCATA ACGAGCGATT ACCAGAACAT
TCTGCAACAG CTGGCACCTA TTTTTAACCT AGACCCCAAC GCGACGGATT CTTCGGCGGC
GTGTATCGAC AATGCAGCGG CCGCCGTGGC CCGAATGATC ATGGCGTCCC CCCACCACGT
TCCCTTAGGT CAAGTATTGC CGGTCTTCTG GCGAGCGTTG CCGTTGAAAA CAGACATGAC
GGAAAACGAG ACTGTCTACA CATGCTTACT GGGATTGCTG AGTATGAAGC AACCGGATTT
GATGACGGCG ACCGGTATTT CCGAAGTACG ACGTATTGTC CACGCTGCCT GTCAAGCGGA
GAGTGACGTG AGCGACGAAA TCAAGGCGAA ATTGATACAA GCACAGCAAA CCCTCCAATA
AAACAACACA TAAAAAATGA AAAAATGTCA TCGG
 
Protein sequence
MQESPESAAG ALFDSNPAWR QDLEQDMDAN DDDDGLDSPT ETSMAQGTLD MIACELPKKY 
VWPAALSRCI DRMNAHNDAN ARKAGVAGLG VIAEGCCEPL TAALPTVMPM VFAAAQDSSP
QVRECACFCL GQISEHCQPE ILQYSNQILP IVFALLDDQA VTVQATSCYV LEMFCERLEP
DAVRPLLDPL VRKLAHMLEQ TNKRSVQEMA VAALAATAVA AEQEFSPYVE GVAKLMTTLM
SLQDPTLFSL RGRALECMGH MAIAVGKENF RPYFTVTMEC AMQGLTLEST DLQEFAYAVF
ANLAKVMKEE FAPALSDLVP HLIQVVDMDE GQVESAGQDS NEAFTGLDES DDEGDNEQYV
LHVRTGLMEV KKGAITALGE MGAHCGTDFC PYLEVCMKSL EEAASNWHPL IKSEAADAMP
SMIVPSIAAY HNGEISWTKG DVTGSSPMSP HTAALVHCVL KQEIVLMQDD DKGTVGKACE
AVQSVIEICG PHALVPHLNE CLGNAHLLLT KSAPCQTVDA LYGELPDDDD DHDGIMQAVC
DLVGGFGRVL GSQFAQYLGQ FLPAICEYGK SSRPASDRSM AVGCLSEIAQ ELESSVLDYW
PTVFLPAILS GLADEDDNVK RNAAFCAGVC CEHLKEAITS DYQNILQQLA PIFNLDPNAT
DSSAACIDNA AAAVARMIMA SPHHVPLGQV LPVFWRALPL KTDMTENETV YTCLLGLLSM
KQPDLMTATG ISEVRRIVHA ACQAESDVSD EIKAKLIQAQ QTLQ