Gene PHATRDRAFT_44688 details

Gene Information

Locus tag: PHATRDRAFT_44688
Symbol:
ID: 7197894
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 1266336
End bp: 1268236
Gene Length: 1901 bp
Protein Length: 539 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002178398
Protein GI: 219115205
COG category:
COG ID:
TIGRFAM ID:
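
The Start bp, End bp, and Gene Length fields agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check, with an illustrative function name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included.

    Assumes 1-based, inclusive coordinates; the record does not state this.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
gene_length = span_length(1266336, 1268236)
print(gene_length)  # 1901, matching the Gene Length field
```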


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACCCC CGATCCCCAC AACAACAACT CGTGGTCGTC GTGAATCCGC ACCGGATTGG 
GTCTTGACGA ACTACGACGT TGACACTTCC GTTCCCCCGG ACGTCGAGAC GGAACTCCGG
CGATTGCGCG TCCTCCAATC CTACGATATT CTCGATCGCG AGTGGGATAC AGCCTACGAT
CGTTTGGCAC AAATCGCGGC ACGAGCCTTG CAAACCCCCA CGGCCTTGGT CGGTGTGGTC
GATCTCGGTC GGTACTGGCG TGTGGCCCTG CACAACTGCA ACAGCAACAT TCACCATAGT
AACAATCATC ATCACCGTGA TGCTCCACGG GAACTGCCAC GAAAGCAGGC CATAGCGTCA
CACGTGATAC TCCAACGTCA GGGATACTTG GTAGTCATGA ATGTCGGGAA GGATTCGCGT
TTTGTCGATC ATCCCGCCGT CCGCAACGGC ACTTTCCAAT TCTACGCCGG AGTCGTACTC
CGCTCGCCGG AAGGGTACCC TCTCGGTGTG CTCGCAGTTA CCGACACGCA ACCCCGATTG
CACGGCGTCT CCCACGAACA ATTGCAAACC CTGCGAGATT TGGCGGACGC CCTTGTCGAT
TTGATGCACA CGCGGCGGCG ACCGAAAGGA GATACGGGGC CTGCCGCCAT CACCCGTCCT
ACGGCTCCTG CTTCGACTAC TACTACGACT CCTCCGTCCA ATCCACCATC CCCCACCAAC
CACACCAACA CCAAGGATAC CAACAACAGC TCATCCTGGG GAGTGCAACG GTCCGCACGC
TTTTTGCGAC AGTACTTGGC CAAACTCAAT GACGACTCGA CGCTCGAAAA CGTCCTCACC
AAGGACCAAC GCGAACTACT CCGCTCATCC TACGATGCCG CCGCCTTTCT CCACGGCTCC
CTCTTACCTC CCGAACAACG ACAAGAGCTA GTCCGAGAAA ATCGTCCCGA TCAAGTCACC
AGCGTACAGA TCGCTAGTCT GGTACAGAGT GTCGAACTCG CCATGGATGC CTTTCCCAAG
ACCGTGCCCG TCCGCTACCA AATCGACGAC GAACAAATTC CACCCTTCGT CCTTCTCGTC
GAACTGAAAA TATTTCGATC CTGCATCGCC CTCTTGACGA GTGCCTGTGA ACGCACCCGC
CAGGGCGTCG TCTGCTTGCG GGTCTTTGTC CAACAACGCA CCGTCACGCA GAAAGAACTC
GTCTTTGAGT GCGAAGATAC CGGACCCGAT GTGGAACTGG AACAGTACGA CGATTTATTC
GACGCTCCCC TGGACCATAC CGCCGACGTT GGTGAAGAAG ACTGTATCCG GGCAGATCCC
CATACGGGAA AAATTCGCAA GGCACTCCGG TGCGCCACCG TCCCCAACAG TCGCAGGGGA
CATGGCGTAC ACGCTCTGGC CGACTTTATT GGTTCCATCG AGGGCGGTGA CTATGGATTC
AGGCCCCGAG AAACCGAAGA CTTTGAACCA CATGGCACTG GAACGGGGTC CGTCTTTTGG
TTCAGTATCG CCTTGCACAC CCCACCAGCG ACACGACACG GTACGGACGC AGTTGTGCCG
CGACGACCCC AGCCTTGGAT CATCTCCAGT GCACGGGGGC ATGGATCCTT GGCAAAATAG
TAGATTTTCC TATAGTTGCT ACCTAGGTAT TTAACATGTA AGTATTCCGG CTGGTCTTGT
CGTCTGCAGT GGACTTATTC GTCATTTCAT ACCGTCACAG TCGCGTGCTC GGTAACAAAG
CAGCTTGGTG TTTCAGAGCC CGTAGCAATG TTGCGTGCCG ATGCTTTTCT CCAATGTGGG
CTTCGATGCG TTGACGAAGC TGGCGAAGTG ACAACGGGCG AGACCCGATC GCTGCTCTAC
TTCCCTCTTC CAGATTGAGC AGAGTCCCAA AGAAGTCGGT C
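
The gene sequence is listed in space-separated blocks of ten bases, so it has to be joined before it can be measured. The following Python sketch shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are illustrative, and the demonstration input is only the first line of the listing, not the full gene.

```python
def clean_sequence(blocks: str) -> str:
    """Join a whitespace-grouped sequence listing into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing, used as a short demonstration input.
first_line = "ATGGCACCCC CGATCCCCAC AACAACAACT CGTGGTCGTC GTGAATCCGC ACCGGATTGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Run on the full listing, the same two functions give values that can be compared with the Gene Length and GC content fields of the record.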
 
Protein sequence
MAPPIPTTTT RGRRESAPDW VLTNYDVDTS VPPDVETELR RLRVLQSYDI LDREWDTAYD 
RLAQIAARAL QTPTALVGVV DLGRYWRVAL HNCNSNIHHS NNHHHRDAPR ELPRKQAIAS
HVILQRQGYL VVMNVGKDSR FVDHPAVRNG TFQFYAGVVL RSPEGYPLGV LAVTDTQPRL
HGVSHEQLQT LRDLADALVD LMHTRRRPKG DTGPAAITRP TAPASTTTTT PPSNPPSPTN
HTNTKDTNNS SSWGVQRSAR FLRQYLAKLN DDSTLENVLT KDQRELLRSS YDAAAFLHGS
LLPPEQRQEL VRENRPDQVT SVQIASLVQS VELAMDAFPK TVPVRYQIDD EQIPPFVLLV
ELKIFRSCIA LLTSACERTR QGVVCLRVFV QQRTVTQKEL VFECEDTGPD VELEQYDDLF
DAPLDHTADV GEEDCIRADP HTGKIRKALR CATVPNSRRG HGVHALADFI GSIEGGDYGF
RPRETEDFEP HGTGTGSVFW FSIALHTPPA TRHGTDAVVP RRPQPWIISS ARGHGSLAK
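
The protein sequence can be related to the gene sequence by translating codons from the first base. The Translation table field of this record is empty, so the sketch below assumes the standard genetic code (NCBI translation table 1). The function name `translate` is illustrative, and the input is only the first 30 bases of the gene sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here because the
# record's Translation table field is empty. Codons are ordered with T, C, A, G
# at each position, first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGGCACCCCCGATCCCCACAACAACAACT"))  # MAPPIPTTTT
```

The ten residues produced match the start of the protein sequence listed above.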