Gene PHATRDRAFT_23694 details


Gene Information

Locus tag: PHATRDRAFT_23694
Symbol:
ID: 7198735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 228949
End bp: 230665
Gene Length: 1717 bp
Protein Length: 521 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002184921
Protein GI: 219129491
COG category:
COG ID:
TIGRFAM ID:
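
The length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp coordinates are 1-based and inclusive (which matches the listed Gene Length) and that the coding region ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 228949
end_bp = 230665
protein_length_aa = 521

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding region is one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not part of the coding region.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)
```

Under these assumptions the gene span is longer than the coding region, so the listed gene sequence would include some flanking, untranslated bases.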


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.728376
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTGGAGATGC TGGTGTATCT CCACAACCCA AACACGGCTT CACCTACCAA ACAGGTACGT 
GCGCGCTGGA GTTTTGGGAG CCCATAAAGG AGCTGGGCGA AGGCAGCATT TCTAGCATTC
ACATGGTGAG GCGACGAGAA AAACGAATCC ACGTTCCGTA CAAAGAGCGC GTTGATATTA
TGAGCTTGGC GAGTAACGGC TCGACCATGA AGAATGACGA CGAACTCTTG ACCCGAAGAA
GCAGCTTTTC TAACAAAGAA TCATTCGCAC TGAAGAGTAT TATCAAAGCC TACGTACAGA
ATGATAAGTT TTTACAAGAA ATGCGCGATG AAATTTACAC CATGAGTCAT TTAGACCATC
CCAACATTGT CAAAGTCTAC GAAGCTTACG AACGAAAGCG CCATATTTAT CTTATTATGG
AGTACTGCCG CGGCGGTGAT TTGTGGGCCC GTCAGTTGAA TGAAACAGCT ACGGCCGCGG
TAGTACGTAA AATCTTGCTA GCCGTATCGT TTCTACACGA TCACAACGTT GTTCACCGAG
ATCTCAAACT AGAAAATATT ATGTTCGATC AACCTGGCCC CAATGCCGAA GTCAAAATTA
TCGATTTTGG ATTAGCCACC CGATACTTGT CGAACGAGCA CAAACAAATG ACTGATCGAG
TTGGGACCTT GTACAGCATG GCGCCACAAG TTCTACAAGG TGTTTATGAT GCCAAATGCG
ATTTGTGGAG TATCGGCGTG ATAACGTACG TTTTGCTGTC AGGAGGTACT CAACCATTTT
GGGGCCCACC GCGGGAAATT CCATGGGATA AACGGAAAAA AATTATGGTA GATCGAATAA
TGCGATGCGA GTACATGCGC ATGAAAGGCA CAACTTGGGA CGGTGTATCA GAGGAAGCTA
AGCGGTTTGT GAAGTCTTTG TTGCAAATCG ATCCCGCCAA ACGACCGTCG GCCAAGGAAG
CGTTGGCTTC GAAGTGGATG AAACTGCACG AAGAGGACAA GCCAGCACTG ATTGCTTGCA
CACCCAAATC TCTTCAACGT GATCAACTGC ACCGATTCGA ACGGCAGCTT CGTATTGTGC
TGACCAACAA ACTCTCCGAA GAGGCATTAA TGAGTTTGAA AGCAGGTTTA GAAAAGCATG
ACGAGACCGA CGAAGGCCGT GTTTCGCTTG AGGTGATGCT TCGGTATTTA TTAGAAAATG
GTTTGGAGCA AATTTCGCTT GCCGCACTGC ACGAGCTGGC CAATGGCGCA GGGAAAGATG
CGATGATAGG CTACACCGAA GTAATTTTCG CTTCCTTAGA ATCCAAAGGA CGTCGTGAAT
CAGAACGTAT GGCAGAAGCT TTGGCGGAAA TGGACATCAA TTCTTCGGGA ATGGTTGCGA
AAGCGCGAGC GTTGTTAATT TTGTACCGCG TCGTACCTGA CCACACCCTC GACATTGTGA
AGGAAACTTT CAACGAGGAC GACACAGACC AGATCTCATG CCAGGTAGTC CTTGATCTGA
TTAGCAAGCA AATGGCTGAT CGAATTCACA CCCTTTCGAG CCATCACGAC TCCTCCGATG
AAGTGAAGAA TCTGGTTGAC GCCAAAAACG CTGTCATCCC AGGTGGGAGA AATGATCCGT
CTGAAAGACC TGAGTATGTC TTTGACGCAT CCACCAATTC GGTTCGAAAA TATGCCGAAA
AACAATGATT TGATCTCGAT GCTTCGGAAG GGTCCAA
 
Protein sequence
MVRRREKRIH VPYKERVDIM SLASNGSTMK NDDELLTRRS SFSNKESFAL KSIIKAYVQN 
DKFLQEMRDE IYTMSHLDHP NIVKVYEAYE RKRHIYLIME YCRGGDLWAR QLNETATAAV
VRKILLAVSF LHDHNVVHRD LKLENIMFDQ PGPNAEVKII DFGLATRYLS NEHKQMTDRV
GTLYSMAPQV LQGVYDAKCD LWSIGVITYV LLSGGTQPFW GPPREIPWDK RKKIMVDRIM
RCEYMRMKGT TWDGVSEEAK RFVKSLLQID PAKRPSAKEA LASKWMKLHE EDKPALIACT
PKSLQRDQLH RFERQLRIVL TNKLSEEALM SLKAGLEKHD ETDEGRVSLE VMLRYLLENG
LEQISLAALH ELANGAGKDA MIGYTEVIFA SLESKGRRES ERMAEALAEM DINSSGMVAK
ARALLILYRV VPDHTLDIVK ETFNEDDTDQ ISCQVVLDLI SKQMADRIHT LSSHHDSSDE
VKNLVDAKNA VIPGGRNDPS ERPEYVFDAS TNSVRKYAEK Q
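
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a rough sketch of how the gene and protein sequences might be related: it joins the blocks, locates the start codon whose translation matches the beginning of the protein, and provides a GC-content helper. It assumes the standard genetic code, since the Translation table field in this record is blank. The helper names (`clean`, `translate`, `find_cds_start`, `gc_percent`) are hypothetical, and only the first three lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Assumption: standard genetic code (NCBI table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def clean(blocks):
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()

def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

def find_cds_start(dna, protein, k=10):
    """Return the 0-based index of the first ATG whose translation
    matches the first k residues of the protein, or -1 if none does."""
    for i in range(len(dna) - 2):
        if dna[i:i + 3] == "ATG" and translate(dna[i:])[:k] == protein[:k]:
            return i
    return -1

def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# First three lines of the gene sequence and first two blocks of the protein.
gene_prefix = clean("""
CTGGAGATGC TGGTGTATCT CCACAACCCA AACACGGCTT CACCTACCAA ACAGGTACGT
GCGCGCTGGA GTTTTGGGAG CCCATAAAGG AGCTGGGCGA AGGCAGCATT TCTAGCATTC
ACATGGTGAG GCGACGAGAA AAACGAATCC ACGTTCCGTA CAAAGAGCGC GTTGATATTA
""")
protein_prefix = clean("MVRRREKRIH VPYKERVDIM")

cds_start = find_cds_start(gene_prefix, protein_prefix)
translated = translate(gene_prefix[cds_start:])
print(cds_start, translated)
```

On this excerpt the matching start codon lies 122 bases into the listed gene sequence, so the sequence begins with untranslated bases before the coding region. Applying `gc_percent` to the full cleaned gene sequence would give a figure to compare against the listed GC content; that comparison is not carried out here.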