Gene PHATRDRAFT_46523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46523 
Symbol 
ID7201600 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp597089 
End bp599378 
Gene Length2290 bp 
Protein Length715 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180865 
Protein GI219120244 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGCTTTGGG AAAATCGTCG GATCCTTACT CTTTTGCAGC GTCGCATTTT TGTCTTGTAC 
AGTTCATCTC CTTCCCATGA GATCTATTTT GCGTCAACAA GATCAACCCC GACTGTTACA
CCGCCTTCCC AGCCAGCGGC AGTCGATGCA GGCATCATCA TATTTTCCCC TGGACAACGT
GCTCGACGTA TCCTATCTCG ACTCTTCGCT GCCAAATTCG CGGGAATGCG ACTTTTACAC
GAATCCCACT ATTCTCAGTC GTCTGATTTT GCACCAAAAG TACGAAGCCG CAATGCGCCG
TTCCTCTACG CACAGCGAAG AAGCAAGGAC TTGGGTGGTC GTGCGGCGCC AGACTAGTCC
GGCTTCTTCC GTGAGCAACC AAGCCGCGAC CCCATCATCC CCCGCCAAAA CGACATCGCA
GCGTTCGAAC GTGACTTCCC TGAGCTCGTT GAGTGAAGAT GACGTCAACA ACACAACGCT
TTCTTCGAAC AGAAATGGTG ACGTGAATTG TGAGTATTAT TCCTGTCGTC AGCTCCCCAT
TCACATGGCC TGTGGAAACT TGTTTCGTGT AGTAGATCCG GCTTTGAAAG CTCAGCTCGA
AAAGCTCATT GCCACGCTGG TGGTGGCCTT TCCCGAAGCT TGTTCCCAAC GCGATCACCA
ACACCGGATG CCTTTGCACG AAGCGATCTG GTACCGAGCC GGTCCCGAGA CAATTTCGGC
CTTGTTGATT GCCTATCCCG ACGCAGTTTC CATGCGGGAC AAGTACGGCC GCTATCCCAT
GGCGCTCAAT GAGTGCCGAG ACAGCCCGTA TCGTACACAG ATTCGGCATA TGTTACTACA
AGGTCGAGAC TTTTGGAATA CGGCCCGCAC GGAGGCCAAG CTGCGACTCA AACATCGCAC
CGTGCCTGCC GATTTGCAGA GCGTTGCTTC CCAGAGCGTT TTGGCAGCCA GCGTAACCAG
TACCGACGAC AATTCAATGT ACACACGTGG TGATAGTGTA CGGCAAGGCC GGGCGGGCCC
GGGATATCAG CAACTATCGC CAACGATAAC TGGTTGTGGG GAATACAAAC GTACCATTAC
TTCTTGGTCA CAGTTGGAGC ACCGCACGAA TACACTGGAA GAGAAATTGG CCGAGTCGAT
GCAAGAAAAC TACGAAACTG GAAAAGAAGC AACCAAGTTG CGGGTTTCCA AAGCGAAATT
ACAAGCAAAG TATGACGTAC TGATGGGGAC GGGCCTCGGA AAACAGATAG AACTTTTGCA
AAATGAAAAG ATCGCACTGG AAGTTCAAGT TCGCGACCTC CAACAGCTCG CTCCACTTGC
GGAAGTGCGA TCTTTTCCTC TCAACCAAGA CCACCCTATG GATAAACTTG TCCCACAAAA
TATCGTCCTT CACTGTCAGC CCGAAGTGAC GGTAGACGTG GAATTGCAAG TCTTGAGGGA
AGAAAATGTC CGGCTCAAAG CCGGTATGGG GCTGCTGTCG CAGAAGCATA AAGATTACCA
ACGCCGCTTA GACTTTGCAG AATCTTTGTT GGACGACATG GAAGATCCCG AAGACTTCCC
ATTCTTTGAC GACAACGCAA CCGATTACAG CACGATCTTT ACAATTTCTA CAGGAACGCC
GAAAGAGAAA AGGATACACA CGCCGCGCCG TCCGGAACCG GAGAAGCGGG TCCTGTCGCC
CGCACTCCAA CAAGTACGTC CCAAGTCGGC AATGAAAAAT TTCATGCCGG TGGAGGATGT
CAGTCAGAAC AAGCCCGGTT TCATGGACCC ACTTGATCCA TCGCTGCTGG AAATGTCGCG
AGAAGACGAC CTGGAGTCTA TCCTCAAAGG AGCACAAGAA TATTTTGATA AGTCGTCGGG
CCTCACTCGT CGCCTTGCCG ATAGCATGTC ATTGCCTACA TCACGCATGA CGTCCCAGAT
CACATTACCA AGCATTCTTG AAAAGCCCGA TGGGCAAAGC TCCCGTGACA GCCGGTCTGG
AAGCTTCAAT AGTGGCCAGT TTAGCACGGA GGAAAAGGAG AGTGCGCTCG ACGACGAGAT
CATCGAATCT AGTCTGCATA CTGCAATAGC CGGCAGCGAC AACTTGACAG TGCTTCTGCA
AGAAACAGCC CGGCTCTACA GCGCTCTACC CCGTGATGCT TCCATCCCTC CTTCGCTATC
TCCAAATGTA TCGGATGTCA CGTTGCAGCT GGCACGTATG GAACTGGAGC ATGGCTCGGT
AAACTTCGAA GACCTTTTGG CAGAGGCCGC TCAAATCTAC AGCGGAAGCA GCAATGTGTC
CCACTTCTAA
 
Protein sequence
MRSILRQQDQ PRLLHRLPSQ RQSMQASSYF PLDNVLDVSY LDSSLPNSRE CDFYTNPTIL 
SRLILHQKYE AAMRRSSTHS EEARTWVVVR RQTSPASSVS NQAATPSSPA KTTSQRSNVT
SLSSLSEDDV NNTTLSSNRN GDVNYPALKA QLEKLIATLV VAFPEACSQR DHQHRMPLHE
AIWYRAGPET ISALLIAYPD AVSMRDKYGR YPMALNECRD SPYRTQIRHM LLQGRDFWNT
ARTEAKLRLK HRTVPADLQS VASQSVLAAS VTSTDDNSMY TRGDSVRQGR AGPGYQQLSP
TITGCGEYKR TITSWSQLEH RTNTLEEKLA ESMQENYETG KEATKLRVSK AKLQAKYDVL
MGTGLGKQIE LLQNEKIALE VQVRDLQQLA PLAEVRSFPL NQDHPMDKLV PQNIVLHCQP
EVTVDVELQV LREENVRLKA GMGLLSQKHK DYQRRLDFAE SLLDDMEDPE DFPFFDDNAT
DYSTIFTIST GTPKEKRIHT PRRPEPEKRV LSPALQQVRP KSAMKNFMPV EDVSQNKPGF
MDPLDPSLLE MSREDDLESI LKGAQEYFDK SSGLTRRLAD SMSLPTSRMT SQITLPSILE
KPDGQSSRDS RSGSFNSGQF STEEKESALD DEIIESSLHT AIAGSDNLTV LLQETARLYS
ALPRDASIPP SLSPNVSDVT LQLARMELEH GSVNFEDLLA EAAQIYSGSS NVSHF