Gene PHATRDRAFT_20787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_20787 
Symbol 
ID7201661 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp266602 
End bp268739 
Gene Length2138 bp 
Protein Length473 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180975 
Protein GI219120475 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0430146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCCAGTCGA ACTAAATACC TTCCATCCTT CTGGATTGTG TGAGAGTGTG TATTAGCGCC 
CCGTCGCCTC GCATTGCACC TATATCCATC CATGGCTCTT CGTCTTTCTC TTTCCAAGGT
ACGTAGACGC TAAATTCCGA ATGTATGGGT GGGATGTTGC GGTGGGCAAC ACAACGTCGT
GTGCCGGCAG CGACATGGCA ACCGACACAC GCGGTGCTCC CCCTATCTTG TTGCTGTGGT
TGTGTGATTG TGTAATGATT CCATCAAAGT TGCTGGTTCA GTGTCAAATG ATTCCGTGTT
GTACAAATTT GTGGAATCAC ACACGAATGG TGTTTGAAAG TCTCTCACTC GCGTTTCCGT
TGCCCCACCA TTTTTCGTTG CGACAGTGGA GTCGTCCGAC GGCTCGCGCA TCGTCGCGAT
GTGCCTTATC CACCGCCACG GCCGCCTTTC CGGATTACGT CTTGCGGGCG CCGACGACGG
ACGTCACGAC GTTGGATTCG GGATTGCGCG TGGCGTCCGA AACGGTGCAA GGGTCGGAAA
CGGCCACGGT GGGCGTCTGG ATCGACGCCG GGTCGCGCTA CGAAACGGCC CGGAACAACG
GCGTCGCCCA CTTTCTCGAA CATTTGGCCT TTAAAGGGAC GGAACAACGG ACCCAGCCGC
AACTCGAACT CGAAATCGAA AATATGGGGG GACACCTCAA CGCCTACACC TCGCGCGAAC
AGACCGTGTA CTTTGCCAAG GTCTTTAAGG ATGACGTCGG GAAAGCCGTC GAGATTCTTT
CCGATATCCT CTTGCACTCC AAGTTGGACG AGGCCGCCAT TGACCGGGAA CGCGACGTCA
TTCTCCGCGA AATGGCCGAA GTCAACAAGC AACAGGAAGA ATTGGTGCTC GATCATTTGC
ACGCCACCGC CTTTCAGGGA ACCGGACTCG GACGTACCAT TCTCGGTCCG GAAGAGAACA
TCCGCTCCCT TTCGCGTACC GACCTCGTTG ATTACATTCA GCAACACTAC ACCGCGCCCC
GGATGGTCAT TGCCGGAGCC GGAGCCATTG ATCACGATCA GCTTTGCGGA CTCGCGAGTC
AGCACTTTGG TGAATTGCCC ACCGCACCCA AGGATGGACT CGAACTCGCC ATGGAACCAG
CCATCTTTAC CGGATCGGAT TATCTGTAAG TTACCCATCG CGATTGTAGT TGTAGCTGCA
GTTGTCGTGG CTATGACGTT GTGTCCTTGT GCCGTGGATG GTTCGGACGA TGATTTCGTT
CTTGTGCCGG GGGACACGTT CCGTCTCGGG GAGGGTTTCT TTTTTCGCGC GCTCGTCACT
GTTTTTTAGA TTATCCACTC CACCTACTCA ATCTCACTCA CCCACGTGTA CCTCGCCGTT
TCGCTCCCCT TTCCAGCGTC AAGTTTAACT CGGACGACAC GGCCCATATT GCCATTGCCT
TTGAAGCCGC TTCGTGGACT TCCGAATACG CTTTCCCCCT CATGCTCATG CAAATCATGC
TCGGATCCTA CAACCGCACT CAGGGACTCG GACGCAACCA TGCTTCTCGC CTCTGCCAAG
AAGTGGCCGA ACACGAACTC GCACATTCGG TCAGCGCCTT TAACACGTGC TACAAGGATA
TCGGTCTCTT TGGCGTCTAC ATGGTCGCCC CCGACAAAAA GGTCGACGAC CTCATGTGGC
ACGTCATGAA CAATCTCGTC CGCTTGGTCC ACACACCGTC GGAAGAAGAA GTCGAACGCG
CCAAGCTCAA CCTCAAGGCT ATTATGCTCA TGGGGCTGGA CGGACACGCC AACGTGGCCG
AAGACATTGG CCGCCAATTG CTCACGTACG GACGCCGCAT GACGCCGGCC GAGATCTTTT
CGCGTATCGA CGCCGTCACC AAGGACGATA TTCGAGCAAC GGCGGCCAAA TTCATCAACG
ACCAAGATCA CGCCCTCGCG GCCGTCGGAG GAATCCATGA ACTGCCCGAC TATACTTGGG
TCCGCCGCCA TTCCTACTGG CTGCGTTACT AGACACACAC ACGCACACAC ACGTGTGTTG
GTGGAGTCGG CGGGACAGAA TAGTGGCGAA AAACGCGAGC GAGGATCGTC GCTATTATAC
ATGGTAACGA GAAAGTAAAG ACCAAAAAAA GACCTTTG
 
Protein sequence
MALRLSLSKW SRPTARASSR CALSTATAAF PDYVLRAPTT DVTTLDSGLR VASETVQGSE 
TATVGVWIDA GSRYETARNN GVAHFLEHLA FKGTEQRTQP QLELEIENMG GHLNAYTSRE
QTVYFAKVFK DDVGKAVEIL SDILLHSKLD EAAIDRERDV ILREMAEVNK QQEELVLDHL
HATAFQGTGL GRTILGPEEN IRSLSRTDLV DYIQQHYTAP RMVIAGAGAI DHDQLCGLAS
QHFGELPTAP KDGLELAMEP AIFTGSDYLV KFNSDDTAHI AIAFEAASWT SEYAFPLMLM
QIMLGSYNRT QGLGRNHASR LCQEVAEHEL AHSVSAFNTC YKDIGLFGVY MVAPDKKVDD
LMWHVMNNLV RLVHTPSEEE VERAKLNLKA IMLMGLDGHA NVAEDIGRQL LTYGRRMTPA
EIFSRIDAVT KDDIRATAAK FINDQDHALA AVGGIHELPD YTWVRRHSYW LRY