Gene PHATRDRAFT_19604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_19604 
Symbol 
ID7200309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp227981 
End bp229307 
Gene Length1327 bp 
Protein Length363 aa 
Translation table 
GC content60% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179388 
Protein GI219117187 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCACCTCGAG CAACTCTCCG GCAGTGTACC CCAAAATTCC TTACTCGCGG ACTTGACCAC 
CGCACTCCAC GACGTTGCCA CCGATTGGCA ACCCTCCATC GCCCTCGCCG AAGCCGTACT
CGACCTCGAC GCCGCTCCCC GCGAATTCCT CGTACAAGCC GCCTACCGGG AAGAACTCGG
AACTCTACAA CGGGAACTCC AGGACGTCCG GGACCAAGTC CGGGACTGTC ACGCGCACAT
GAACGACTTG TGGGCCTCCA CCACCGGCAA CGCCCAAGCC ACGGTCAAGC TAGAAACAGC
CGACGACGGT TTCCTCTTTC GACTCACCAA CACCAACGAC ACCAAACTCT TGCAGAATCA
ACTCGGGAAC GTGGTGCAAA TCCACAAACT GCTCAAAAAC GGCGTCTCCT TCTCCACCAA
GGAACTCCGC CAGCTCGCCA CCGCCCAGCA GGATTTAATG GCCGAATACG ATCGCCAGCA
AAAAGTCGTC GTCCAAGACG CCCTCAAAGT TGCCGCTACC TACAGCGTCG TACTCCAACG
CGCCTTTGAC GCCGTTGCCA CCCTCGATGT CCTAGTCGGA CTCGCCCACC AAGCCGCCTA
CAGTCCCCAC GGATACTGCC GACCCACTTT GATCGACGGC GACGACTGCG CCGGTCACGG
CATTCAGCTT CAAGGCGCAC GCCATCCCTG CGTAGAAGTA CAGGAATCCG TCTCCGACTA
TATTCCCAAC GACGTCGATC TCACCCACGA CCGTTCCAAC GTACTCCTCG TCACGGGCCC
CAATATGGGT GGTAAGAGCA CGTACATTCG CGCCGTCGGT GCGATTGTCC TGCTCGCGCA
AATTGGGGCC TTTGTGCCCT GCCAATCGGC CACCATTCAC ATTCGGCACC ACATTCTCGC
CCGCGTTGGG GCCGGAGACT GGCAAGATCA GGGCATTTCC ACCTTTTTGG CGGAAATGCT
CGAATCGGCC GCCATTTTGC GGACCGCCAC GGCCCGATCG CTCATCATCG TCGACGAACT
GGGGCGCGGC ACGAGTACGT TTGACGGATA CGGCCTGGCC CGGGCCATTG CCGAATATAT
GGTCCGGAAC GTTGGTAATC TTTGTGTCTT TGCCACTCAC TTCCACGAAT TGACCAGTCT
GGCCGACGTC TTTACGAACG TTCGGAACTG TCACGTCACG GCGCAGCGGG ACGTGCAGGG
GTTGACCTTT CTGTATCAGA TCCAACCCGG TCCGTGTCTA GAGTCCTTCG GCATTCAAGT
CGCCGAGCTG GCCGGCGTAC CCGCCGTCGT CGTGCAGGAT GCCCAACGCA AGGCGCGAGA
ACTGGAA
 
Protein sequence
MNDLWASTTG NAQATVKLET ADDGFLFRLT NTNDTKLLQN QLGNVVQIHK LLKNGVSFST 
KELRQLATAQ QDLMAEYDRQ QKVVVQDALK VAATYSVVLQ RAFDAVATLD VLVGLAHQAA
YSPHGYCRPT LIDGDDCAGH GIQLQGARHP CVEVQESVSD YIPNDVDLTH DRSNVLLVTG
PNMGGKSTYI RAVGAIVLLA QIGAFVPCQS ATIHIRHHIL ARVGAGDWQD QGISTFLAEM
LESAAILRTA TARSLIIVDE LGRGTSTFDG YGLARAIAEY MVRNVGNLCV FATHFHELTS
LADVFTNVRN CHVTAQRDVQ GLTFLYQIQP GPCLESFGIQ VAELAGVPAV VVQDAQRKAR
ELE