Gene PHATRDRAFT_47585 details


Gene Information

Locus tag: PHATRDRAFT_47585
Symbol:
ID: 7202803
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 172398
End bp: 173488
Gene Length: 1091 bp
Protein Length: 271 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002181860
Protein GI: 219123081
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCACGGTCAG TGACATTCAA CCGTCCGTTC CGAGATTTCC AAGCGACAAG AGATAGTGTT 
ACGTGTGCAA CTATGTCACC ATGGCTAGGA ATGGTGCAAT TGCTGTGATT GCGAAATGCC
CTATGGCGGG CAAGAGTAAA ACGCGTCTGA TACCTCTCTT GGGTGAACAG GGATCGGCTG
CCTTGGCTCG TGCCATGCTC TCGGACGTTC TCACCAGTCT GTCACGTTGT GTAAGTCCAC
TTTTACATCA AAAAGGTAGG AAACGCGCCC TCTCTTTCTC TCATGAAGTA TTCTCTATAG
GAAGAGTTGA AGCCAGCGAA AAAAATTCTG TTCTATGCTC CCCCAACGCA GCAAGGTCTC
GAAATAATGC GAGAAATTCT CATCAGTTTG TCACTATATA GTCAACCTCA CCAAGAATGG
GTTTTGCTAC CGATGGTGTC GGTTTCATTG GCATCTTCAG ATCTAGGGGA TCAACTCACC
GACGCCTTAG TGCGTGCAAG GCAAGTTCAG GTAGAAGAGC ACCATACCGC CAACGCTTTG
CCTGGACCCG TGATATTTCT CGGCATGGAC GCCCCGGAAC TCCCACTCGG TGAATTGGTC
TCGGCCTTTG AGCACCCCGA CACAGCTCTT TTATGTCCCT CCGACGATGG GGGCTACGGA
ATGTTATCCG TGCCGGCAAC GGCCGATGCC GACTCCATCT TTGATGGAAT CCGGTGGTCC
GATCCTTTAA CGGCAGTCGC ACAACTCAAG AATTTAACGG ATGGCGGTGT CCCCGTTCGG
ATCGGACAAC TGATGCATGA TATGGACGAG CCAGACGACG TCTTAAATTT GTGCGCACGC
TTGCGAATCC ATCATTTGCA GGATTCATCT CTCTTGCCGT CTTTGCCGAA CAATGCTAAG
GCGAACGCCG CACCATCCGT AGATTCCAAG TACGTTAGCA AGCCAGATAT TCTGATGCAA
CCGTCGTCGC TTCTACAGAA GCGAGAAATC TGTTTAGGAA GAAGCATGGA ATGTTCTTGC
CATTACACCA AACAGATTTT AGTAAAATGT GCAGTTATGG TCGTTTGCTA GAGCTTCTTG
CGCTCGCTTA A
 
Protein sequence
MARNGAIAVI AKCPMAGKSK TRLIPLLGEQ GSAALARAML SDVLTSLSRC EELKPAKKIL 
FYAPPTQQGL EIMREILISL SLYSQPHQEW VLLPMVSVSL ASSDLGDQLT DALVRARQVQ
VEEHHTANAL PGPVIFLGMD APELPLGELV SAFEHPDTAL LCPSDDGGYG MLSVPATADA
DSIFDGIRWS DPLTAVAQLK NLTDGGVPVR IGQLMHDMDE PDDVLNLCAR LRIHHLQDSS
LLPSLPNNAK ANAAPSVDSN YGRLLELLAL A
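
The length and GC fields in this record can be cross-checked against the printed sequence. The following is a minimal sketch, not part of the original record. The helper names (`clean_sequence`, `span_length`, `gc_percent`) are hypothetical. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed gene length of 1091 bp.

```python
def clean_sequence(block: str) -> str:
    """Strip the spaces and line breaks used to print a sequence in blocks of 10."""
    return "".join(block.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence above: six blocks of ten bases.
first_line = "TCACGGTCAG TGACATTCAA CCGTCCGTTC CGAGATTTCC AAGCGACAAG AGATAGTGTT"
print(len(clean_sequence(first_line)))  # 60

# Coordinates from the Gene Information fields.
print(span_length(172398, 173488))  # 1091
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value to compare with the GC content field. The record shows that field only as a rounded percentage.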