Gene PHATRDRAFT_47820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47820 
Symbol 
ID7203061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp155707 
End bp156949 
Gene Length1243 bp 
Protein Length329 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182166 
Protein GI219123718 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00872972 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCCACATCCT TCGTTCCATT CTCTCACCAA TCTCGAACGT TACACATTCA CACTCTGTAC 
TGTGTGTATT GGCAACATTA TTAACAATAC AGTCTTGTAC TGATTGACTG CGCCAGCATC
GATTGATTAA TTGATTGATT GAGACGAACG AAGGAACGAA TTGATCGTAT TGATTTTCTC
CGCGAAAGCG CACAACGACA TGGCTTCGAG TGGTGCGGCG GCGCGGGGTC CCCTGGGTTG
GTTGCTGCGA TCCACTGGAT CTCAATGGCT AGTACTGGGA GCCGGAACGA TTTCCTTCTT
CCCGGAGCAA GTCCGGGAAG TCGCCTGGCC GACGGTGAAG AACGCCTTGG GTTTGACGGG
GTCCGATGCG GATTGGCCTT CCTTCTCGTT GTTGTCGTCC CAACGCAGTC TCCGCGCGGA
CAGCGAGAGT CAACGATCCG TGCCCAGTTC CATCGTCATT CACACCAGCA ACGGCAGTAG
CACGCAGACG TGGATGACGA CTATCGTCAC TTGGACGGTG GGGGCTACCG CCGTCTGGGT
CGCCTACTCG GTCTTTAGCA ACTATCTTCC GGATCAGATC AAAACTATGC TGCCCGTTAC
CCGCCGCGTC TTTGACAGGG CCACGCAGAC GCTGGCCGAT CACGTCTTCC ACGTCAAGGA
TGTGCTTGGC AAGCAATTAC TCAGACTGAC AGCACAGCAG GATGAACTCG CCTCGCAACA
ACAGGAAACC CACAAGGATG TCCGGGTAGT CCAACAGGAT ATGAAACAGG CACGCCTCGA
ACTCATGCAA CTGCTGTCCG GTATGGATCG CTGCGAAGTG CGTTTAGAAG ATTCGGCTGC
CGTACAGTCC TACACGGCCC GGGGAGTCAA GTTGCTAGCC AAGTGCGTCG CCTCTATTAT
GCCCGGCAAC GAACGGATTG GCCACGAACT CGAACGTTTC CAACGGGATG ATTATCCGCT
GTTGGACAAT ACCACGGCCA ACCATCCCCA CGGTAGGGAC GCGGGTCCCG AAAAGGAACT
CTCCTCCAGC CGAACTCCGA AACGAAGCAG CAGTTCCAGT ATCCCCTTGC TCAAACGCGA
AAGTATGGAT CGGTCACTTT CGGACGAGAC GGTCGCGAGC TTGGATAGTA TGACGACGAA
CGATTTGGAG ACGCTGCTTA GCACCGGTCG TATCGTTCTC GAAACGTAAA ATTTGGAAAA
AACATCTTAA TACATACACA TAGTAAAGTG ACAATTACAG TCG
 
Protein sequence
MASSGAAARG PLGWLLRSTG SQWLVLGAGT ISFFPEQVRE VAWPTVKNAL GLTGSDADWP 
SFSLLSSQRS LRADSESQRS VPSSIVIHTS NGSSTQTWMT TIVTWTVGAT AVWVAYSVFS
NYLPDQIKTM LPVTRRVFDR ATQTLADHVF HVKDVLGKQL LRLTAQQDEL ASQQQETHKD
VRVVQQDMKQ ARLELMQLLS GMDRCEVRLE DSAAVQSYTA RGVKLLAKCV ASIMPGNERI
GHELERFQRD DYPLLDNTTA NHPHGRDAGP EKELSSSRTP KRSSSSSIPL LKRESMDRSL
SDETVASLDS MTTNDLETLL STGRIVLET