Gene PHATRDRAFT_47769 details

Gene Information

Locus tag: PHATRDRAFT_47769
Symbol:
ID: 7202929
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 781367
End bp: 783193
Gene Length: 1827 bp
Protein Length: 522 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002182134
Protein GI: 219123648
COG category:
COG ID:
TIGRFAM ID:
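
The Gene Length field agrees with the Start bp and End bp fields when both coordinates are counted as inclusive positions. A short Python check of that arithmetic follows; the variable names are chosen here for illustration and are not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: both ends are inclusive positions on the replicon.
start_bp = 781367
end_bp = 783193

# Inclusive span: the last position counts, hence the +1.
gene_length = end_bp - start_bp + 1
print(gene_length)  # 1827, matching the Gene Length field
```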


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GAGCTTGCCG AACCAAAGGA AAAACCTGTT GGCAGCGTTA CTCAGACAAG ATGGAATCTA 
GTGAAAACTT CCCACGTAAC GAAACGATAC TTACGCGGGC AACGATTGTG TTCGCCCTTT
GCGCGTCGCT GAACTCCGCC AACTTGGGAT ACGATATTGG CGTGAGCACA GAGGCTGGAC
GCCTGATCCA AGATGATTTA CAACTCTCCA GATTTGAACG TGAAATGTTT ACGGGAAGCA
TCAACTTTTG GGCAAGTGAG TAGAAAGATA CATGATCCGG CTTTGAGTTG AGGGCAGAGA
CAGCGGTTAT CTGTTTGCCG TTTATGCGGC CTCGTATTGT CACAAAAGCT TGTCCAATCG
ATCAATACAA CTAACCTGCT CTTTTTAAGT GTTTGGTGCA TTTTTTGCTC ATCATTTTAC
TGATACATAC GGCCGAAGGT CAACGTTCAT TCTGGCAGCT GTTGGCTTCA TCGTAGGCGT
ACTCCTGCAG TCGTTTTCAA GCACTTTCGA CCTCTTGATG CTGGGTCGAT CGTTTGTCGG
TCTTGGGGTA GGAACAGGCC TCGCGGTTGG TAAGTACCGA GCTTATCACT CATCGTCTTG
CCAAACGTAA ACAGCTCAAA CCAGCTTACG GATTTCCCAA CAGATCCTCT CTATATTGCC
GAAGTCACCC CGCCACACCA CCGCGGGGAA CTTGTGACAT GGTCCGAAAT CGCCAACAAT
GTGGGGCTGG TGTTGGGGTT TTCAACTGGT TTTTTTCTAG CATGGTTACC GGATGGTCAG
GAATGGCGTC TTATGATTTT GCTTGGTGCT ATTTTGCCAA CTGTCATGAT CGCATTAGTC
ATTTTCGTCA TTCCGGAGTC GCCGCGTTGG TTGATCTCTC GGAATCGTGT AGATGAAGCC
ACGGAAATTT TGCTACAAAC GTATCCTCCG GGCTCCGATG TGGACTTGGT CGTGGAAGAG
ATCAAACAGG CTATTATTCG GGAACGAGTC GCCGAGAATT CCGTGGGTTG GATGGTGTTG
CTACACCCGA CACCCGCTAT TCAACGTATG CTTTTGGTTG GAATCGGCAC TGCTGTATCT
CAACAAATAG TTGGCATCGA CGCAATTCAG TACTACTTGT TGGATGTTAT CGATGAGTCC
GGCATCGAAT CGCGACAAGC GCAAAGTGCA GTACTGGTTA TTCTCGGAAT AGTCAAATTA
TCATTTGTCA TTCTCGGCGG GAAGCTTTTT GACACCAAGG GACGACGGCC ACTCTTGTTT
ATTTCATTGA TCGGTATGGC TGTTTCTTTG GCTCTAGTAA GTCTCGCCTT CTGGATTGAC
ACTGCATGGA GTCAAGGGGT CATCATTTGT GGTCTCGGTC TGTATTTGGC CTTTTTCAGT
GTGGGTGTAG GCCCCGGTGC ATGGCTAATC CCGTCCGAAG TGTTCGCTAA CTGTATTCGG
GGGAAAGCTA TGAGTGTCGC TGCTTTTTGG AATCGTCTCG GTGCCACGAT CATGGCCAGT
ACTTTCCTGT CGATAGCCAA CGGAGTAGGC TGGGCAGGTT TCTTTCTATT GCTGAGTGGT
GCGTCTTTGC TAGTCCTGTT TTTTTTGTAC ACATACCTAC CGGAAACCAA AGGCAGGTCT
CTGGAAGACA TGTCGGTATA CTTTGCTGAG ATCACCAAAG ACGGTTTCAT TTTAGAAGCC
GAGGCAGCGC TATACAAGGA TGACGGAGAT GAGCTTGAAT TGGCCTCGTC ATCATTACAG
CACACTTTGC CCCCGTCGTC GTTGGCACAT CGACCTTTCT CTGAAGCCAA TGCTAATCTG
CACAGCGAGA AAGACCAGCT TTTGTAA
 
Protein sequence
MESSENFPRN ETILTRATIV FALCASLNSA NLGYDIGVST EAGRLIQDDL QLSRFEREMF 
TGSINFWAMF GAFFAHHFTD TYGRRSTFIL AAVGFIVGVL LQSFSSTFDL LMLGRSFVGL
GVGTGLAVDP LYIAEVTPPH HRGELVTWSE IANNVGLVLG FSTGFFLAWL PDGQEWRLMI
LLGAILPTVM IALVIFVIPE SPRWLISRNR VDEATEILLQ TYPPGSDVDL VVEEIKQAII
RERVAENSVG WMVLLHPTPA IQRMLLVGIG TAVSQQIVGI DAIQYYLLDV IDESGIESRQ
AQSAVLVILG IVKLSFVILG GKLFDTKGRR PLLFISLIGM AVSLALVSLA FWIDTAWSQG
VIICGLGLYL AFFSVGVGPG AWLIPSEVFA NCIRGKAMSV AAFWNRLGAT IMASTFLSIA
NGVGWAGFFL LLSGASLLVL FFLYTYLPET KGRSLEDMSV YFAEITKDGF ILEAEAALYK
DDGDELELAS SSLQHTLPPS SLAHRPFSEA NANLHSEKDQ LL
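
Both sequences above are printed in blocks of ten residues, so they need to be joined before fields such as Gene Length, Protein Length, or GC content can be recomputed. The following is a minimal Python sketch, assuming the sequence text has been copied into a string. The helper names `join_blocks` and `gc_percent` are hypothetical and do not belong to any database API; the sample is only the first line of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used here only as a short sample.
sample = "GAGCTTGCCG AACCAAAGGA AAAACCTGTT GGCAGCGTTA CTCAGACAAG ATGGAATCTA"
dna = join_blocks(sample)

print(len(dna))                    # number of bases in the sample line
print(round(gc_percent(dna), 1))   # GC percentage of the sample line only
```

Running the same two helpers on the full gene sequence would give values to compare against the Gene Length and GC content fields; `join_blocks` alone is enough to check the protein sequence against Protein Length.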