Gene PHATRDRAFT_15689 details


Gene Information

Locus tag: PHATRDRAFT_15689
Symbol:
ID: 7195242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 171893
End bp: 173218
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002183564
Protein GI: 219126648
COG category:
COG ID:
TIGRFAM ID:
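
The length fields above follow from the coordinates by simple arithmetic. The sketch below checks that arithmetic; it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp, has_stop=True):
    """Residue count for a CDS, optionally excluding a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


length = gene_length(171893, 173218)
print(length)                           # 1326
print(expected_protein_length(length))  # 441
```

Both values agree with the Gene Length and Protein Length fields of this record.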


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCGATAACG GCATACGGGT CGTGTCGCAG GAAACATACG GACAAGTCAG TACCGTCGGG 
GCCGTCGCAC AAGTCGGTAG TCGCTTCGAA CTGCCTTACG AAACCGGCAC CTGCAATCTC
CTCGAAGTCC TCGGATTCTC CTCCACCGCG CAGCTCTCCG GCCTCGAAAT CACCAACTGC
CTGCAAGACT GGGGCGGCAC GCCTTTTGTT AATCTCAATC GGGAGCAGTC CCTGCATTGT
ATCGATTTAC TCCGACCCAA CGTGGAAAAA GCCGTCGCCT TGTTGGCGCA GGCGTTGCTG
GAACCGCAAT TTCGTGCCGA AGAAATTGAA GACGCCAAAC GAGCACTCGA ATTTCAAGCC
CTCGATATGC CTCCGGAGCT CTTGCTCGGA GAGGGCCTGC AAGTAGCCGC GTACGGAGAA
TCGCAACAGT TGGGACAAGC CCACTTTCCG GCATCGACGG AATCGCTCAA TAATTTGTCA
CCGGAAACGG TCGCCAACTT TTGGAGTCGT CAGTTACTCC ACAATACTCC CGGAATCGTA
TTGGCCGGTG CCGGAGTCCG ACACGACAAA TTAGTGGAAT ACGCCGACCG ATTTTTTGGT
CACATGCCCG GACCAACATC CAGCGCCAGC ACGACACCAT CGCCTCAGGT TGCCATTACA
CGTTCGACCT ACCGCGGTGG ACAGGTCCGT ATACACCGCC CGTACAACCC GCAACTTGAA
GACAAAGATC TTGTACGCAT TGCATTGGCT CTACACGTCG ACGACGGTTG GCACGGGGAC
GACTTGGTTG GCGTCTGCGT CCTCCAAACC CTCCTCGGCG GTGGCAATTC CTTTTCCGCC
GGTGGCCCCG GCAAGGGCAT GTACAGTCGC CTCTACCGAC AGGTACTGAA TCGGTATAAT
TGGGCCGAAT CGGCCGAAGC CTTTACGGTC TTTTACGAAG AAGCGGGACT CTGGGGAATC
AGTGGTTCCA CACATCCCGG TCGCGCGCGA GAAATGACCA AAGTCCTGGC CGAGCACGTA
CTGCGACTAG CCAGCACACC CGTGACGGAC GAAGAATTGT CCCGCGCCCG GAAAATGCTC
AAAAACAACG TCTTGACGCA ACTCGAATCG CGGTTGGTTC TATTCGAAGA TATGGGACGG
CAGATACTGA CGTACAACAG CCGGCAAGAC ATGCACCAAG TTTGCGCCAA GATTGATGCC
GTGACGGCGG ATGATCTGGT CCGGATTGCG CAAAATTCGT TGCGTCACCC ACCGACGCTG
GCCAGCGTAG GAAGCAACCT TGCCTACGTA CCGCAACAAT CCGAAGTGTC GGAGTGGTTT
CCTTAA
 
Protein sequence
LDNGIRVVSQ ETYGQVSTVG AVAQVGSRFE LPYETGTCNL LEVLGFSSTA QLSGLEITNC 
LQDWGGTPFV NLNREQSLHC IDLLRPNVEK AVALLAQALL EPQFRAEEIE DAKRALEFQA
LDMPPELLLG EGLQVAAYGE SQQLGQAHFP ASTESLNNLS PETVANFWSR QLLHNTPGIV
LAGAGVRHDK LVEYADRFFG HMPGPTSSAS TTPSPQVAIT RSTYRGGQVR IHRPYNPQLE
DKDLVRIALA LHVDDGWHGD DLVGVCVLQT LLGGGNSFSA GGPGKGMYSR LYRQVLNRYN
WAESAEAFTV FYEEAGLWGI SGSTHPGRAR EMTKVLAEHV LRLASTPVTD EELSRARKML
KNNVLTQLES RLVLFEDMGR QILTYNSRQD MHQVCAKIDA VTADDLVRIA QNSLRHPPTL
ASVGSNLAYV PQQSEVSEWF P
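
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the standard genetic code (the Translation table field of this record is blank, so that choice is an assumption) and that the gene sequence is given 5' to 3' on the coding strand. The helper name `translate` is illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 24 bases of the gene sequence, spaced as displayed in the record.
print(translate("CTCGATAACG GCATACGGGT CGTG"))  # LDNGIRVV
# Last six bases: one proline codon followed by the TAA stop.
print(translate("CCTTAA"))                      # P
```

The two outputs match the first eight residues and the final residue of the protein sequence shown above.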