Gene PHATRDRAFT_47061 details


Gene Information

Locus tag: PHATRDRAFT_47061
Symbol:
ID: 7202134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 306672
End bp: 308041
Gene Length: 1370 bp
Protein Length: 386 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002181349
Protein GI: 219122012
COG category:
COG ID:
TIGRFAM ID:
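
The Gene Length value can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; `feature_length` is a hypothetical helper name used only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp copied from the record above.
gene_length = feature_length(306672, 308041)
print(gene_length)  # 1370, matching the listed Gene Length
```

Under a 0-based, half-open convention the same coordinates would give 1369, so the agreement with the listed 1370 bp is consistent with the inclusive reading.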


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGTTATCCTA CGACGTGATA GCCGATTATG AACAAGAGAG TTGACCACGA TCATGCTGGA 
GTAATGAGAA CCAATCAGAG CTACAGGAAA GGAGGACTAG AGTCCGTTAA GCTGAAGTGC
TAAGGTCGGT GGCTGCATTG CAATATATTT GAAGGAGAAC CTTGCGCCAA TTTGGACCGT
CTTTATTTCC TTAAACCCCC GAATCATCGA TGGAAAGATT CTGCTCTTCA AATGTGCCAT
TAGCAGACGC TCTTCAGCTT CTTTCTAAGA ACGATCCGTG TGTACGAGAT TTACATCTGG
ACTGCTCATG GTCGCTGGAG GATGATGCAA TGCTCAAGAT TGTTGAAGCG ATAAGCATCC
ATTCCAAGCG CTATCCCAAG GACTCCGTAG TTCGAAGTAT TTCTACCGAG ATTCCCCGAT
CCATCGCACG AGTAGTTCTG TTTTGCTCGG CAATTTGCAG GCTCCCACAT GTGGAAGCCG
TCACGATTCG GCGTCGATCC GGAATCGCTT CGAAGTACCT AGGTCACTAT GTACTGTACC
ATATTCTGGA ACCGTTTGAG AGAATGGTAC ATCGCCTCCA GCGACTCGAG GTCTATGATC
CAATACAGTA CCTTTCATCA TCGACCGACT CAAGTACTGA GGTGGAAAAA ATCAACAGTT
TCATTCTAGC CGCTGTTAAC CTTGAAATAT TTGCCCTCAC GAGAATCTAC GCTCCTGGTT
CCAATAAGCT CTCGTCATTA ATTCAGGCTT TGATTGGACA GAAACGGCTT CGGATTTTAA
ATCTGCAGCT TTCGTCTTTC ACATCGGAGC CCAAAGTGAC ACCAGAAGAT CTTCAAGTGC
TTTGTCGATC AACGAGCTTG GAGCAGCTGA CTCTGATCAA TACAGGGATG ATCGACAGCC
ATTTAAAAAT TCTGTCCGAA GAGTTACGCT GGAATTTGGT TCTTAAAAAG ATCGATATCC
AACAAAATTG GCAGACTAGC GAAAATGGCT TTCTACATCT GCTAGAACTA ATGCAAAAGC
AGTTCGTCAT CGTCGAGTTC AATTTGGAAG AGGGCATGGC TTTACCGGGT TTCGAGCAAG
AAAATCAAGA TCGTCTTTAC GAGGAGTCTT GTGCAAGTTT CTTTTTCAAC AAGAGAACGG
TCGCCGCCAA GATTGAATCT TTCGCTCGAA TGAATCGTGC AGGTCGACAG CGTATACAAA
ACGATTCAAA CTATAGCCAT GACGACTGGA TCGATCTTAT TTCTCTTGTT GGCTATGATA
TTGATGCCAT TTTCTACATG ATTCTGCAGA AACCGGAGGT GTGCAATCGC GACCACAGGT
TGATCGCTAC GATTGGTGAC CGGTCAAGGA AAAGGAGGCG GATGGCTTAG
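
The gene sequence is displayed in blocks of ten bases separated by spaces, so it has to be joined before it can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that clean-up; `normalize_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first displayed line, not the full sequence.

```python
def normalize_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Illustrative input: the first line of the gene sequence shown above.
first_line = "CGTTATCCTA CGACGTGATA GCCGATTATG AACAAGAGAG TTGACCACGA TCATGCTGGA"
seq = normalize_sequence(first_line)
print(len(seq))  # 60 (six blocks of ten bases)
print(round(gc_percent(seq), 1))
```

Applied to the complete listing, `len` should return 1370 if the displayed sequence is complete, and `gc_percent` can be compared with the rounded 46% in the record.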
 
Protein sequence
MERFCSSNVP LADALQLLSK NDPCVRDLHL DCSWSLEDDA MLKIVEAISI HSKRYPKDSV 
VRSISTEIPR SIARVVLFCS AICRLPHVEA VTIRRRSGIA SKYLGHYVLY HILEPFERMV
HRLQRLEVYD PIQYLSSSTD SSTEVEKINS FILAAVNLEI FALTRIYAPG SNKLSSLIQA
LIGQKRLRIL NLQLSSFTSE PKVTPEDLQV LCRSTSLEQL TLINTGMIDS HLKILSEELR
WNLVLKKIDI QQNWQTSENG FLHLLELMQK QFVIVEFNLE EGMALPGFEQ ENQDRLYEES
CASFFFNKRT VAAKIESFAR MNRAGRQRIQ NDSNYSHDDW IDLISLVGYD IDAIFYMILQ
KPEVCNRDHR LIATIGDRSR KRRRMA
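
A 386 aa protein corresponds to 3 × (386 + 1) = 1161 nt of coding sequence including the stop codon, which is shorter than the 1370 bp gene length, so the translated region presumably begins at an internal ATG. The sketch below searches a sequence for ATG positions that are consistent with a given protein length. It assumes the standard stop codons (the Translation table field is blank in this record) and that the sequence is shown in the coding orientation; the function names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumption)


def cds_length_nt(protein_length_aa: int) -> int:
    """Coding length in nucleotides, counting one stop codon."""
    return 3 * (protein_length_aa + 1)


def candidate_cds_starts(gene_seq: str, protein_length_aa: int) -> list[int]:
    """Return 1-based ATG positions that open a reading frame of exactly the
    expected length, ending in a stop codon and with no earlier in-frame stop."""
    n = cds_length_nt(protein_length_aa)
    starts = []
    for i in range(0, len(gene_seq) - n + 1):
        window = gene_seq[i:i + n]
        if not window.startswith("ATG"):
            continue
        codons = [window[j:j + 3] for j in range(0, n, 3)]
        has_final_stop = codons[-1] in STOP_CODONS
        has_internal_stop = any(c in STOP_CODONS for c in codons[:-1])
        if has_final_stop and not has_internal_stop:
            starts.append(i + 1)
    return starts


# Toy example (not from the record): ATG AAA TTT TAG encodes a 3-residue peptide.
toy = "CCATGAAATTTTAGGG"
print(candidate_cds_starts(toy, 3))  # [3]
print(cds_length_nt(386))            # 1161
```

If the coding region of this gene runs to the last base of the listed sequence, 1370 − 1161 = 209 nt would lie upstream of the start codon; running `candidate_cds_starts` on the joined gene sequence with a protein length of 386 is one way to check that.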