Gene PHATRDRAFT_38809 details


Gene Information

Locus tag: PHATRDRAFT_38809
Symbol:
ID: 7203637
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011685
Strand:
Start bp: 258591
End bp: 259820
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002182803
Protein GI: 219125053
COG category:
COG ID:
TIGRFAM ID:
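
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The variable and function names are hypothetical and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 258591
end_bp = 259820
gene_length_bp = 1230
protein_length_aa = 409

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# so the protein is one residue shorter than the codon count.
codons = span // 3
coding_codons = codons - 1


def record_is_consistent():
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and span % 3 == 0
        and coding_codons == protein_length_aa
    )


print(span, codons, coding_codons, record_is_consistent())
```

Under these assumptions, the 1230 bp span corresponds to 410 codons, which matches the 409 aa protein plus one stop codon.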


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGTCG GAGTGTTCGT AAACTTTTGG GTAGCGATAC GGGTAAGCGA TCGGCAGCCT 
TCAGTGAGCA CAAGGGCAGA GTCTACGCAA TCGGTAATGT CCATGACTTT CGCCGTGTCG
GGATATGACG AATCTAGCTT GATACGTGAT AGATCGTTCG GCGTTTCCGT ATGGAACGAC
TCCACCACGC TACCAACTTG GATGAAGGAG TACTTTGATT GGCATCGAAC TCAACGGCAG
CTCCTCTTGA ACGAAACCAA CTACGGGCAG TTTACCTTTT TAGTCGTCCG ATGCTTGAAA
CACGACTTGA AATGCGGAGG GACGGCAGAT CGGCTCAAGC CTCTACCATT CTATGTGTTG
CTCGCTTCCC GTATGCACCG CATTTTACTG TTCCATTGGG AACGACCCTT TTTGTTGCAA
GAGTTTCTGG TACCACCTGT CGGTGGCGTA GATTGGCGGT TGCCAACCTC TTGGCGTCGT
GACCAATTCT CCGATACCAT CGAGATAAAG AACCTTAAGG ACATAACCCC GTATCTTGCA
AAGCCTCACT CGGGTCGATA TCGTAAGCCG TTGCCCGCTA TCGCATGCAT TTTGTATCAG
TCGCACGATC ACGGTGCTTT ACAATACAAC CAACTCGCCG TACAGCAAAC GAAGGAAGCA
ACCTACGAGG AAGTTTTCCG GGATTGCTGG AATTCCTTTT TTGTCCCTTC TCCACCAGTA
CAAAGTCGAA TCGACCAATT GCGCCAATCC CTCGGATTGG TCCCCAACGA GTACGTGGGA
GCCCACGTGC GGTCACAGTA TCATTCTTAC AACGGCAACA AGAAGTTAAA AGTTTTGGTA
CAAAACGCCG TTGCGTGTGC CTCTCGCCTT CGTACGGAGG TCCGGCAGAG CATTTACGTT
ACCGCCGATT CCGAGCGCGC ACTCCAGGTG GTCGGAGAAT CCTCTGTGGG AATGCGCAAT
CTACCGGTAG TCCGTCGGAA GGGCGATCGG CCGCCACTCC ATTTGGATCG TGGTGTTGCG
TACTTGGCCA AGAGTGCCAC CAATTGGACA CACCACGACG ATCCACGAGC CTACTACGAT
ATCTTTGTCG ACTTATACCT GTTGGCAGGG AGTCGTTGTA TTGCGTACAA CGTTGGCAAC
TACGGCAAGT GGGCCATCCT CCTTTCGTCG AATCGGTCCT GCACGATAAA TCACGGCAAG
ACAACGTGTC GTTGGAAAAT ATTATCGTAG
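
The gene sequence above is laid out in space-separated blocks of 10 bases over several lines. A minimal sketch of how such text could be joined into one string and its GC percentage computed is given below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not state how its own GC content figure was calculated.

```python
def clean_sequence(text):
    """Join a blocked, multi-line sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which removes both the block spaces and the line breaks.
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage with the final line of the gene sequence from the record.
last_line = clean_sequence("ACAACGTGTC GTTGGAAAAT ATTATCGTAG")
print(len(last_line), last_line[-3:])
```

Passing the full sequence text through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field in the record.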
 
Protein sequence
MGVGVFVNFW VAIRVSDRQP SVSTRAESTQ SVMSMTFAVS GYDESSLIRD RSFGVSVWND 
STTLPTWMKE YFDWHRTQRQ LLLNETNYGQ FTFLVVRCLK HDLKCGGTAD RLKPLPFYVL
LASRMHRILL FHWERPFLLQ EFLVPPVGGV DWRLPTSWRR DQFSDTIEIK NLKDITPYLA
KPHSGRYRKP LPAIACILYQ SHDHGALQYN QLAVQQTKEA TYEEVFRDCW NSFFVPSPPV
QSRIDQLRQS LGLVPNEYVG AHVRSQYHSY NGNKKLKVLV QNAVACASRL RTEVRQSIYV
TADSERALQV VGESSVGMRN LPVVRRKGDR PPLHLDRGVA YLAKSATNWT HHDDPRAYYD
IFVDLYLLAG SRCIAYNVGN YGKWAILLSS NRSCTINHGK TTCRWKILS