Gene PHATRDRAFT_16987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_16987 
Symbol 
ID7199293 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp291673 
End bp292986 
Gene Length1314 bp 
Protein Length404 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185464 
Protein GI219130630 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGACT CTCTGACGTT TCGAGACTTG GGTCTGAGTA CCGCCGCCTT GCGAGCGGTC 
AAGTCGCATC CTGATTGGAC TGCTCCGACG CTAGTGCAAC AACTGGTCAT TCCAAAGCTC
CTGGAGGACA TCGGTTCCCC ACGGAAGCGT TCCATCTGGT GCGAAGCTCC GACGGGTTCT
GGCAAAACGG CAGCGTACGG ACTGCCACTC TTGCAAAATA CACAAACAGC GTGCTTTCGG
GAACCAAACG CACTGATCCA AGGTGGGATT TCCTCCATAA TTATCCTTCC GACTCGAGAG
TTGGCAGTGC AAGTGGGCGT GGTCTTGTCG GAGCTTGCTC AGAATATGTC TCGGGGGGGA
TTCAATATTA TGGTTTTGTA CGGAGGAACT CCATTGCAAT CGCAAGTTGA TCGAATGGAT
GAGTACGCTC GTAGTGGAGA GACCATTCAT GCAGTGGTGG CTACACCTGG CAGGTTTCTA
GACGTGATGG CCCGTGTCGA ACACCCCACT TTACTGGACA ACCTACGCTA CCTTGTTTTG
GACGAAGCCG ACAAGCTAAT GGGTAACGGG TTCGCAAAGG AGCTGGACGG TGTCTTGAAT
CTGCTACCCC GCAAAGTCCC GACGTGGCTG TTTTCGGCGA CCTTTTCCAA GAGTATGGTT
CCTCAGGTGG CAGATGTAAT GAAGCGGCTC GAAATTGTGG AACCGCCCCT GCGGATCACC
TGTGCCAACT CGGATCGCCG GGCACCAGAT GAAACAGCCA GCTTACAGAA GCGTTTGAAG
CGTTTCGCTC AAGGTGAGGA AATGGAATTG GTTGGACCCG CTTCAACTAT TGATCTCCGG
ACGATTCGTT TGCACCAGCG AGACCGCACG CAAGTTTTGC GGAGCCTGTT GGAAGCGAAT
AAGGAATGGG ACCGAGTCTT GGTCTTTGTC GCTACGCGAT ACGCGTGCGA GCATGTTTCT
CGAAAGCTGC GCCGTCTCGG TATTCCGAGT AGCGATTTGC ACGGTAAGCA GGATCAAGAC
ATTCGTTCGC AACAGCTTGA AAGTTTCCGC AGGGGCCATA CACGAGTTCT CCTGGCAACC
GACTTGGCCT CCCGCGGCTT GGATGTGACT GCTTTGTCAG CGGTCGTTAA CTACGATCTC
CCCAGATCCT CTGCAGATTT TATACATCGG GTGGGACGAA CCGGGCGAGC AGGGTGTAAA
GGAGTAGCAG TGACCTTTCT TACAGCCGAT TCGGAGGCGC ATTTGAACTT GATTGAAAGT
CGTCATCTGG CGGAGCCCGT TGCACGAGAA ATCTACCCGG GTTTTGAGGT TGAC
 
Protein sequence
MADSLTFRDL GLSTAALRAV KSHPDWTAPT LVQQLVIPKL LEDIGSPRKR SIWCEAPTGS 
GKTAAYGLPL LQNTQTACFR EPNALIQGGI SSIIILPTRE LAVQVGVVLS ELAQNMSRGG
FNIMVLYGGT PLQSQVDRMD EYARSGETIH AVVATPGRFL DVMARVEHPT LLDNLRYLVL
DEADKLMGNG FAKELDGVLN LLPRKVPTWL FSATFSKSMV PQVADVMKRL EIEMELVGPA
STIDLRTIRL HQRDRTQVLR SLLEANKEWD RVLVFVATRY ACEHVSRKLR RLGIPSSDLH
GKQDQDIRSQ QLESFRRGHT RVLLATDLAS RGLDVTALSA VVNYDLPRSS ADFIHRVGRT
GRAGCKGVAV TFLTADSEAH LNLIESRHLA EPVAREIYPG FEVD