Gene PHATRDRAFT_40901 details


Gene Information

Locus tag: PHATRDRAFT_40901
Symbol:
ID: 7198736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 231174
End bp: 232391
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002184922
Protein GI: 219129493
COG category:
COG ID:
TIGRFAM ID:
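
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the unspliced CDS ends in a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 231174
end_bp = 232391

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# The gene is listed as unspliced, so the CDS is one contiguous run of codons.
codon_count = gene_length // 3

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)  # 1218 406 405
```

Under these assumptions the computed values agree with the listed Gene Length (1218 bp) and Protein Length (405 aa).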


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAGTCA ATGCATTTGA AGTCAGCGGC AAGATTGATT ACACCAAGCT GGTTGACAAA 
TTTGGATCCA ACCTCATTTC AGACTCTCTT ATGGATAAGC TGGAAGCGTT AACGGTTGGA
AAAGGCCGAG TTCCCCGGAT GCACCGCTTT TTACGCCGGG GAATGTTCTT CAGCCATAGA
GATCTCGATA CCTTGCTGCG TCAGGTAGAG GCCGGTGCTC CAATGTACCT TTACACTGGG
CGAGGGCCTA GTTCCCAATC GATGCATCTA GGGCATCTTA TACCCTTCCT TTTTACCAAA
TGGTTGCAAG ATGCCTTGGA CGTCCCGTTG GTCATCCAAA TGACGGATGA CGAAAAGTTC
TTATTTAAAG GACATTACGA TGACCAAACC GGCGACAATT TATTAGACTT TCAAAGTTTG
ACCATGGAAA ACGCCAGGGA TATTATTGCG TGCGGCTTTG ACTACAACAA GACCTTTTTG
TTTTCAGACT TGGATTATGT CGGTAGCATG TATCCAAACA TTGTTCGCAT CTGGAAGGCG
GTCACGACCA ATACGGTAAA CGGAATTTTC GGTTTCGATG GATCTTCAAA TATTGGCAAG
ATTGCTTTTC CCGCCATTCA AGCCGCGCCG TCTTTTGCCA GTAGTTTTCC AGTCGTCTTG
GAAGCTGACC GTAATTCCAA TCATTTGTGT CTGATCCCCT GCGCGATTGA CCAAGATCCT
TACTTCCGCA TGACGCGGGA TGTTGCGCAC AAACTAGTTC ATAAGCAACA TGGTCTCGGT
GGGAAACCGG CACTGATTCA CTCTAAATTT TTTCCTCCGT TGCAAGGCGC CGAAGGCAAA
ATGTCGAGCT CCAACACGAA CTCGGCTATA TTTTTGACGG ATTCGCCGGA TGACATTGAG
CGGAAAATTA AACAACACGC CTTTTCTGGT GGACGAGAAA CCAAAAAGGA ACAGCAAGAG
CTCGGAGCTG ACTTGGAGGT AGATGTGTCC TACCAATGGA TGCGGTTTTT CTTGGAAGAC
GACGACGAAT TGGAAAAGAT TGGCCAAGAT TACGGTAGCG GATCCGGCGA ATATTGGAAC
ACTGGCAAGG TGAAGGGGCG CCTGATCGAA ATTCTAAAGG AATTGGTAGC GGAGCATCAA
GAACGACGGG CAACAATTAC CGACGAAGAA GTTCGCAAAT GGATGGCTGA GCGTAGCATC
GTTAAGAACA GCACTTGA
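
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any calculation such as GC content. The following is a minimal sketch of that clean-up and of a GC percentage calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above; paste all rows to analyse the full CDS.
first_row = "ATGGTAGTCA ATGCATTTGA AGTCAGCGGC AAGATTGATT ACACCAAGCT GGTTGACAAA"
seq = clean_sequence(first_row)

print(len(seq))                    # 60 bases in the first row
print(round(gc_percent(seq), 1))   # GC percentage of this row only
```

Applied to the full 1218 bp sequence, the result can be compared with the GC content field listed in Gene Information.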
 
Protein sequence
MVVNAFEVSG KIDYTKLVDK FGSNLISDSL MDKLEALTVG KGRVPRMHRF LRRGMFFSHR 
DLDTLLRQVE AGAPMYLYTG RGPSSQSMHL GHLIPFLFTK WLQDALDVPL VIQMTDDEKF
LFKGHYDDQT GDNLLDFQSL TMENARDIIA CGFDYNKTFL FSDLDYVGSM YPNIVRIWKA
VTTNTVNGIF GFDGSSNIGK IAFPAIQAAP SFASSFPVVL EADRNSNHLC LIPCAIDQDP
YFRMTRDVAH KLVHKQHGLG GKPALIHSKF FPPLQGAEGK MSSSNTNSAI FLTDSPDDIE
RKIKQHAFSG GRETKKEQQE LGADLEVDVS YQWMRFFLED DDELEKIGQD YGSGSGEYWN
TGKVKGRLIE ILKELVAEHQ ERRATITDEE VRKWMAERSI VKNST
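
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is empty; `translate` is a hypothetical helper, and only the first and last few codons of the gene are shown.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence, spaces removed.
print(translate("ATGGTAGTCAATGCATTTGAAGTCAGCGGC"))  # MVVNAFEVSG
print(translate("GTTAAGAACAGCACTTGA"))              # VKNST
```

Both fragments match the start and end of the listed protein sequence, which supports the standard-code assumption for this gene.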