Gene PHATRDRAFT_39735 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39735 
Symbol 
ID7195450 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp575863 
End bp577153 
Gene Length1291 bp 
Protein Length311 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183636 
Protein GI219126798 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.326114 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGT TTTTGAAGTC TGCTCCAGCG TTGCTGGAAG ACCCCCGGGA GCTCCCACAA 
CCAGCTTCTG AAAAGCGGGG CTTCGGAGGA GCCAAAATTG CCACACGAAT CGAACCATCC
CGCTCCGAGC AGTTCCTATC GTGCCTCGTG AATTGCCAGC ATTTTCACCA GCTACTAGAG
CTGCTTGACT CCGAACCCCT TAGATTTTTC CACGAAATCA TGGAAGTTAT AAACACAAAT
GAAAGGGGAG AAATACAGAA TAGTATTGGT CTACACCATT CTGACGACAA CGAAAATTCG
ATTCAGATGC TGAGCTCTTT GCTCTCAAAA CGTGAGGACA AGGTCCCAAC GTATCGTGTC
AAGAAGGCAG CAGTCGAGAT CAATCATTCA AGTGTTGAAG ATGAAACTAC GGTTGCTCAT
ATTTCTTTTT CCAAGTTCCC CACTGATCTC CTGCAGCAAA TTGTTATAAC CTCGGAAGAA
AACCAAGACA GCTCGGATCA AGCGAAAGGT GTTAAGCAGC TTTCTCTCGA GTATCTTGTC
GAGCTAGAGA AGAACCCATG GCGCTACCAG AAGCTTTTTA ATTTTGGCGA TGGTGGGAAA
GCAGCCGAGT CTTTTGTATC AATTCCTGAT AAAGTCAAAC TTCGCGAGGC TATTGGTGAG
CGTCGTTACA ATGTTTGGAA ACTTTCTCTC GAACGCGATG GCGACGACAA TAGCACTATT
TCGGAAGGTG GCAGAACTAG AACGGGAAGC CAGTCATTGA ATTTGGGAAG TCTTTTAAAC
AACGGTACTG ATGAGATGAC ATCGTCTTTC TCTGCGTCGG CTTCGGTGAC TGCACCGCAG
GATTCCAGTC GCGATGCGTC TCTTAGAAAG GTATCTGATG CAAAACATAG AGACGTTGTT
CGACGTTGTC TTGAGCTTGC AAACTTCGCT ACCGGTAAAA GTGCAGGCAA AATTAGCGAG
AAAGAACTGT CAGTGATTTC GGAAGCGGAG ATTGCGCTTC GTAGCCCTTC AGCACGACAA
TATTTACTCA CAATCCTCGG AAAGCGGTGG CAAGGAGATG GGACCAAGAA CAACAACCTG
AAACCAAACC TTCGCTCTTC GAATACTGGC GAGAGGCTAG ATCGTCATGC CTTTGAAGTG
CTGGTTCGGA TTGGATGCAC TATGCTGGAT GCGTGCCTGG ACTTCAAGGA GTATGAGTCG
GCATATACTT TACTGAAGTA CACAGCTGGG CTATACACGA GCGCAAGCAC AGAAACTGCA
GTCGCAACCA ACTACGTCAC TGCTCGACTA A
 
Protein sequence
MEKFLKSAPA LLEDPRELPQ PASEKRGFGG AKIATRIEPS RSEQFLSCLV NCQHFHQLLE 
LLDSEPLRFF HEIMEVINTN ERGEIQNSIG LHHSDDNENS IQMLSSLLSK REDKVPTYRV
KKAAVEINHS SVEDETTVAH ISFSKFPTDL LQQIVITSEE NQDSSDQAKG VKQLSLEYLV
ELEKNPWRYQ KLFNFGDGGK AAESFVSIPD KVKLREAIGE RRYNVWKLSL ERDGDDNSTI
SEGGRTRTGS QSLNLGSLLN NGTDEMTSSF SASASVTAPQ DSSRDASLRK LGYTRAQAQK
LQSQPTTSLL D