Gene PHATRDRAFT_50512 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50512 
Symbol 
ID7199238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp273034 
End bp274245 
Gene Length1212 bp 
Protein Length372 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185407 
Protein GI219130511 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAATTCTGTC GTCACTACCA TATACGATTC CGATTTCGTT CACCCATAAC AGGTGCAATT 
GTTTTTCACT CGGCACCGTT TGTCTCGACC GCAATGTCTC CAGCAAACGA GAAAAAAGTG
TCCATGGATG CACTCGCCAA ACTAGCACAA TCTCTCCAAA AGGAGATTGC CGATGCGGAG
CCATCGCTCG AACGATGCCA AGATTTAATT TCTGCAACGG AAAAGGAAGG ATCAGCCGAA
ATTATAACTA TTGAAGCGTT GGATCGAACG AAACTCGGCA AGATTGTTAC CAAGGCGGTC
AAACACTTCC GTCGCGTGAA ACGGCACGGA AAGCAAGCAG ACTGGCAGTC GCTGCAGTCG
CGGGGCGAAA CGTTACTCGA GCAGTGGAAG CAAGCAGTGG CAAAAGAGGA GAGCCAAGGC
GAAGAAACTC CCAAAGCGAC GGGCGAAGCA GACGATGTCG CGTTGGATGG CAGTACGGGA
TTGCCAACTT CGCAATCGGC ATACCGAGCT CGATTAACGA AGCAAAAGAA AGAACTCTAC
AAAGATCCTC CAGAATTGCC TCCTCCACCG GTCAAGATTG AATCGGAGTA TTGCGGCCTA
CCAAAGCGTG ACTCCAAGAC TGCTTCTTTA ACGTTCGAAT GTGGTAAAAC GAGCAAGCAT
TTAGGGTCGT GGCTCCAAGA CTTTCATCCC AATCGCACTC CGGAAGAAAT TCTGCGGGCT
GGCGCCTTTG GGGGTACCTA CTTTCGATCC ATCGCCTCCG CCGTAACGAA TCAAACGTAT
TCATCGAATC AAGTATTAAA GGATACGGTC CCGGATGAAT GGATCGCAGG GCTAGACCGT
AAGAGTATGC TGACGTCCGC GACATATCGG CAGAACGTGA ATAAATACGG TTCCAAATGT
GGTGGATCAC TGGGAATGTG GGAAAGTAGT GGTTGGATTA CAGATGTTGA CCCGTACGGC
TGGTTTCAAT GGTACTGTCG CTTTTACCAA GGCCGACGTA GCTCTGACGA CCAAAGACAA
ATTTCACGAT GGTTAAAGAA CACGGGTCCG AAAGGTCGCT TTCGCTCGCA GCTGTGCAAC
AAGATTTTAG CCGCCAATAC CCGTCACGAT GACACCAGTA TTAGTCCAGT GATTCGTCAG
AATTTGTTAC ACTGGGGATT GGAGATTACG CCCGAAATAC TGGAGGCGCA CCGCAAACGG
ACCGGCAAGT AA
 
Protein sequence
MSPANEKKVS MDALAKLAQS LQKEIADAEP SLERCQDLIS ATEKEGSAEI ITIEALDRTK 
LGKIVTKAVK HFRRVKRHGK QADWQSLQSR GETLLEQWKQ AVAKEESQGE ETPKATGEAD
DVALDGSTGL PTSQSAYRAR LTKQKKELYK DPPELPPPPV KIESEYCGLP KRDSKTASLT
FECGKTSKHL GSWLQDFHPN RTPEEILRAG AFGGTYFRSI ASAVTNQTYS SNQVLKDTVP
DEWIAGLDRK SMLTSATYRQ NVNKYGSKCG GSLGMWESSG WITDVDPYGW FQWYCRFYQG
RRSSDDQRQI SRWLKNTGPK GRFRSQLCNK ILAANTRHDD TSISPVIRQN LLHWGLEITP
EILEAHRKRT GK