Gene PHATRDRAFT_50582 details

Gene Information

Locus tag: PHATRDRAFT_50582
Symbol:
ID: 7199363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011699
Strand:
Start bp: 188474
End bp: 189651
Gene Length: 1178 bp
Protein Length: 372 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002185539
Protein GI: 219130788
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAGGC TAGCAAATAA AAGTCGCATG TACGTTGCGC TGATTGCCTT GTGTTTCTTA 
GCGTGCCTAC AATGGAAACG GCTGTGTCTC GCAGGTCGGT CCGGGATTTT CGCGGAGGTG
AAAAGCCTTG GGCTGCCGTA TGGAGAAAAA GATTCCAGCG TTCCGGCTTA CCACGACAAG
GCGTGCTTGG ACAAAAAGAA GATCCTAGAG ATTTTGCTTA ATGGTGGGAA AAACATATCG
CAAATTGATT GTGCGTTACT CCCAGTATGG CAGAATATTG TCGAAAACTA CGGAGACGGA
CCGGTTGTCC ACGGCCTCGA CACTTGCGAG GCCTACCGAG AGAGCATCTC CAACAATTCT
CATCCAATGC TACGTGTGGC TGGACTGTTC AATTCTGGAA CTAACGCTAT GGCAGCTCTA
TTAGAGCTCA ACGTGGAAGA ATTGGGGGGA AACTTTCAAC ATGAAGTACC GTGGGGGAAG
CATCTACCAG TACGGTTTCG CAAGACCAAT AAATTTCCGC CAAGATCGAA AGTTGACTAC
CGACAAGTTA TGCCAGTAGT GATGGTGCGG GAACCGTTCC GGTGGATGAA GTCTATGTGC
AAAAGGAGTT ACGGCGTTGA ATGGATAAAG CCAAACAATT TTAGTCACTG CCCGATTTTG
ATTCCCAATG CTCACGATAA ACTGAAGCCC CGCTTTCGAG GAAAGAAAAC GGCAGAGACC
AAGGTCGGTC TTAACCAAGT GACAAACACG GTCCCCATTC CTGTAGAGCA TTACGATTCG
CTTGCTAGTG TATGGACAAC GTTTTATCAG GATTGGAATG ATGTCGACTT TCCGCATCTG
GTCGTGCGAT TTGAAGATAT ACTCTTTCAT GGTCCTCGAT TGCTAGAAAT TTTGAACGAC
TGTATTGGCA TGAACGCCAC TCAGAAGCCT TTTCGTCATA AGATGACACA AGCCAAGAAA
CACGGACGGT CCTCTGATTT TGCAACGGCT CTTGCACAAT ATGGACCCGG CTTTCAGAAA
CTTCGTGGCC TCACACAGGA CGATATAAAC TTTGCTAGAG TCGCTTTGAA CTCCTCCTTG
CTGGAAACCT TCCATTACCC GGCTGTACCT TTGTTCTAAG CAAATACGGG ATTTTCACCG
ATCCGATTAA GTATTGTTCT AACTGCTCTT GCATTGCG
 
Protein sequence
MQRLANKSRM YVALIALCFL ACLQWKRLCL AGRSGIFAEV KSLGLPYGEK DSSVPAYHDK 
ACLDKKKILE ILLNGGKNIS QIDCALLPVW QNIVENYGDG PVVHGLDTCE AYRESISNNS
HPMLRVAGLF NSGTNAMAAL LELNVEELGG NFQHEVPWGK HLPVRFRKTN KFPPRSKVDY
RQVMPVVMVR EPFRWMKSMC KRSYGVEWIK PNNFSHCPIL IPNAHDKLKP RFRGKKTAET
KVGLNQVTNT VPIPVEHYDS LASVWTTFYQ DWNDVDFPHL VVRFEDILFH GPRLLEILND
CIGMNATQKP FRHKMTQAKK HGRSSDFATA LAQYGPGFQK LRGLTQDDIN FARVALNSSL
LETFHYPAVP LF
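
The fields and sequences in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only and is not part of the database record: the helper names (span_length, clean, gc_fraction, translate) are hypothetical, and the standard genetic code (NCBI translation table 1) is an assumption, since the record leaves the Translation table field blank.

```python
from itertools import product

def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1

def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq_block.split()).upper()

def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)

# ASSUMPTION: standard genetic code (NCBI table 1); the record does not state one.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first in-frame stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Coordinates from the Gene Information table.
print(span_length(188474, 189651))

# First line of the gene sequence, copied from the record above.
first_line = clean("ATGCAAAGGC TAGCAAATAA AAGTCGCATG TACGTTGCGC TGATTGCCTT GTGTTTCTTA")
print(translate(first_line))
```

Applied to the coordinates above, span_length gives 1178, which matches the Gene Length field. Translating the first 60 bases gives MQRLANKSRMYVALIALCFL, the first 20 residues of the protein sequence. Passing the full gene sequence through clean before calling gc_fraction or translate would allow the GC content and Protein Length fields to be checked in the same way.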