Gene PHATRDRAFT_44529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44529 
Symbol 
ID7198065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp816033 
End bp817203 
Gene Length1171 bp 
Protein Length360 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178583 
Protein GI219115575 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCGTCCATT GCAACCCATC CAGCACTTTA AGCCAAGCCC TTCCTTGAAC AACAAACTTT 
CGAATCCAAA AAGCTTGCAA CGCAAACGAT GGGTCTCTTG TCGTTTCTCT TGTGCTTGAA
GGCACTACAG ACGGCAACGG CCTTTGGGAG GCAACCGCTT CGGACGATGC GTGCGTACGC
TCCACGGAGC TTACGAAGCA CTCCGTTGCC CTTGGCTGGG CAAGCTACGG AGCTGCCGGA
TTCCCTAGAA GACGCAGCAA CCCGTGCCGC GCAGGCGACG GCCAATTTTG CGTCCCAGTC
GGGGCCGTTG GCACGCTGCC GGGTAGATTT TGACACCAGC GTTGGCGACG AAACCTACAC
ACTCCTTAAA TCTAGTACGG AATTTATGCA AAATTTCGTT TCGGCTTTGT GCTACGCCAC
CATTCCGGGA TTGCAGGAGC AAAGACAGGC GGAAATGATG AAGGTTGCGG AAGCGCGAGC
GGAGCTTAAA GTGCTCTTCC AGGAAAACCC TGAAGACGAT CGCGTCCGTG AGCTTCAAAA
AATGCTTGCT GCGAATGGGC GTGATCCGGA CGCGGGTCCG TGGACAGGCC CGGTAGCGCG
TGTGTACTTT CCCGATGAAG GCAGTGCGGC TTTGGCACGC CGAGACTGGT TGGGGGTGGA
TCCGAAGGTA CCTCCGTGTG TCCAATTTGC TTCCTGTGGT GGAGTGCAAG TTGGCGACAT
TCAGAAAGAT CGTATTGTCT TTTTCTTTTG TCCAAAGGCG AGCGAAAGTG AATACGTGGA
AAAGATTCTA ATCAACACGG AAACGACAGC GACAGATTTG CAACTTTCTG TCTTGGTGAA
TCCAAACTTG GTGGATATGG GAGTCACGGG CTTTGGCATG GCGGGGCGGC GTCTGCGCGA
GCGTCTCATT GATCCCTTGC AGAATACTTA CTACCTACGA ACGTTGCCTT GGGGGGCACT
CACCCGGCTC TGGCCCCAGG CCTACAGCGT CTGGCAAGAA GATACCGACG CCGAAGGCGG
CTACCGCCTT ATTAAAACGT TGGGCCGCCT GCCGTCGAAC CCTGAAGTCG AAGATATCTA
CGATATTGAA AACGGAAACA TGGAAGCACG ACAGTCGGGC GGACCGCTGG ATCAGCTAGC
AGACTTTGTC AACGGTATGA TGCGATTGTA A
 
Protein sequence
MGLLSFLLCL KALQTATAFG RQPLRTMRAY APRSLRSTPL PLAGQATELP DSLEDAATRA 
AQATANFASQ SGPLARCRVD FDTSVGDETY TLLKSSTEFM QNFVSALCYA TIPGLQEQRQ
AEMMKVAEAR AELKVLFQEN PEDDRVRELQ KMLAANGRDP DAGPWTGPVA RVYFPDEGSA
ALARRDWLGV DPKVPPCVQF ASCGGVQVGD IQKDRIVFFF CPKASESEYV EKILINTETT
ATDLQLSVLV NPNLVDMGVT GFGMAGRRLR ERLIDPLQNT YYLRTLPWGA LTRLWPQAYS
VWQEDTDAEG GYRLIKTLGR LPSNPEVEDI YDIENGNMEA RQSGGPLDQL ADFVNGMMRL