Gene PHATRDRAFT_47239 details

Gene Information

Locus tag: PHATRDRAFT_47239
Symbol:
ID: 7202208
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 895262
End bp: 897098
Gene Length: 1837 bp
Protein Length: 564 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002181465
Protein GI: 219122254
COG category:
COG ID:
TIGRFAM ID:
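
The length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the Start/End coordinates are 1-based and inclusive, and that the gene span contains only coding exons, one stop codon, and any introns; neither assumption is stated in the record.

```python
# Values copied from the Gene Information fields.
START_BP = 895262
END_BP = 897098
PROTEIN_LENGTH_AA = 564

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = END_BP - START_BP + 1

# Assumption: the coding sequence is one codon per residue plus one stop codon.
cds_length_bp = (PROTEIN_LENGTH_AA + 1) * 3

# Whatever remains would be non-coding (intronic) sequence inside the gene span.
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, non_coding_bp)  # prints: 1837 1695 142
```

Under these assumptions the computed gene length matches the reported 1837 bp, and the 142 bp left over is consistent with the "Is gene spliced" field being Yes.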


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.403327
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGATCA ACAACGCCAC CGAGACCAAG TCTCGCAAGG ATCGTCCTGC TGGATCCATC 
CATTTCGGCG ACACCGACTT CGATGTCGAC GAGCAAGCTG GAGCCGCTTC GTGGGGCGAA
GTATGCCACG CTTGTTGTGT CCATTCTGGA CAGGAATGGG CCATGATCGC TCTTGGGATC
TTCCTGGTTT GCTTCTTCCT CTACTTCTTC CTGGTGGGTC TTGACTTGCT TGGAAACGGT
GCGAGGGTTA TGACTGGATG CACTGCCGGA GAGTTGTTTG GTGACGACAC TAATCCCATT
GCTGGTCTGA TGATTGGTAT CCTTGCGACC GTATTGCTCC AGTCTTCTTC TACGACGACC
TCGATTGTTG TGTCGTTGGT TGGTTCCGCT GTCTCTGTCC GTCAAGGAAT TTACATGATC
ATGGGAGCAA ACATTGGCAC TTCAGTAACC AACACGATTG TTGCTATGGG TCAGATGGGC
GATGGAGATC AGCTGGAGCG TGCTTTCGCT GGTGCCACTG TCCATGATAT GTTTAACTTT
TTGTCGGTGG CTGTACTACT CCCTGTAGAA GTCATCACAG GATATCTTTA TCGGCTTACC
AAGGCAATTG TCAAGGATGC CAATCTCGAA GACGGTGAAA GCTGGGATGG TCCCATCAGT
AAGCTGGTTG ATCCTCTTTC GGAAAAGATC ATCATTCCCA ATAGTAGTAT TACCCGGGCT
ATTGCTTTGG GTGACGCAAC CTGCAATGAC GGCGGCGGCT TCTACCCCAT GAATTGTACG
GAAGACACGT ATTTGGGTTG TGGCGGCGCA TTTGGTCTCA TTGCCTGTAG CAGCGATAGT
GGTAAATGCC CTGCTTTCTT TCAAGGTGAC GCTTCGGCAA GGGATGACAA GGTCTCTGGA
GGTGTTGTCT TTTTCATTGC TATTGTCGTC CTTTTTGTTT GTCTCGCTGG GCTTGTAACT
GTTCTTCAAA AGTTACTGCT TGGTATGTCC ACTCGCGTTG TCTACAAAGC CACTGATATA
AACGGATATC TTGCGATTGC TATTGGTACT GGTCTGACCA TGCTTGTGCA GTCCTCCTCC
ATTACTACGT CCACTTTGAC TCCGTTGGTT GGTATAGGAG CGCTTCGTCT TGAGCAAATG
TTGCCCCTTA CACTTGGTGC TAACATCGGT ACAACTCTGA CTGCCATTCT GTCTGCCCTC
GTGTCTGCCA GCAAGGATTC GCTCCAGGTT GCACTTGCCC ATTTGTTCTT TAACTTGACT
GGAATTCTCA TCTGGTACCC TGTGCCTTTC ATGCGTCGTG TCCCTCTCGG AGCTGCTCGT
AGACTTGGAA AATTGACGCG AATCTGGCGT GGTTTCCCCA TTCTTTACAT TGGAGTGATG
TTTTTTCTCA TTCCGCTTCT TCTGCTTGGC CTGTCGTCTC TTTTCGATGA TGGCAGCACT
GGTTTTACTG TCCTGGGATC CTTTCTTACC ATCCTTTTGT TCCTTACCAT TCTTTACGCT
GTCTACTGGT TCCGTTACAG AGACGGTCGG CAGAAGTGCT CAAACAGCAT GGCCCAGCGT
GAGAAGAATC GCGTCGTAAT GAAAGAACTC CCTGACGACA TGGTGTATCT AAAGGAACAC
ATAAAGCGTC TTATTGAACA CACTGGACTC CCCGAAGACG AGGATGTCCA GCCAAGTATG
AATCTCCTGA TACTTCGGAT GCTGAAGTTG ATACCTAAGC ATCGTGTGAG AGATCGATGT
CGCTGTGTAT CGGTTGGAGG ATCCCTCTTG ATTATTTGTT TAGTATATAC TACATACAAT
CTGTGCCAAG AGTATAGTTC TCGCAGATAC TGTTTGA
 
Protein sequence
MEINNATETK SRKDRPAGSI HFGDTDFDVD EQAGAASWGE VCHACCVHSG QEWAMIALGI 
FLVCFFLYFF LVGLDLLGNG ARVMTGCTAG ELFGDDTNPI AGLMIGILAT VLLQSSSTTT
SIVVSLVGSA VSVRQGIYMI MGANIGTSVT NTIVAMGQMG DGDQLERAFA GATVHDMFNF
LSVAVLLPVE VITGYLYRLT KAIVKDANLE DGESWDGPIS KLVDPLSEKI IIPNSSITRA
IALGDATCND GGGFYPMNCT EDTYLGCGGA FGLIACSSDS GKCPAFFQGD ASARDDKVSG
GVVFFIAIVV LFVCLAGLVT VLQKLLLGMS TRVVYKATDI NGYLAIAIGT GLTMLVQSSS
ITTSTLTPLV GIGALRLEQM LPLTLGANIG TTLTAILSAL VSASKDSLQV ALAHLFFNLT
GILIWYPVPF MRRVPLGAAR RLGKLTRIWR GFPILYIGVM FFLIPLLLLG LSSLFDDGST
GFTVLGSFLT ILLFLTILYA VYWFRYRDGR QKCSNSMAQR EKNRVVMKEL PDDMVYLKEH
IKRLIEHTGL PEDEDVQPIL ADTV
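
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence, which is displayed in space-separated blocks of 10 bases. The sketch below shows one way to do that; the helper name `gc_percent` is hypothetical, and how the database itself computed or rounded its figure is not stated in the record.

```python
def gc_percent(blocked_sequence: str) -> float:
    """Return the GC content (in percent) of a sequence written in spaced blocks."""
    # Remove the spaces and line breaks that are only there for display.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in bases)
    return 100.0 * gc_count / len(bases)


# Illustration only: the first displayed line of the gene sequence.
first_line = "ATGGAGATCA ACAACGCCAC CGAGACCAAG TCTCGCAAGG ATCGTCCTGC TGGATCCATC"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence as one multi-line string works the same way, since `str.split()` with no arguments discards all whitespace, including line breaks.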