Gene PHATRDRAFT_47751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47751 
Symbol 
ID7202736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp727817 
End bp730099 
Gene Length2283 bp 
Protein Length760 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181965 
Protein GI219123300 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGCAC CGCGTCTGGA TGGGAGCAAG CGACAAACTG TCGTCGATCG AAATGCTAGT 
ATGTCTTTGA TGGCGTCAAA GTCGCGACAA TTGTGTTTGT TGGGACTCGT TCTCGTCAGC
TGCATCGTGA ACGGTATTTA CTGGTCATCG GTGTTGGTCG GTGATCGATG GGGCGATGTG
GAGCTTAGTT TGCCTTCGCT GTCGGGGTTG TCCTCCTCGT CTGTATCCAC TTTGGCATCC
ACTCTGGCGC GTCCCGCAAC GCCTTGGTCC GTCTTTTACA ACGTCTACTT ACCGCTCCGT
AACGCGACGC ACGCGTTGAC CATTGTGGAG GAGCAACTGG AACAGATGAG TAGCTCGTAC
GGCGCTACCC GGCCAGGATT CGCTCCGGTT ACGGTACACG TCAATACGAT TGGCGATCCG
ATTGGAGCAG AAATGGTGCG GCAGTACTGT GCCGCACACA CCAATCTACA CTGCGTACAC
ATGCAACACT ACGCCGACGG CAATTTCGAA GACGTTACCT TGTCGCGCGT TCACCAGTTT
TGCCGAAACG CCGAGATGGA CGACACTCCG TCACACGATC CGGTCGTTAT CTACTTGCAC
TCCAAGGGGA CCTACCATCC CAGTGAAATG AACGATCGGT GGAGACGTTT CATGACCAGT
GCAGCCATGG GGGAAGAGTG CTTGGAGTTT CAAGGTCTAC CAGACCGAGA TACCCCGGAT
ATAGCGCCAC CGCACCTCTC GTCAACCGAT AATCCACAAT GCAACGTCTG CGGTTTCATG
TTTTATCCCA TATGGACACC GTTTTTTCCG GGTAATGTTT GGACCGCCAA GTGCAGCTAT
ATTCGAAAGC TCATGCCTCC ACAAGTCTTC AAGAAACAAT CCGATTCGAC GGCTGACAAA
GCCGTGGCTC TCATGAGAGA AGGACGTTTC ACCATGAACT TGTTGTCGCC GTATTTGGAG
GAGGGAGGGT ACTTGGGACG CGAACGATTC GCTGATGAGC ACTGGGTAGG GAGTCATCCG
TCGCTTGTGC CTTGCGACTT GGCTCCTCGA CCTCAACTAA TGAAATGGGT ACATCCGCGA
CTACGGAAAA AGTTCGGCGC ACCACCATTT AAGTGGTCGT TAGCACCGCG TCACGGAATC
AACGAAGACT GGTTGTTCTT GCAGGGGATT AAGAAGAAGT ACCGGCTGGT GGCCGACCCC
GATTGGCGTC ATCGAGAGTT CTTCTTGTTA CCGGGAGCAC TGTGGAAATG GATCACCTGG
TACGATGCGG TACCGCCGGA TTCTTCATAC GTGTGGAAGT GGTTTCCGGA CGGCCTAGAG
TGGCTAAAAC GAGCGCAAAC ACTGGGGACG CAAGGGTTGC TGACGTATTT GACACACGAC
ACCTATCTGC ACCCACCGGA TGAACCTTCG AGGGTTCATC CGTTTCAAAT CGCCACGACC
GATAAGAGAG GCGAACCAGT ACGAACCTTC TTTTATCACA TTCAGATACC TGTAAAGAAT
GTTCACAAGG CCGTCTTTCG TCGAGTCGTA TACGAACGAC TAGAAGCGAT CGGCTATTTA
TCCCCGGGAG CGACGGTCTT TTTTAATACC GTCGGAGATG CAAGTGTACT CGATGTGGAA
AAGTTAAAAG AATCCTGTGA GGAAATATAC GGGCTCAATT GCGTAAACAT GGAGCAGCTG
GATGCCGGGA TGGACATTAA AACGATGAGC CGAGTGTACG ATTTTTGTCG CATTCACAGC
TCTTTGCGAG TTGGCTACGT TCACACATTG GGAGGCACAG AACGATCAAG CACCCCTCGA
AACGAAGAAC GGCTACTGCA AACCAAGGCA ATAGCCACCA ACTTGTGTTG GAAAAGCACA
CAAGCGGATT GTGACGTTTG CTCACTAAAG ACGACGTCTC GAAATCCAAT TTGGGGTGCG
GATGCGGTTG GCAATCGGAA GAAGGTTTCA AAGCGCAGGA GAAACTTCAA GACTCTTCAG
ACAGCCAGTA GCCAGCCTGA CCCTATTCCT GCCATATGGA CGGCAAGTTG TGCCTACATC
GCCAGTCTCG ATTCGCCAAA CGACTTTGCG TCCAAGACGG CACAAGTCAA TTGGTCTCTT
AGTCAAACCA GCGGACAACG TTGGATTTTG AGCGATAACA GTGCCAATCG TTGCAAAGTG
GAGGTAGAGA CTGTCGCAAA ACTTGCGCAA CTCGCTAAGA ATAAACAAAG GGCTCTGAAA
TTATGGCGCG CAAGACAGAA GAGTACGACG GCAGACGAAG AAGTTGGCAC GGTGGAACAC
TGA
 
Protein sequence
MTAPRLDGSK RQTVVDRNAS MSLMASKSRQ LCLLGLVLVS CIVNGIYWSS VLVGDRWGDV 
ELSLPSLSGL SSSSVSTLAS TLARPATPWS VFYNVYLPLR NATHALTIVE EQLEQMSSSY
GATRPGFAPV TVHVNTIGDP IGAEMVRQYC AAHTNLHCVH MQHYADGNFE DVTLSRVHQF
CRNAEMDDTP SHDPVVIYLH SKGTYHPSEM NDRWRRFMTS AAMGEECLEF QGLPDRDTPD
IAPPHLSSTD NPQCNVCGFM FYPIWTPFFP GNVWTAKCSY IRKLMPPQVF KKQSDSTADK
AVALMREGRF TMNLLSPYLE EGGYLGRERF ADEHWVGSHP SLVPCDLAPR PQLMKWVHPR
LRKKFGAPPF KWSLAPRHGI NEDWLFLQGI KKKYRLVADP DWRHREFFLL PGALWKWITW
YDAVPPDSSY VWKWFPDGLE WLKRAQTLGT QGLLTYLTHD TYLHPPDEPS RVHPFQIATT
DKRGEPVRTF FYHIQIPVKN VHKAVFRRVV YERLEAIGYL SPGATVFFNT VGDASVLDVE
KLKESCEEIY GLNCVNMEQL DAGMDIKTMS RVYDFCRIHS SLRVGYVHTL GGTERSSTPR
NEERLLQTKA IATNLCWKST QADCDVCSLK TTSRNPIWGA DAVGNRKKVS KRRRNFKTLQ
TASSQPDPIP AIWTASCAYI ASLDSPNDFA SKTAQVNWSL SQTSGQRWIL SDNSANRCKV
EVETVAKLAQ LAKNKQRALK LWRARQKSTT ADEEVGTVEH