Gene PHATRDRAFT_47047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47047 
Symbol 
ID7202141 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp267563 
End bp271764 
Gene Length4202 bp 
Protein Length1325 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181172 
Protein GI219121644 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTCC TTGTGGTCAC AACGAAGATA TAGAGACGTT GTGGTCTTCC TGAGACAACT 
CCCCTTAAGT ATAAGATTCA TTTCATTCGG AGTAGAAAAG AGAGAATAGT AAGTAGTACG
AGTAGCTCGT CGACATTTGC TGATGTTGCT TGTGAAGAAA ACAATGACCC TGCAATAAAT
CATCAAGTTT GATTTACTGT CAATTGACGT GTGACAGCGA GTGCTCGCTG GAAACATAGT
GGAACCCCCC GCCCCTCCCA CCCCGAAAAC ACGACACGCC GCCGGCACCG GAACGACCAC
CGATAGGCCG GTCGACAGGA AAATTGCGGA CGAAAAGTTG CTTTCGGCAA AACCCCAATT
GACTGGGAAA CAGAAGCGTG AGGCTATCGA AACAAACAAC GGTCATCCGC GGAACACAGT
GTGGCTACTA GCGACCAATA CATCACAAGC CGCGCTCGTT TCCGTTGCCT CGTGCCGAGC
CAGCTCGGTG GGCGTCGGAA TCTCTGGGAA GACAAGCATT CGTCCGAGCT GCTGGACACA
CCACTACCTG ACAGGCAGGC AACACCGTGA GCGCTACCGT TCCATTCCGT ACGTGAGCTA
TGAGTCATCC ACAGCAAGAA CCTTGTGATG GAAATGCTAG CGAACATCGA CTGCCGGTAT
CGCCTCGGCT ACAATCGCCG TACCGAGATT TCGTGTCGCC AGGTCGGGAA TTCATCCATT
TCGACGAGGA AGAAGACAAC AGAGCGACGA TTTCCCGAGG CTTGAACAAG GCAGCATCGC
GTCGGATCAA ACGAGGAGAA TGCCCCGCTT GTGGAGCCAA ACTTTTCAAA AATAGTATGC
TGGGCAAGAA GAAGACTCCC TTGACCATTC CGGGGCAATC ATTGAACGGT CGCTGTCTAT
TTTGCTTTCC TATTGTCCGA GAAATCGTCG GATCACACTT GGATCGGTAT CCGGACGAAC
CAGGCAAAGC GGTCGTGCCT CGATTTCTGT CGGTCCCCAC TGACGATACT CCCGACGATG
GTACCGTCAT GTCTACCATT ACACTCGATC ATCATCTAGG ACAATTTGCG CAAGAAGGCG
AAAAGGTCGC CCCTCCACCC CGACTACCTC CACCGACCTA CCCTCCGCAA CAGGCTTCCC
AGAGTGATCA CATGCGCTGG CCGCCTGGGA CCGACCGCAG TACAGCTCCC GTACTTACAC
CACCTGCTTC ATCGCGAAGA CGTAGATCGA GCTTTGACAA TGATGAAGCT GACGAAAACG
ATCGAGGGCC GGCACTACCC TGCCGATCAG TGAGTCCGTC ACGGTTTAGT GAGCTCGACC
CGTTGAATTC GGAGTTCCGG GCTAATCCAA TGGTCGCACC GTTACGGAGA GCCTCGAATA
ATTTCGCTAC GTTCATGGAT CACGACCCAC AGCACGAGTC CAACAGCATC CAACCCCGTC
GAGTACAACC CAGTTCCGCC CGTGCACGGG TGGAATGGGA CGGCTACCCC CTCACTGTTC
AGTCACACCA GCATAGCGGA GGGTGCTTAT TCCCGAGTCC ACATCCTCGC CCATTCTGGA
AAAGTCAACC CCGAACTCTA CAGACCAGTA AAGAAGCCGA CGACGAGAAG GAGGAAAAAT
CGGAGATTCA ATCACCTGCC CATGCGCATG ATCCCCCGTG GCAACCATCC GAGCTGAAGC
CAAAAGGGAG TCCCCGGGAA ATTGTTTTTG ATTCGTCCCG GAGTCCCTAC GACCCTCCCG
AACCGGCTAG CGTGAACTAC AACACCCTAT GGCAACAAAT GTCGAAGTCT ACGCCCATCG
CTCATGAAGA AATTGTAGTC GACCCAAACA CATTGCCTTT GGATGATGGT AACGCATCTG
CTTTGAAGCA AGAAAGGAAA CCAATCGTTG ACAGCGCCGA CCATAGAGTT CGGAACGGCA
GTGACCCTGG GTGGGAATAT TCTATGGCCT CGTTTGCCTA CCATCAGGAT TCTATCTCCT
TCCAAAATAG CAGTTTTAGT TTTCTGTCGC ACACGAAGTC AACATCGCCA CAGTCCGCCA
CCCAAACTTC ACCTAATTTG GAATTGGCAG TATCGTTGGA GGACATTCCT CCCTTGGTAA
AGACTTTGCG TGATGATGTA CCCAACACGC ACGAATCGGC GTTGCGGCAC TTAACTGCGA
CCGTTTGGCA GTGCGGGAAC TTGGCCCGTC AAGCAGTGAT TGAAGCAGGT GGCATTCCAG
TTTGGACCAT GATTGTTTGG CAAGATATGA ACGACGAGGC CATCCAAGTG GCTGCGCTCG
ACCTAATTTT CGCCGTGGCA ATTGGAGATT GCATTGATGC CACTTACGAT TATTTGGCCA
ATGATACGTT TGATTATGCT GTAGATGCCT TGCTTATTAT GATGCAATCA CTAATTCACA
ACGAAGATGT CCAGACTTTT GGCTGTCGAG TGCTGGCCTG TTTGGCAGGT GCTTCCGGTC
GTAACGCTAA AGTCAACGAC GGGGCTTTGT CAGGTGCGGT GCTTACTGTC TTGCGAGCAA
TTGATTCCCA CAAGCATTCC TTGTCTCTTC GGGAGTGGGG AATCCGTGCA TTGTATCAGC
AGTGTGCGTT GTCGAAAGAC ATGAATAGCA ACAGAAAAGC GTTAGTGGAA GCGAAGCTGG
ACCACAATAC AAGTGGACTA GACGTGATCT CGTATTGCTT GGACGAGGTG GGGTCAAATG
CTGTCATGGC TGAATGGATA TGCAATCTTT ATTGGTGTCT TTCTTCAAGT CAAGAGATCG
CCCAGATTCA AGTACCGGCG ACAGAACCGC TATTGGAAAT GACGAATATC GTACGAAAAT
ATCAGAAGAG CCGAGGGTCG GTACTGCTTC TACAGGCGGC GCTAGCAGCT ATTTCAAATC
TATGCATGCT TGCGGAAAAT CGGAAAGGTC TAGATACCAC TGAGGTGGTT CTCCTTGCTT
TGGAGGTGCT CGATTTTCAT CAAGGATGCT CCAGTGTCGC TGTGGAGGTG TGCGGTCTCA
TAGCGAGTCT CCTTCCTACA GTACGAAGCA CAGAATGCAT CCCCGCAGGA TCTATTCAGA
CCCTGTGTTC AATCTTGGAG TTCCCTCGAG ATTTAAAGAT GAACAGAGAA GCCCTACGAG
CATTGAATGC TGTGTTGGCT TCTTCTACTC TTGCACGAAA GCGTCTTTGG GAGACGACGT
CGCTATCCTG GCTCACAGAG GTCTCCCGTC TTCATTGCAA TTCGGTCGAA TGGCAGGCAT
TGAGTTGTGT CATGCTTTCA AATCTGTTAA TTGCGAATGA ATTGGACACA GGCGAAACAG
AGAAGTGGAT TCTCTCTGAG CTGTATTTGA TAATATCGCG GTGTACCGAT GCACTAAAAG
TCCAGGAGGT GGGAATTAGT ATCCTTTCAA AAATTTCGAA AGATGAGACC CTTTCGACAC
TGTTGGACGA GGAAAGTTTG AAGCTGGTTG TTGATATGAT GTCGAAGTTT CCGCTCTCAA
AAATTATACA GAGAAAAGGC TCCTTCCTCA TCTTGAACGT AGCCCGTGGT CCCATAATAC
TCAAGCACTT GTCAGTCGCA GAGCGATGCG CGAGCTCACT AGTTATTACA CTGCAAAACC
ATCTTGAAAC GTCCGATATT ATCGGATTCG CTTGCGATGC TATTTGGGTA CTTATCCACG
GTTCCGACAT GCTTAAGGAA ACCGTTGTAA CACAAGGTGG CATTGATGCT CTGTCTTGCG
CTCTGGTGTT GCATCAAAAC GAAGTTAGTA TTTTGGAAAA GGCTTGCGGT GTACTGTCAT
GCTTGAGCTC AAGAGAGTCT CATATTCAAA CAGTGGTCAA CGCACAGAGT GTTTTTAACG
TTGTTGATGC TATGCGGAAC AACCCTAATT CCGCTTCACT TACTCAGTAT GGATGTTTGT
TGCTAAAAAA TGTCATCGTT ACAAGCAGGG AGCAGTCAAT ATTGGCCTCT GGTGCAATTA
GTGTTGTAAC TGCGGCAATG TTGAAGCATC CCCACGAGAG TGGCATGCAA AGAGAAGCAT
GCAGTTTTCT CTGGGCCATT ACGTCGGCAT CAGGCGACTG TAAATCGAAA GTGCTCGCAT
TGGATGCGGT ATCCCTTTTG ATGACGGCGC TATCAAGTGA TAAAAAGGAT GTTCAAGACG
CCGCTCGTGG CGCCTTCAAT ACTATTGCTC TCACATCCAA TGAGAGTCTT TCTGCTGTAT
AA
 
Protein sequence
MDVLVRVLAG NIVEPPAPPT PKTRHAAGTG TTTDRPVDRK IADEKLLSAK PQLTGKQKRE 
AIETNNGHPR NTVWLLATNT SQAALVSVAS CRASSVGVGI SGKTSIRPSC WTHHYLTGRQ
HRERYRSIPY QEPCDGNASE HRLPVSPRLQ SPYRDFVSPG REFIHFDEEE DNRATISRGL
NKAASRRIKR GECPACGAKL FKNSMLGKKK TPLTIPGQSL NGRCLFCFPI VREIVGSHLD
RYPDEPGKAV VPRFLSVPTD DTPDDGTVMS TITLDHHLGQ FAQEGEKVAP PPRLPPPTYP
PQQASQSDHM RWPPGTDRST APVLTPPASS RRRRSSFDND EADENDRGPA LPCRSVSPSR
FSELDPLNSE FRANPMVAPL RRASNNFATF MDHDPQHESN SIQPRRVQPS SARARVEWDG
YPLTVQSHQH SGGCLFPSPH PRPFWKSQPR TLQTSKEADD EKEEKSEIQS PAHAHDPPWQ
PSELKPKGSP REIVFDSSRS PYDPPEPASV NYNTLWQQMS KSTPIAHEEI VVDPNTLPLD
DGNASALKQE RKPIVDSADH RVRNGSDPGW EYSMASFAYH QDSISFQNSS FSFLSHTKST
SPQSATQTSP NLELAVSLED IPPLVKTLRD DVPNTHESAL RHLTATVWQC GNLARQAVIE
AGGIPVWTMI VWQDMNDEAI QVAALDLIFA VAIGDCIDAT YDYLANDTFD YAVDALLIMM
QSLIHNEDVQ TFGCRVLACL AGASGRNAKV NDGALSGAVL TVLRAIDSHK HSLSLREWGI
RALYQQCALS KDMNSNRKAL VEAKLDHNTS GLDVISYCLD EVGSNAVMAE WICNLYWCLS
SSQEIAQIQV PATEPLLEMT NIVRKYQKSR GSVLLLQAAL AAISNLCMLA ENRKGLDTTE
VVLLALEVLD FHQGCSSVAV EVCGLIASLL PTVRSTECIP AGSIQTLCSI LEFPRDLKMN
REALRALNAV LASSTLARKR LWETTSLSWL TEVSRLHCNS VEWQALSCVM LSNLLIANEL
DTGETEKWIL SELYLIISRC TDALKVQEVG ISILSKISKD ETLSTLLDEE SLKLVVDMMS
KFPLSKIIQR KGSFLILNVA RGPIILKHLS VAERCASSLV ITLQNHLETS DIIGFACDAI
WVLIHGSDML KETVVTQGGI DALSCALVLH QNEVSILEKA CGVLSCLSSR ESHIQTVVNA
QSVFNVVDAM RNNPNSASLT QYGCLLLKNV IVTSREQSIL ASGAISVVTA AMLKHPHESG
MQREACSFLW AITSASGDCK SKVLALDAVS LLMTALSSDK KDVQDAARGA FNTIALTSNE
SLSAV