Gene PHATRDRAFT_47886 details

Gene Information

Locus tag: PHATRDRAFT_47886
Symbol:
ID: 7203154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 343064
End bp: 345099
Gene Length: 2036 bp
Protein Length: 528 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002182376
Protein GI: 219124155
COG category:
COG ID:
TIGRFAM ID:
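
The listed gene length follows from the start and end coordinates if they are read as 1-based and inclusive, which is an assumption here, although it is consistent with the values in this record (345099 - 343064 + 1 = 2036). A minimal Python sketch of that arithmetic, with hypothetical helper names that are not part of the source database, might look like this:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(blocked: str) -> str:
    """Strip the spaces and line breaks used to display a sequence in blocks of 10."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in sequence) / len(sequence)


# Coordinates copied from this record.
print(span_length(343064, 345099))  # 2036
```

Passing the gene sequence block from this record through clean_sequence and then gc_fraction would be one way to cross-check the reported GC content.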


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0086312
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CATGTTTGGA CTTGATGGTT TTGGTGGCTC CCAAAGTGGG TTCGCGGTGG ACGGAGACAA 
AACTGGAGTA GTGATTGTAG AAGATTGTGT TATGTCTAGA ACAATTGCTG GCCTGTTAGT
GGCCTATACT ATGCTCTTTT CCAATTGGTG AATACCAACC TGAAGCAGAC AAAGAGCTCC
AGACAGTAAT TCTTAGACTA TTTTTTCTTT CCATGTTACT AGAACCATGG AATGGCTACC
ATGGTTGCAC TGGCAAAGGA AAGTAGCGAC GGCTGACTTA CGTACCATTT GTGGCATCTG
ATGACATGCC AAAGTGTCGC TTGTGGCTTT GAACACAATT CTAACAACAA TCGCTCTGAC
TAGGCCATAG CTCTTGTGGC TATTGTCACC GTACTTTCAC AATTGGTGCC AGAAAATCTC
AAACTGTCGT CGAAGTGAGA TTAACAGTAA TGGACATGAA CAATGATAGT TTTCAACCGT
TGTCCAAGCG CCAGAAGACA TCTGATGTCG AGAAGAGCGA CGAAGACCTT TCAAATTTAG
TTCATGGTGA TGAGTTTGCT CTTGTAGGCC GAGAACAAGA GCTGGATGCT CGGCGAAGAC
GCGCAGACAG AAAACTACGG CTTCACAGTG CTCGGCAAAA AGAATCAGGT GAAGGAACAG
ACAGAAACGA CGACACGGAC AAGGATTTCT TATTGCCTGT GAGATATGCT CAAAATGATC
AGCTCCAGGC AACAGCAACT TTAAGCAACA AGGACATGGA AACAGTTTAT CAGCCGGTCG
ACAAGGACAA TTTTGACATG TTTTCCAGCT CTCCGTCTCC GCAGGACAAT ATGGATTACC
AAACGAACTC AGCGACATCC AAGTCGAAGC GAGGAAATGA ACAAGGGGAT TGGGACGACA
GCGAAGGATA TTACAAAGCT GTAATTGGTG AAATTATTCG TTTAGAACAA ACGATAGACT
CCGCTTCTGG CAATTCCAGC AGATCAGAGA TAAGCTTCCG AATTTCAGGA GTTGTTGGGA
AAGGGGTCTT CTCCACCGTT CTCAAATGTA CAACTGTAAG CAATAGTAGC AGTATCCAGC
TTCCTCCTAC AGTTGCCTTG AAGTTCATCC GGCACAACGA CACGATGGCG AAGGCCGCAT
TGAACGAGGT GCAAACTTTA CAGCGCTTAA AGGGATGCGA CGGTATTGTT CCATTGCTGT
TACCACTTAC AGAAACTCCG ATGGAACACC GAGGACATGT TGTCTTGGCG TTTTCATGTA
TGGAATACAA TCTCCGAGAC GTGCTTCAGA AATTCGGGAA AGGTGTCGGT CTTTCACTAC
AAGCAGTTCG ATCATATTTC GGCCAGCTTC TGGCTGCCGC AACGCATCTA AAGAAGAACA
ACATAATACA CGCAGATTTG AAGCCGGATA ACATTCTCGT AAGCGCCGAT TTTTCTTTTG
TTCAGCTTGC GGATTTCGGT TCAGCTATTG ATGCATCGGA GTCCCAACAA AACCAGCCGA
CGCCGTACCT GGTATCACGT TTTTACCGCG CGCCCGAAAT AATTCTTGGT TTGACTCCGA
CATTTGCGGT TGATTTGTGG AGCTTGTCGG TCACGGCAGC CGAGTTGTTT CTAGGAGAAG
TTTTGTTCAA GGGGTCTTCA AACAATGACA TGCTCTACAG CTTTATGCAG CACATGGGGC
CAATTTCCAA TCGCATTATC CGTCAGCATT TGGCAGGATG CCAGCGCTTT CCAATTTCGA
AACAATTTTC TCAGGAAGGA GCAAGCTTCC TTTTTAAGCA GCAGACAACA GATCCCGTGA
CTGGTCGGCA TGTACACAGG ATGTTGTCGC TTGCAACCTC AAGTAACGGA GGGAGGTTTC
CGTCGGCCAC TCCTTTACAT CGCGTGTTGT TGAGGGCAAA GTCCACAAAA GACAATCGCA
TTGTGGTCAA TCGATTTTCA GATCTTCTAG TGGGATGCCT CAGTCTGGAT CCGTCCAGAA
GGATGAGTTT AAAAGAAACT TTGCAGCACT CTTTCTTCCA GCTTGAAAAC TCGTAA
 
Protein sequence
MDMNNDSFQP LSKRQKTSDV EKSDEDLSNL VHGDEFALVG REQELDARRR RADRKLRLHS 
ARQKESGEGT DRNDDTDKDF LLPVRYAQND QLQATATLSN KDMETVYQPV DKDNFDMFSS
SPSPQDNMDY QTNSATSKSK RGNEQGDWDD SEGYYKAVIG EIIRLEQTID SASGNSSRSE
ISFRISGVVG KGVFSTVLKC TTVSNSSSIQ LPPTVALKFI RHNDTMAKAA LNEVQTLQRL
KGCDGIVPLL LPLTETPMEH RGHVVLAFSC MEYNLRDVLQ KFGKGVGLSL QAVRSYFGQL
LAAATHLKKN NIIHADLKPD NILVSADFSF VQLADFGSAI DASESQQNQP TPYLVSRFYR
APEIILGLTP TFAVDLWSLS VTAAELFLGE VLFKGSSNND MLYSFMQHMG PISNRIIRQH
LAGCQRFPIS KQFSQEGASF LFKQQTTDPV TGRHVHRMLS LATSSNGGRF PSATPLHRVL
LRAKSTKDNR IVVNRFSDLL VGCLSLDPSR RMSLKETLQH SFFQLENS