Gene PHATRDRAFT_31816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31816 
Symbol 
ID7196133 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp998301 
End bp999965 
Gene Length1665 bp 
Protein Length554 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176704 
Protein GI219109902 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0623295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATTTG CTCAAGAAGC CGGTGACTCG ACAGAAATTG TCCGCGCCAT ATACGAAGCG 
TTGTCCACGT TGCCAGACGT GTCTCTGCTT TCCCAACCTG AAACAGACAA AGAGGAGGCC
TCGCCACCAA GCATCACTTT TCAACCAGAG ATATGCAGAT CAACGTCGTC AAACCTGTCT
ATCGGCAAAC GATACAATGC ACCTTGTGAG CAGCCGAGCC CTTTCGAAAT TTCTTTGTCT
ACTTTCCCTG CTTTCCTTTC TTTTGTGTTG GACTTTCTTG GAGATCCGGT AGCGGTTTGT
CGGTTCAAGA TGGTAAACCG TTTGTGTTTG AAATACGTGG ATGAGCATGA GCATGTCCTT
ATGAGAGACG CCGTCCGTCT TGGAGGCCTG CAGATTAATG TACGTCCGAG CTTCTGGTTA
TGGGTTACGC TTGAAAAAGA CAGCAGGGAA TCTTGCCATA GGAAAGAGGA CCCCGACGAC
AATGTCATAA TGGGAGCGCG AAACGAATTG ACGCTCCTAG AGCGGAAAGG CAGGGAAGGC
AAGTGGAACA ACGTGATCCA TAGAGATGTG TTACGGTCGT TTGGCAACTT ACCTCCGCAC
AAATCTGGGG CCCGCCTACG TACGGACTCG ATTGTCCGTG CGCTTGCAAC TTGGGGCAGG
AGTAGAATAC TGAAGAGTGG TATTCGGGGA TCCGGCGACC CGCCGCCGCC TTCGCATTCC
TTTTATGAAG AAGATGATGA CGATGTCAGT TTGGCTCCGA CGGACACCGT TAGCGACTGG
GGTGCCGTTT CCCCTGTCGG AAGCGTTACA GGCTCATTCT GCAGCACACG ACTAGACGGT
CGTCATACCA AAAAGTTCGA GAGACAAGCT GAAGCCCAGG AGCTCGCGCT TGGTGGAAGC
GCACTGTCAG ATATCGCAAA AGCTCGATTA CAGGAAAAGT TGAGCTTCAT TCTCCATGTA
CTTGCAGCTA CTTATAGCAA TGTAGGATAT TGCCAGGGAA TGGATTATGT TGCAGCTCAT
CTTCTTCGAA TCCTGGAAGA CACTATTCGT TGGAAGGCTG TCACAGGAAA CCTTCCTTCT
GTAATTCAAT TCGAGCCGTC AAGCATGATT GGTCACGACA ATCCCGATCA ATCGCTATCG
GATATGTATG CGGAAGTCGA TAAAAGTTCG GTGGTCGAAG AAACATGTTT TCGAGTCATG
GATTCGTTCT TTACTACGTA CGGTCTTCGA CATTTCTACT GGCCAGAGTT GCGGTGTCTG
AAGACGTGCT GCTTGGTCTT CGAGAAACTT GTCCAAATCA AACTCCCAGT TCTTGCTGAC
CACTTCGAGC ATCACGAGCT GAACATCGGA CTGTTTGCAT TGGGATGGTT TCAGACACTA
TTTTTGTACT TGCCATCTAT GCCTGCAGCA ACTGTTTGTC ACATGTGGGA CATTTGGTTG
GTTGAGAGGA ACTTCAAAAT CTTCTTTAGA GTCGGCACTG CTATTTTGTT TCTTTGCCAG
CCTGTTCTTT TGAACAATGA GCTCGAAGGT ATGATGGGAT ATCTCAATAC TTTCCCTGAT
GCTACACTGT TAAGCCCAGA TATTCTCATT GCATGCGCAT TGCAAATCAA AGTTACAAAT
CGAATGCTCA CACAGATTGA ATGCGACTTA TATCAAAAGT TGTAA
 
Protein sequence
MIFAQEAGDS TEIVRAIYEA LSTLPDVSLL SQPETDKEEA SPPSITFQPE ICRSTSSNLS 
IGKRYNAPCE QPSPFEISLS TFPAFLSFVL DFLGDPVAVC RFKMVNRLCL KYVDEHEHVL
MRDAVRLGGL QINVRPSFWL WVTLEKDSRE SCHRKEDPDD NVIMGARNEL TLLERKGREG
KWNNVIHRDV LRSFGNLPPH KSGARLRTDS IVRALATWGR SRILKSGIRG SGDPPPPSHS
FYEEDDDDVS LAPTDTVSDW GAVSPVGSVT GSFCSTRLDG RHTKKFERQA EAQELALGGS
ALSDIAKARL QEKLSFILHV LAATYSNVGY CQGMDYVAAH LLRILEDTIR WKAVTGNLPS
VIQFEPSSMI GHDNPDQSLS DMYAEVDKSS VVEETCFRVM DSFFTTYGLR HFYWPELRCL
KTCCLVFEKL VQIKLPVLAD HFEHHELNIG LFALGWFQTL FLYLPSMPAA TVCHMWDIWL
VERNFKIFFR VGTAILFLCQ PVLLNNELEG MMGYLNTFPD ATLLSPDILI ACALQIKVTN
RMLTQIECDL YQKL