Gene PHATRDRAFT_14998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_14998 
Symbol 
ID7203731 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp71943 
End bp73889 
Gene Length1947 bp 
Protein Length582 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182764 
Protein GI219124971 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAGATTCGCG AGCAAATGGA GCAAGAAGAG CGATCCCGAG CCATTCAGCA AGCGATGGAA 
ATGTTGCAAA CGACCGAAAC CAATACGGTC GAATCTATGA TGGAAAAGTC CGCTGACAAT
GGCGCGGACG TGCACTTTAC CAATTTGGAC TTGCCCAATT TACGCGGCGG GGGTCAACCT
TTGCTGCAAA ACGCTAACAT TACCTTTTCT CGAGGTCGAC GATACGGACT CATGGGACGG
AACGGTTGCG GAAAGACCAC ATTGCTGACC TTTATGGCTA GTCGACAAAT GGAAGGAGCC
GTTCCGAAGC ACATGAATAT GGTCCTCGTA CGTCAGGAAA TCATGGGCAA CAAATGGACG
GCCGTCGAAA CAGTTCTCAA GAGTGATGTC AAACGAGAAT CGGTTAAGCG CTTTATTGCC
TACTGTGAAG AAGAATTGGA AAAACTGGAT CAAGGCAACA AGAACCCAAC CATCGAAGAT
GCCGACGACC GCGGCACGAA CGATGAAAGC AAAGGCAAGA ATGACGAAAG CAAAGGCCGA
CAAAAGCTTC GGGAGCGCAA ACGACAAAAC CTGCAAAAGT CTGCTCGCAA GGCAGCGGAA
TCTTCCACCA CAGCACAAAT GCAAGAGTCG AAAGATGCAC AGCGACTCAA GCTCAACGAA
AAGCTGGGAT TGGCCTATCA GCGTTTGGCA CAGGTCGAAG AAGAGGAAGG CGGGGATCCG
GAACCGCGCG CACGCAAAGT ATTGGCTGGC CTCGGATTTG CAAAAGAAAT GCAAGATAAG
CCCACTGATG AACTTTCTGG AGGATGGCGG ATGCGGGTAT CGATTTCGTG TGCGCTTTTC
GCAAATCCAT CGTTATTGTT GCTCGACGAA CCGACAAATC ATTTGGATTT GGAAAGTGTT
CTCTGGTTGG AGCGATATTT GACAACCACG TTTTCTGGTA CGCTTGTGGT AGTCTCGCAC
GATCGGCACT TTTTGAACGA AGTGGTTACG GATGTCGTAC ATTTCCATCG CAGCCAATTG
ACCACTTATC GTGGAGATAT ATCCAGCTTT GAAGCAGTAC GGGATGACGA TCGTTTGCGG
CAACAACGCC AGCGTGAGCA GCAAGAAGCA AAGCGAGCAC ATCTGCAGAA GTACATTGAT
TTACACGCAC AAGCCGGTGA GAATGGTGTC AAGGCTGCTC GTCAACGAAA AAGTAAGATG
AAGAAGCTTG ACAAACTTGG AGTCATGGCA CAGGACGGGA AGAAGTGGAA GGCGTCGTAC
GATGGCGATG CTGAAGAGGT TGAAGAAGTA CTCGACGACG AAGAAGTCAT ACTGAATTTT
CCTGATCCGG GGGCTTTCGA TGGTGACATT GTACGTTTGG AGCAAGTCAA GTTTGGGTAT
TCAGCCCAAA ATATTTTACT AGAGACTGTC GATTTGACTG TCAATCTTAA GTCTCGAATT
GCTCTACTCG GTCGCAACGG ATGTGGAAAG TCAACCTTGA TCAAGCTGGC GGTTGGGGCA
TTACAGTCGA TGCAAGGCAA GGTCGTTATC GATCCCGGTG CCAAAATCGA GTACTTGGCG
CAGCATCAAC TGGAGCAACT CGATCCCGAC GGTACTCCTT TGCAAACGAT GGTAGACCGA
TATCCTGGAG ATCACAGCAA CACTCATATT GGTGAGCTAC GCCGATATCT TGCAAACTTT
GGCCTAGGCG GGGAGATCTT GCCCGTCCAA AAGATTCACA CTATGTCGGG AGGTCAGAAA
TGCCGCGTTT GTCTGGCTTG CGCTATGTAC CGCAAACCAC ACTTGCTGAT CCTGGATGAA
CCGACGAATC ACTTGGATCT CGAAACAACA GCAGCTCTAA TTGACGCCAT CAAAACGTTT
CAGGGAGGCG TGCTCTTGGT CAGTCACGAC CAGCACTTGT TGACTTCCGT ATGTGAAGAT
TTGCTGGTAG TCGAAAACGG AAGAGTG
 
Protein sequence
EIREQMEQEE RSRAIQQAME MLQTTETNTV ESMMEKSADN GADVHFTNLD LPNLRGGGQP 
LLQNANITFS RGRRYGLMGR NGCGKTTLLT FMASRQMEGA VPKHMNMVLV RQEIMGNKWT
AVETVLKSDV KRESLRERKR QNLQKSARKA AESSTTAQMQ ESKDAQRLKL NEKLGLAYQR
LAQVEEEEGG DPEPRARKVL AGLGFAKEMQ DKPTDELSGG WRMRVSISCA LFANPSLLLL
DEPTNHLDLE SVLWLERYLT TTFSGTLVVV SHDRHFLNEV VTDVVHFHRS QLTTYRGDIS
SFEAVRDDDR LRQQRQREQQ EAKRAHLQKY IDLHAQAGEN GVKAARQRKS KMKKLDKLGV
EEVLDDEEVI LNFPDPGAFD GDIVRLEQVK FGYSAQNILL ETVDLTVNLK SRIALLGRNG
CGKSTLIKLA VGALQSMQGK VVIDPGAKIE YLAQHQLEQL DPDGTPLQTM VDRYPGDHSN
THIGELRRYL ANFGLGGEIL PVQKIHTMSG GQKCRVCLAC AMYRKPHLLI LDEPTNHLDL
ETTAALIDAI KTFQGGVLLV SHDQHLLTSV CEDLLVVENG RV