Gene PHATRDRAFT_28568 details


Gene Information

Locus tag: PHATRDRAFT_28568
Symbol:
ID: 7201976
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 827192
End bp: 830271
Gene Length: 3080 bp
Protein Length: 707 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002181449
Protein GI: 219122221
COG category:
COG ID:
TIGRFAM ID:
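
The reported gene length is consistent with the start and end coordinates if they are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python check, with `span_length` as an illustrative helper name that is not part of the record:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

# Values copied from the record above.
gene_length = span_length(827192, 830271)

# A 707 aa protein needs 707 codons, excluding the stop codon.
codon_bp = 707 * 3

# Bases in the span that do not encode the protein.
non_coding_bp = gene_length - codon_bp
```

The coding portion is shorter than the genomic span, which fits a spliced gene. The difference would be made up of introns, the stop codon, and any flanking sequence included in the span.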


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.43424
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCAAAAGGAT TGACGGCAGC CTTGTTGCCC TTTCAAGTCG AAGGCGCATC GTGGATGTAT 
CACCAAGAAA CGCAAAAACC AGAGATTCGC GGAGGGATCC TAGCCGACGA AATGGGAATG
GTACGCCTGG TGTGCAGGTT GCCGTGCCAC GGAAAACTTT ATTGCGCTGT ACCTGCTGTG
CGTCGTTCGG GTCCACCAAA CGTCTTTTGC CTAACCTGAC CTTTTTATTC CTTTCTATCT
AGGGGAAAAC AGTGCAAACA ATCGCTACCG TGCTGGACAA TCGTCCAAAA CTACAGCACA
GTCGTCCCGG AGCTAAACAT CCGCCCTCAC TTCCCGATGT CACCGAACGG CAGTTGGAAG
AGACACTTTG GAACCGAGCA GTCTCGGATT GGAAGCACGA AATGGACATG TGCAACGTTC
CGCCCAAGAT GCGTCCCCAC AAGTATGCCG CGGCTCGCGC CGGCACCCTC GTGGTCTGTC
CCGTCATTGC CCTGCATCAG TGGAAAACCG AGATCGAAAA GTTTACCGAG CTAGATACCC
TGTCGGTGGG CATATATCAT GGTCCAAATC GGGCTACGGA TATGCCACCC GAACTGATGC
AAAAATACGA TGTTGTGCTC ACCACGTACC AGGTTTTGGA ACAGGATTTT CGCAAAATGA
TGTCGCCCAA TAAAATCAGC TGTCCCAATT GTGGGGGCAA GTTCAAAGTC GACAAGCTGC
GAGTTCATCT CAAGTACTTT TGTGGAGATG GCGCTGAGCG TACCGAAGCG CAAGCACGTC
AACATCGTGC CCGTGATCGG GACGAGAACG GTAGTGGTCG GGGTAATACC AATCGTGGTA
TTGGTGGTGC AAGGGGCAAG AAAGATAAGG TTAAAAAGCC TCTGACCCCA ACAAAGAAGC
ATTTGTCCAC CAAGAGTGTG GCAAAAACCA AGCAGGCGAC GAGACGAACA ATTCGGGTAA
AGAGTTCGGG AGACTACGAA TCCGACAGTG AACTTTCGTT GGACGAACCG TTTCTGGCAA
CTCCGCCGCA ATCAGGCCGT CCATCGCGAT CAGCAGCTTC AAAAGCTTCG AAACGCATGT
CCAAGACGCT CAAGGAATGG GGTCGGGAGG GGCGTAACGA CAACGATGAA AGCAGCTTCG
GCTTTGTTAG CGAGGGGGGA GACAGCGACT CATCCGATGA AGATATTCCG CCAGTGACCG
CTGCGAAACT TAAATCCGTC GCCAAGAAAC GGACAGTGAG CCAACGAGAA TCGTCTCATG
AGAGTGCGCT GGATCGCGCT TGCGAAAAGC AACGCAAAGC GATGGACAAT GTCAAAAAGC
AAAAGACCGG GAAAAAGAAA ACGTTGGGCA AGAAAGGCAA GAAGAAATTC GATAATGAGG
GGTCGTCTGA ATCAGATTCC GAAGGGAAAG CAAGCGATCC CATTAATGAT ATCGATATGA
ATGAGTTGAT GAAGGAAGCC ATGGTGGGTT CGCGCTTTAG TGTGCTCCAC AGCTTCTGTT
GGTGGCGAAT TATCCTTGAC GAGGCCCATT TTATTAAATC ACGATCGAGT CAAACTGCCG
CTTCTGCGTT CTCACTGTCG GCTATTCATC GTTGGTGTCT GTCGGGAACG CCACTCCAGA
ACCGTGTTGG AGAATTGTAC TCGTTGATTC GCTTTCTCCG AATCGATCCC ATGGCGCATT
ACTTCTGCAA AGCGAAAGGA TGCGATTGCA AATCAATTCA TTACCGCATC AAAGACGGCA
AGTGCCAGGA CTGTAGTCAC CACGCCTTTT CACATTACGC ACATTTTAAC CGGTACGTCC
TGAATCCTAT TCAGCGAGAT GGGTACAGCG GTGACGGACG TCGAGCTATG TTCAAATTGA
AGAACGAAGT TCTCGACAAA TCCTTGCTAC GTAGAACGAA AGAAACTCGG GCAGAAGATA
TGAATTTGCC GCCACGACTG GTGACGATTC GACCCATTCG TCTACATCCA GTCGAGCAAG
ATTTTTACGA TGCTCTCTAC ATGAACACTA AGGCTTCCTT TAATGACTAC GTTGATGAAG
GAACCTTGCT GAACAATTAT GCGCACATCT TCGATCTTTT GACAAAAATG CGCCAAGCGG
TCGATCATCC GTACATGATT GTTCACTCTA AAAAGAATAC CGAGAAGCGG CGATTGGAGC
AGGGAGCTCC AGTCGCGAAC GGATCGGTGG ACTGTGATAT CTGTCATGAA TCTCCAACGG
AGCGTGTCGT CAGCTCTTGT TGCGGTTCTG GCTTTTGCCG TGAGTGTGTG GTTGAATACC
TCACCGGCGC CGGTGGTGGG AGCACCCCGT GCCCTTCCTG CCAATCCCCC TTTTCCATCG
ACCTCAACCA GGCGAGTACT GAAGCACCAG TGGATGACGG TACGCTCGCG TATGGTGTCA
GAGAGTCGCA GAAAAGTGTC GATTGTTCAT CAATTCCGTC GTTGAAAGAG CTGCAGCATG
TTCCTTCGGG TTCTATTTTA CGACGGATCA ATCTAGCCGA GTTCGCCACA TCGTCGAAGA
TTGAGGTCTT GGTCCAAGAG CTCGTTGCTA TGCGCAAGGG TCGGCCAGGT AGCAAAGCCC
TCGTGTTCTC CCAGTTCGTC AACATGCTGG ACCTCACTCG CTGGCGCATC CATTCCGATC
CCTGCTTAGC TGACTTAGGT CTCGGGGTTC GAATATTGCA CGGTGGAATG GACGTCAAGT
CTCGCGATGC TACCCTTCAA GCATTCCGAG AAGATCCGAG CGTCCGAGTT TTACTCATGT
CGCTGAAGGC TGGCGGTGTT GCACTGAACT TGACCGTCGC TTCGGAAGTG TATCTGTTAG
ATAATTGGTG GAATCCAGCT GCAGAAATGC AGGCAATTGA TCGTACTCAT CGTCTCGGAC
AGTACCGTCC AATTCGCGCT GTGCGATTCA TTGCGGAGGG CACTGTGGAA GAGCGCGTGT
TGCAACTGCA GGAAAAGAAA AGGTTGGTGT TCGACGGTAC CGTGGGCCGA GATGCTGGCT
CTTTGAAAAT GTTGACGGTA CACGATATGA AAGCCCTTTT TACTTGAGTT TTAGTTATAG
CCAGGGAATT ATGAGTTATG
 
Protein sequence
MYHQETQKPE IRGGILADEM GMVRLHEMDM CNVPPKMRPH KYAAARAGTL VVCPVIALHQ 
WKTEIEKFTE LDTLSVGIYH GPNRATDMPP ELMQKYDVVL TTYQVLEQDF RKMMSPNKIS
CPNCGGKFKV DKLRVHLKYF CGDGAERTEA QARQHRARDR DENGSGRGNT NRGIGGARGK
KDKVKKPLTP TKKHLSTKTM VGSRFSVLHS FCWWRIILDE AHFIKSRSSQ TAASAFSLSA
IHRWCLSGTP LQNRVGELYS LIRFLRIDPM AHYFCKAKGC DCKSIHYRIK DGKCQDCSHH
AFSHYAHFNR YVLNPIQRDG YSGDGRRAMF KLKNEVLDKS LLRRTKETRA EDMNLPPRLV
TIRPIRLHPV EQDFYDALYM NTKASFNDYV DEGTLLNNYA HIFDLLTKMR QAVDHPYMIV
HSKKNTEKRR LEQGAPVANG SVDCDICHES PTERVVSSCC GSGFCRECVV EYLTGAGGGS
TPCPSCQSPF SIDLNQASTE APVDDGTLAY GHVPSGSILR RINLAEFATS SKIEVLVQEL
VAMRKGRPGS KALVFSQFVN MLDLTRWRIH SDPCLADLGL GVRILHGGMD VKSRDATLQA
FREDPSVRVL LMSLKAGGVA LNLTVASEVY LLDNWWNPAA EMQAIDRTHR LGQYRPIRAV
RFIAEGTVEE RVLQLQEKKR LVFDGTVGRD AGSLKMLTVH DMKALFT
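
Both sequences above are displayed in blocks of 10 residues, 60 per line. The following is a minimal sketch for stripping that layout and recomputing a length and GC fraction. The function names are illustrative, and the sample input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)

# First two 10-base blocks of the gene sequence shown above.
sample = clean_sequence("CCAAAAGGAT TGACGGCAGC")
sample_length = len(sample)
sample_gc = gc_fraction(sample)
```

Applied to the full gene or protein sequence, the cleaned length can be compared with the Gene Length and Protein Length fields, and the GC fraction of the gene sequence with the GC content field.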