Gene PHATRDRAFT_48220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48220 
Symbol 
ID7203339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp551733 
End bp554858 
Gene Length3126 bp 
Protein Length1041 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182709 
Protein GI219124853 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.313872 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAACT TTCCCCCTCC ACCACCACCA CCGCCTTCTA GGCCACTGGG CGTCTCTCCA 
TCAGCTCCAG CTGTTCTCGA GACGTCGTCG GCAACGTCCC ATTACAGTAC GAGAAGAGGG
CCGCTGAATG GTGGAGCCTC AGCGTCCGCG CTCCCACAGT CCAACCTTTC CACACACGGC
AGCACAAGAT TTCCACCGCC GCCTTCGCAC AGCGTCACGA CCAATGGATC CGTTCCGTCG
CACGATGCTC GGACTACACC TCCGTCCGCA CACACTCCGG TACCGCCTCC ATCGATCCAT
CGAATCGCTC ATCCGGGCAT GGTTTCTCAG CCGCCGTCCC GGCAGAATCC ACCACTTCCG
TTCGCACCTC CCGCATCTGT ACCTCCGACT GCGAGTCGGA CACTGCAACC AGCCTATCCC
GTTCCACCGC AACCCTACTC GTCAAATCCT CTTTCACCCC CGCTTCCATC GTCACGTCCT
CCCAACACTA CGCACCCGCA CCCATCGCCT CCACCACCGC AGCCCGGGTA CTACCAACCG
TCGCACACGC AACAGCCTCC GCCTGGTTTA CGGAAAATCG ATCCCTCACA AATTCCTAGA
GCTCCACTCT TTACCCGCCC GCAAGAATCC CAGTTGCCGG TATATTTTCC GCGAGCTGCC
GTCCTCAATG GCGAAACGGC CCAAAACCCC CCACCCGCCG ACTCGCGATA CATTGTCAAA
GACGACGGTA ACGCTTCGCC TAATCTCGTA CGCGCCAGCG TCTACGCCTT TCCCCTTACA
CGTGCTCTCT GGCATCAAAC CGGCGATCTA CCACTCGGAA TCCTCGCTAC CCCCTTGGCC
TGTCACGACG AAACCTTCGT GCCACGTCCC CGCGTCCTCC CCGACCGTTC CGTCCAAGAC
TGGCGTGATC CACAACGTAT CCCGTGCGTG GATGCCCGGG AACCGTCCCC GCCACCGCGC
TGTGGACACT GTCACGCCTA CGCCAATCCG TTTTTTGGAA CGGATGGATC TTGTAATCTC
TGTGGTACCA GCAATCGCGG GATTGCCGCC AACCTGACGG GTCCGGCCAT GCAGTGCGGT
ACGGTAGATT ATCACGTTTC CGGACCCTAC GTCACCCGCC CACAGCCCGT GCCGCCCGTG
TTTGTGTACG CCGTGGATTT AACGTGTCCG CATGTCACGC AGTATCTACC CATTCTGGCA
CAATTGGGGG AAGACCTGGC GACTCACGTC GGGAATCAGT ACGCACCGCT GACGCCACGC
ATTGGTCTCT GTTGGGTATC TTCGGCCGGA ATTTTGGTGG CCGGGCACCA CGACCGGGAA
CGCTACTCCG TCATGGCGGA TGTGACCAAC GCACCCTTTT GCCCTCTACC CCTCAACGAT
TGGACGTTTG ATGTGTCGGT ACCGGAAGGA TTGGCATCCT GGAAGGCCTA CTTGGATGGC
CTACTCCAGA ACGATCTCGA GGATCTGCGG AAATTGGCGC GTGCGAAAAA TGCGTACGGC
TTGGACGGTA TGGAACTATC GTGTGGTGGA GCGGCGCTGG CCTTTCTTGC TGACGCCTTG
GCGGCGACGG GGGGTCGCGG TACCTTGATC ACCCGACGAC GACCGAATTT TGGCGTCGGT
AGCCTTTCGG TACGGGAACC TGTTCCGGGT AAAGCGCACG ATCCGGACAA TATCATATCC
TATAGTCCGC TACAAACCGC CTCCAAGTTG AAGCATACAG AAGATGCGGC GGCTTCTTCG
TTTTATCAGG AGCTAGCCGC GAAATGTTGC CAGGACCGAA CCTGTTTGGA TGTTTTATAC
CACACTAGCC CGCTCACACC ACCCGCGTAC TTGGATCTCG CCACACTCGG CGAGTTGTGC
CGGAATACTT GTGGAAAATT GTTCCACGTT TCGAACAAAG ACTGGAATCC GATCATTCTG
GAAGAACTGA GAGCCCAAGT CTTTTCCTTT ACGGGATGGG ATGCGGTGTT TAAGGTTCGA
TGTTCGGACG GAATTCAAAT CAAATCCTTT CCAACCCATG TCGGTAACCT AGTCGATAAC
GGATTGGGTA GCTCGTCGGA AATCGAATTG TCCTGCGTGA CGCCGAACAC GTGCATCGCA
GTGGAGCTTG AGCATCGCGT GGGTGGTGTA CCCCCCAAAA ATCGGTACGT GTACATACAA
ACGGCCTTGC TCTACTCAAC AATATCGGGC TGCCGCCGTG TGCGGGTCTC CACCTTGGCC
ATCCGTAGTT CTACTGTGGT AGATGAAGTA TTCCGATCGG TTGACATGGG AACTGCTTCT
GCTCTACTAG CCAGAGAAGC GCTGGATCGT ATGAAGAAGC TAGTAAGGGA GAAGGAAGGA
GACGCGGCCC GTGAAAAAGC CCGTGACTTG GTGTTCCATC GGTGTCTGGA AATTTTGCTC
AACTACCGCA CAAATTCGTC GGCCGCAAAC TCATCCGCAC GGCAAATGGT CCTCCCAGAA
AAGTTGCAGT TGTTTCCACT CTACTGTATG TGCTTGATGA AGAGTCCGAT CTTTCGCCCG
GGCATGGCCC GTCGCGATGC ACAAACTCAA GCTGTCCGCA TGTCGCCCAC GGGTGATGAC
AGGGCACTAT TCGTACATTA TCTGGCCAAC GTAAGTTCCA GTACCAGCAT GCTTATGGTG
CATCCCAACA TTTTTTCCGT CTTGGGAAAC GAAAGTGGTA CTGCGGAGTT CGAGTCGCAT
CATGGACCGG AGCAAGTTGG GTTTGTAAGA ATGCCACAGC CCATTTTGCC GAGTATGGCT
AGTCTTGAAG ACGATGGTGT GTACCTCCTG GATAGCGGCC TACAGATTTT TTTCTATGTT
GGAAAGACTG CGCCGGATGA AATAAAGGAG ATGGCACGTA GTCACCAAAT CGATCAAGCA
GAACTGCTTC ACAATTTTGT CTGGCAAATG AGGACGTTCA ACGGCACAAA TCAAGGAAGC
GAAGGTTCCG TCCGGCCAAC TCATGTGCCT GTTGTGTCAA TTATACAGCA GGACGGTCAC
GATGCTCCAA TGGAAGCGGA TGTTCTCAAT CTTTTGGTGG ATGATGCGGT TTCTGGGGAG
AAAGACTACA ATGATTTTCT GTGTGGATTG CATCAGCGCA TTCAAGACAG ACTCAAAGCC
AAGTAG
 
Protein sequence
MTNFPPPPPP PPSRPLGVSP SAPAVLETSS ATSHYSTRRG PLNGGASASA LPQSNLSTHG 
STRFPPPPSH SVTTNGSVPS HDARTTPPSA HTPVPPPSIH RIAHPGMVSQ PPSRQNPPLP
FAPPASVPPT ASRTLQPAYP VPPQPYSSNP LSPPLPSSRP PNTTHPHPSP PPPQPGYYQP
SHTQQPPPGL RKIDPSQIPR APLFTRPQES QLPVYFPRAA VLNGETAQNP PPADSRYIVK
DDGNASPNLV RASVYAFPLT RALWHQTGDL PLGILATPLA CHDETFVPRP RVLPDRSVQD
WRDPQRIPCV DAREPSPPPR CGHCHAYANP FFGTDGSCNL CGTSNRGIAA NLTGPAMQCG
TVDYHVSGPY VTRPQPVPPV FVYAVDLTCP HVTQYLPILA QLGEDLATHV GNQYAPLTPR
IGLCWVSSAG ILVAGHHDRE RYSVMADVTN APFCPLPLND WTFDVSVPEG LASWKAYLDG
LLQNDLEDLR KLARAKNAYG LDGMELSCGG AALAFLADAL AATGGRGTLI TRRRPNFGVG
SLSVREPVPG KAHDPDNIIS YSPLQTASKL KHTEDAAASS FYQELAAKCC QDRTCLDVLY
HTSPLTPPAY LDLATLGELC RNTCGKLFHV SNKDWNPIIL EELRAQVFSF TGWDAVFKVR
CSDGIQIKSF PTHVGNLVDN GLGSSSEIEL SCVTPNTCIA VELEHRVGGV PPKNRYVYIQ
TALLYSTISG CRRVRVSTLA IRSSTVVDEV FRSVDMGTAS ALLAREALDR MKKLVREKEG
DAAREKARDL VFHRCLEILL NYRTNSSAAN SSARQMVLPE KLQLFPLYCM CLMKSPIFRP
GMARRDAQTQ AVRMSPTGDD RALFVHYLAN VSSSTSMLMV HPNIFSVLGN ESGTAEFESH
HGPEQVGFVR MPQPILPSMA SLEDDGVYLL DSGLQIFFYV GKTAPDEIKE MARSHQIDQA
ELLHNFVWQM RTFNGTNQGS EGSVRPTHVP VVSIIQQDGH DAPMEADVLN LLVDDAVSGE
KDYNDFLCGL HQRIQDRLKA K