Gene PHATR_46821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_46821 
Symbol 
ID7204678 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp552284 
End bp554083 
Gene Length1800 bp 
Protein Length543 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185726 
Protein GI219120989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.24642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATATGC GACCGTCACC CAAGGCAAAT CGGTGCGTCT CCAATTCCCG TTCGACCTCT 
CCCCAAACCG CACACGCCCA TCCGCTCGCA TTGCTGACTA ACATGGTGCA AGGGATTGAG
ACCATTTCGT TCCCTACCAA CGCCGCCGGC ATGGCAATCA ATCGCCGGTC TCCTCGTGCG
GATCGTAGCT CGACTCGTAC GTCCCTACAA CGAGACGCGG CGTCGTCGTC GTCGTCGTCA
GCAGCGGGGA ACCGTCGATT CCTGCCCGTC ACGTCCGCTT ATCTCAGATA TCAAACGATG
CACACCTTTG TCGTGAGTCA ACCGATGAGC GCTGCCGTAC TTTTACTCCT CGTTGTGGCG
GCACTCGACG GTTTTTACGA AAGTGTACGC GTACGGCACC CGTACAATTT CGCACACGGG
TTGTCCCCCG CGTCGGGACT TTCCAAACTC CCCTCGTCGT CTCTCCGCCG GACGGCAACG
ACCGGTGCGA ACGGATCCGA ACTACCGACA CCACTGCTAG CCAAGCATTC GACTACGGCT
ACGACTGCAG CCAATGACAC GGCTTCGATT CCTACCGCCG ATAATGCACC CCCAACAACC
CGAGTTCAAT CGGCTTTCGC GGACAACAGC GTCAGCGACA ATCCGTCGTG CCTCGGCAAG
GAGCATTTGG TACAGATTCT GATCGCCGCG GGGAAAACTC CGGAGGAAGC CGAGGCACAG
TGTCCCACCC TACCTCTCTG GCAAGAAGTG GTGGATCTGT ACGGCGACGC CCCCATCATA
CTGGGGCAGG AGCGGTGTCA GGCCTACCGG GAAGCCGTGC GCAATCAGTT CGAAGACGCA
CCTCCTCTGC ACAACATTCG CGTGGACGGA CTCTTCAACG TGGGTACCAA CGCTCTGGCA
CAGAACAATT TGTTGAATCT TGAACACGGA CGGTACTTTC AACCCAATTT GTCCTTGGAC
GATCCGGACT ATGCGGAAAA GCTGGGCGTG CTCCTCTTCG TGGGTTGGGG CAAACATTCC
ATGATCAAGT ACAAGCCAAC TAATGCGAGG TTACAGTTAC CCGTAGTACT CGTCCGAGAT
CCCTACCGGT GGATGAAGAG TATGGTGCGT AAGCATTTGT TTGATGAAAG AGAATTCTAT
CGTTTGTGTT TGTGTATATA TATATACATG TGTGTGTGAT TGTGACCGCG GAAGTGTTCT
ACTTTTTGTA CATGGTTCAA CAGAATAGGC CTACCGACTG ACACGAATGG CTCTGCCAAC
CCTTCCATAC AGTGCAAAAC TCCGTACCGG GCTGTTTTTG ACCAACAACC CAATCATTGC
CCCAACCTTG TCCCTACCGA GGACGAACAA CTCGCCTCCG GCAATTTAAC TACCTACAAA
GTTTCGGTCA CTCAGAATGC CCACAGTACG GTGACGGACG AATTCGATTC CCTCGCCGAC
TACTGGTCGG AATGGAACCG CATGTACCGG GACGTGGAGT TCCCGCGGCT GATTGTACGC
TTTGAGGACA CCATTTTTCA CGCGGAAGCC GTCATGGACG CGATTGCCCG TTGCGCGGGC
GTGGAACGGG CAAAACCTTA CCGTTACTAC GTAGAACAAG CCAAATCCCA CGGTCTCAGC
TCCAATTTTG TGACAGCCTT GGCCAAGTAC GGGACAAGCC AAGGACGGTT CGACGGCATG
ACTCCAGCCG ACCTCGCTTA CGCCCGGACC CATCTAGATC CGGCACTCAT GAATACGTTT
GGTTACCAGT ACCAAGGCAG GTATACTGCG CCGCAAGTTT CATTGGCCAA AGAGTCTTAG
 
Protein sequence
MHMRPSPKAN RCVSNSRSTS PQTAHAHPLA LLTNMVQGIE TISFPTNAAG MAINRRSPRA 
DRSSTRTSLQ RDAASSSSSS AAGNRRFLPV TSAYLRYQTM HTFVVSQPMS AAVLLLLVVA
ALDGFYESVR VRHPYNFAHG LSPASGLSKL PSSSLRRTAT TGANGSELPT PLLAKHSTTA
TTAANDTASI PTADNAPPTT RVQSAFADNS VSDNPSCLGK EHLVQILIAA GKTPEEAEAQ
CPTLPLWQEV VDLYGDAPII LGQERCQAYR EAVRNQFEDA PPLHNIRVDG LFNVGTNALA
QNNLLNLEHG RYFQPNLSLD DPDYAEKLGV LLFVGWGKHS MIKYKPTNAR LQLPVVLVRD
PYRWMKSMCK TPYRAVFDQQ PNHCPNLVPT EDEQLASGNL TTYKVSVTQN AHSTVTDEFD
SLADYWSEWN RMYRDVEFPR LIVRFEDTIF HAEAVMDAIA RCAGVERAKP YRYYVEQAKS
HGLSSNFVTA LAKYGTSQGR FDGMTPADLA YARTHLDPAL MNTFGYQYQG RYTAPQVSLA
KES