Gene PHATR_43850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_43850 
Symbol 
ID7204278 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp220871 
End bp223121 
Gene Length2251 bp 
Protein Length411 aa 
Translation table 
GC content50% 
IMG OID 
ProductCCT motif containing protein 
Protein accessionXP_002186019 
Protein GI219112871 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0351755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGAATCTCGA CAGACAAACT CAAAGTATCG CACTGCGACC TCAAGGATTT GATCCCTCCA 
CAGTCATCTA CGACGCTTTC TGAATCACAG GTAAGAACGC TTCGAATGCG GTGCTTTTGG
GCTGATTGAT TGTCATGGTC ATTCTGTATG CTGTCGTTAC GCGCGGAGTT TCGCTCCGCA
CGAACTTCGT GTGTCTGGTT TGGAAATTCG GCGCGCTCCA CACGAACAAA GCAATTCGAG
TCGGAACTTG ACGTTTTGTG TCGTGTGCGA TTTGAATTGG ACAGCGATTC GCGAGCGGTT
CTGAAAAGCC AATGAATGAT GTTTCCTTCG TCATTGTGCG TCAGTGTCTG TCGGCCCTCC
TTGAACCACG AATCTTGTGA CGGAATCTCG CCGTTGACTC TGTGTCTTAA TTGGAGTGTG
TCATCGACCC ACTCCGGGAG AGGTCCTATA TGTATGCGGA GTGGTGAAGC AATGCGATGT
CTGCGACAGC AAAAAGCGAC ATTTTGCGTT ATGGAAGATA GACCCGACTG ATCCCCAATC
GTCATCCGTC AGCCAACACT TTCTCAGCCG CTCATTGGTT TGTTGATGGG GGGGGTCGGT
CCTCTGTGCT TCTCTCGGGT CTTCGGGAAG TGTCAAATCA AACCACGCGA AATAGCGAGG
CCTGTTTATT CTTATCGCGT CTTCAACATC AAGTTGTATA CATTTCAGTT ACTTACATTA
CTCTGTCCAT TCACTGTCAA TGCCTTTTTC TGCAATATAC CTTGAGCAGG CTTTCTTTGA
ACAATCCTCG GCATGATCAT GGCTAGCTCT ACGCTGACTG CCAGTCTATC CCCTGCACCA
ACGATGGACC ACCAGGACAT GGATGCAGTG AGCGCATTGC TCGGAGTGTC ACCGTCGCGC
ACGAACGGCT TGCTACGCGG AAGCACATGG ACATCGTCTA GTACGGTGCC AGAACCGACA
TCACATGGGT TCTCATTCGC ACCCGCACGT ATCGTAAGCT TGGACGAGCC ATCCTCTTCG
CCTTACATGA CTGCTTCTGG TAGTCATGAT TCGGGTCGGT CTCCTGAATC TGTGACTCTG
CAAACAAGAC CTCGCTCCAA TTCGGCTGGA CTAGATGCTT TGGCCTTGTT GGCATCCAAG
GAACAAGCCA AATACGAAAA CACAAAGCTC CAGAAAGAAG AGCAACCTTC CTCGTTTGAA
AGTTTCATTC TTTCCGCTTC GCCCTCGTCT TCGAGCGACG ACGATGACTC GGAATCTATG
CCGCCACCGG CGCCGCGCGG ACGTAGACGA AGTGCATCGA ATCCGGAGGG AATGGAGAAG
TGGGACTCAT TGAGCGTGGG ACGCAATCAA CACAGGAACA ATTCTTGTCG TCGTCATTTT
ATGCTACCAG ATTACGTCCT AGCTGAGGAA CTGGCAGAAG CAAGTGCCGC TATAGAAGCG
CACGGTCGCA AACCACCGAG GACCATCCCT GAACACGCTG AATATGAAGA AGATCCAGCG
GACAATTTTA GTATCAGCCA AGACGAGGAA ATAGAAGAGA ATTTAACGCC AGCAGAATTG
CTTCGCCGAG CACGATCTCG GCTTTTGGAG GATTTGAGCG AGGGAAACAT TAGCGGAGAC
AAAGGGGTTG TCACGCTACC GCATTCGCTT CCGAAATACA AGGAGGTAAG TGCCACACTG
TTGTGTTCTC ATAAACGCGT TTTCCTCGAT ATCCCGCAAC GACTAAAGTA TTCCTTCTCA
TACATTTCGT CAGTTTTACA ACAATGGTCG CATCGGAATC TACACACCTA ACGAACGAGC
AGCGGTAATT GACCGATACA AGGACAAACG CTGCCGCCGC GTTTGGAATA AGAAGATTCG
CTACGGTTGC CGTAAAAATT TAGCAGACCG CCGGTTGCGC GTGAAGGGGC GATTCGTGAA
ACGTTGCGAA CAAGAGCAGC TTGCTAAGCT GCTAAAGCTA CAGGCGGAGG AACAGGAAAG
CAACGTTGCG CTCAGTGAGG ACGATGTAGC TACCAATGGG GACGAAGACA TGCCAGATGT
TAACGACCCC GAAGCTGGTT TTGATCCCAC GGACGATCAA CCTTATCGTC GTGTCCGTCG
CCATACGATT ACCTAATTTT TGAGATGAAG AAAAGAGAAG CTATCATGCA TTTTACATAG
CGCGCTAACC TTGAAAGCAG CGTCTTGGTA CTATTGACTC CTCTGCTAAC CTTAATAGTT
ACTTACAATT AACTGTAAGA CTTTTTAAGT C
 
Protein sequence
MIMASSTLTA SLSPAPTMDH QDMDAVSALL GVSPSRTNGL LRGSTWTSSS TVPEPTSHGF 
SFAPARIVSL DEPSSSPYMT ASGSHDSGRS PESVTLQTRP RSNSAGLDAL ALLASKEQAK
YENTKLQKEE QPSSFESFIL SASPSSSSDD DDSESMPPPA PRGRRRSASN PEGMEKWDSL
SVGRNQHRNN SCRRHFMLPD YVLAEELAEA SAAIEAHGRK PPRTIPEHAE YEEDPADNFS
ISQDEEIEEN LTPAELLRRA RSRLLEDLSE GNISGDKGVV TLPHSLPKYK EFYNNGRIGI
YTPNERAAVI DRYKDKRCRR VWNKKIRYGC RKNLADRRLR VKGRFVKRCE QEQLAKLLKL
QAEEQESNVA LSEDDVATNG DEDMPDVNDP EAGFDPTDDQ PYRRVRRHTI T