Gene PHATRDRAFT_47534 details


Gene Information

Locus tag: PHATRDRAFT_47534
Symbol:
ID: 7202774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 40691
End bp: 41860
Gene Length: 1170 bp
Protein Length: 387 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002181831
Protein GI: 219123021
COG category:
COG ID:
TIGRFAM ID:
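
The start and end coordinates agree with the stated gene length if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption, and the helper name `span_length` is hypothetical. A minimal sketch of the check:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
gene_length = span_length(40691, 41860)
print(gene_length)  # 1170, matching the "Gene Length" field
```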


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAAACCATGA CATCCGCCAC AGCAATCTAC GCCGCTGTCC CTCTGAATGA CAAGATTACA 
TCGGAATCAA TTGTCGAGAC TCTTGAGGAT AGCGATATCG TCTCCTTCTC GCATTCCACG
GATGACGATA CTCTCGAAAA ACCACTTTTG GGAAGCGCTC CAGCCAAATC GGTCATGAAT
GTGACCCGTC TCAAGGCTCT CTTTTTCGTA ACCGGCATTT TAATAGCCCT TGCGTCCCAA
TACCTCTTGG CGAAAACCAT GTGGAAGGAC GACGTCGTCG GTCGAGCCTC ATGGGAAGTC
GTTGTCTTTA GTCTACAGTG GAGCTTTTGG ACCTGCGTCA TGGTGTTCTC CGTCATGATA
TGTATGGTCC GCGCCTTTTC TACCCAGCAG CAAACTCCGG TGGAGGAAGG TCTTGCATTT
ACCTTGGAAG CACACCACAT TGTCGGGGCA TTGCTTGCCG TATCGGCCAG TTGGTTCACG
GTAGATTTTC TCCATCTGCA AGTTCCGACG CATGCCCATA CCCTGTCGAT TCTCGGGCTT
GTTTCGATCG CGTACGCAAT TTTTGTTCGC TGCATGACCG CACGCTTTCG AGAGCGTCGT
CATAGTTTCA GCGGAACGGC AGATCGGCAA ATCTACTCCT CCACCCAAGC TCTCATGCCG
ACCTACCAAC TCTTGGCGGC GACCCTCGGA CTCGTGGTTG GTCTCTGTTC TCAGTTCCTT
TTGAGTTTCT TGCTGTGGAC AGACAGCATG ACAACTCCAG TCATCGACAA CATGGTTGTC
TTTGCCGCAA TTTGGAGCAT CTCCACCGTC ATCATTACCT TTGTTGGTTG TGCATCCTTG
CGCTGCTTGG TTAATCAGGA AGAGCACAAT ATGCTCGAAA CGGAGCGTGT CTTTTTGCGC
ATGGAAGCAC ACTACGTCTT TTGTGCTTTG ATTGGAATCT GTGCCGCTTG GATTCTCATG
AACGTTGCGC TTGGTTTGGA ACAGCAAGTC TTACCCAGCT TGGGCATGCT CGCTCTCAGC
TTGATCGGCT TTCGAGCCAT CCTCCACTGC TTCCCCGAAG AAGATTGCCT AGCTGAGATT
GGACTCGCCC ATGCTAGAGA AAAAGAGGTT CTCGTCAGCA AGAGTACCAA AGAACAGGAT
GCGCTGCATC TGGTTGTCCA AATCGTCTAA
 
Protein sequence
MTSATAIYAA VPLNDKITSE SIVETLEDSD IVSFSHSTDD DTLEKPLLGS APAKSVMNVT 
RLKALFFVTG ILIALASQYL LAKTMWKDDV VGRASWEVVV FSLQWSFWTC VMVFSVMICM
VRAFSTQQQT PVEEGLAFTL EAHHIVGALL AVSASWFTVD FLHLQVPTHA HTLSILGLVS
IAYAIFVRCM TARFRERRHS FSGTADRQIY SSTQALMPTY QLLAATLGLV VGLCSQFLLS
FLLWTDSMTT PVIDNMVVFA AIWSISTVII TFVGCASLRC LVNQEEHNML ETERVFLRME
AHYVFCALIG ICAAWILMNV ALGLEQQVLP SLGMLALSLI GFRAILHCFP EEDCLAEIGL
AHAREKEVLV SKSTKEQDAL HLVVQIV
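
The gene and protein sequences can be cross-checked by stripping the 10-character grouping and translating from the first ATG. In the listing above that ATG sits at offset 6, and 1170 - 6 = 1164 bases is 388 codons, which matches 387 amino acids plus a stop codon. The sketch below assumes the standard genetic code (NCBI translation table 1), because the record leaves the translation table field blank. The helper names `clean` and `translate` are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), bases in TCAG order.
# Using this table is an assumption: the record's "Translation table" field is empty.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str, start: int = 0) -> str:
    """Translate codon by codon from `start`, stopping at the first stop codon."""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "CAAACCATGA CATCCGCCAC AGCAATCTAC GCCGCTGTCC CTCTGAATGA CAAGATTACA"
dna = clean(first_line)

# Taking the first ATG as the start codon is a simplification that happens
# to agree with the listed protein here; it is not a general rule.
start = dna.find("ATG")
print(start, translate(dna, start))  # 6 MTSATAIYAAVPLNDKIT
```

The translated fragment agrees with the start of the listed protein sequence (MTSATAIYAA VPLNDKIT...).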