Gene PHATRDRAFT_49612 details

Gene Information

Locus tag: PHATRDRAFT_49612
Symbol:
ID: 7198256
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 202233
End bp: 203923
Gene Length: 1691 bp
Protein Length: 439 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002184419
Protein GI: 219128436
COG category:
COG ID:
TIGRFAM ID:
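
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length but is not stated by the record itself. The function names are hypothetical and chosen only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode a protein, plus one stop codon."""
    return (protein_length_aa + 1) * 3


gene_len = feature_length(202233, 203923)
cds_len = coding_nucleotides(439)

print(gene_len)            # 1691, matches "Gene Length"
print(cds_len)             # 1320
print(gene_len - cds_len)  # 371 bp not accounted for by codons
```

The 371 bp difference is what one would expect if part of the listed span is non-coding, which fits with the record marking the gene as spliced.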


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00322088
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTGATATTTG TATATAGGCG ACTGTACTGA CAGTGGTCCT CATTATCATC TTGACATAAC 
AGTGAAACCT CGTGGAGCTC TCGCCGTGTT TAAATTTGAA TGATACACAC GTCAAACGAA
TTCCAAACGC CAAAGTTGAT GGCTCCACGA GAAAACAGTT GTCGAATGGA GGCGTTTTCA
CCCAATGCTG TCGGATTACA ATGCAGCTGA CTTCTCTCTC TCTACAGAGT AGCGTATACC
TTACCACATT CCCTCAGACG CTGTGTCCAA ATAAAGAATG TCCGCTCCAA GGAAAGAGAT
CCATAGTAAA CCAAGAAAGA ATGAAGCGAA CCCCTGGCAT CACAACACTG ATGCGCTTGC
GACCTTGGAC CACTTTGGAA AGGTTCGTTC TCACGAATTT TCGCTGTGAA AGAAGCCTTT
ACAGAACCAA AACCAATTTA TTCCCTCAAA TACTTTGACA TGCGTTTGTA ATACAGAGAG
CTCTCGAGCG TGAGGCTGCC GTCCAGCAAA TGTTCGTCGA GTACGTAGAA AAATGGTCAG
CCAGCGTCAA CGCCGGAGAG ATCCCGAACC ATAGCCGTAA AAGCGATACG GTGCGATTTC
TACCACCAGA CTTTCATTTG GAGGATAGCA CTTGGGTGTC TTTCTCAAAT TTTGTGAGAG
CTCATGGCGA CGGCTATTGG TCAGCGAAAC GAACTCGGTT GACTTTGATA GAACTGAACA
ACCATCCAAA ACTCAGAAAT CACCGAGGGC CTTTCTACTT TGTCGATCTT TCACATAGCG
AGCCACAAAC CCAATCATCT ATTCATCCAG CAGGGGAAAA TCAAGAGGAG GACAGTCAGA
CTGTTTCAAA ACGAAAAGAG GGTTCTCGCG AATTCTACGA TTCAAAATTT GTGTCGAATA
ATCAGAGAAG TCAGCAACCT AAGTGGCCCA AACAGGCACA AGAGGTCGAG ACTGCATACA
GCCAATTTGT TGAAAGCAAG ATGTCGACGG GCCTGAAACA GCTTACTCAT TCTCAATCAA
GAAATGTGGG CGATCCCACT TGGATCCTTC CAGCGCCCGT TCCCTTCGAC ACGGTGCAAC
TACCTCAGCA CGTCCTCTAT CGGCAGCCGA CTCAGAGTAA AAAAATGTCT GTGGGGGATA
TCGATTCAAG TTCACCTCCA ACATTGCGCC TTTTGGGAAA GCATGCTGTT TCCCCTGGCA
ACCATGATTT TTCTCGTCGC TCAAAAGTCT TACCTAGGTC AATGGCTGAC AGCAAAAATG
GATGTCTCGA TTTTACCGAG TTTCACAGCA GGAACGGCTG CTTTTTGCCG ACGGTACCGC
TACTCCACCA TGATCAGCAT CTGTATAGCG CTACACGAGA TGACAACGAA CGATTGCCAG
TCCTACCATC CAATACTGAG ATGCTTGTAT ACTCCCAGCT CTCGGTCAAT ATGTGCGATA
TCGAGACGCC TTTTAACGAG ACGACGGAGG CCGAGACTAC AGAGTACACA CGCGCAGACC
AGATCATGCT TATGGCTCCA CTCATCCCGA CGCCACCCAA ACCTACTCCT CCCCGCATGC
AAAGGAACCT TGCCAACCAC TGTCTTCCTA GACTATCACC GTTCGTAAGC CCATTTCAAT
CGTTGGAGAA TTCGGTCACG TCCACGAAAA TAAACGACAC CTGCAAAGGC TTACCACTCC
AACATGATTA G
 
Protein sequence
MSAPRKEIHS KPRKNEANPW HHNTDALATL DHFGKRALER EAAVQQMFVE YVEKWSASVN 
AGEIPNHSRK SDTVRFLPPD FHLEDSTWVS FSNFVRAHGD GYWSAKRTRL TLIELNNHPK
LRNHRGPFYF VDLSHSEPQT QSSIHPAGEN QEEDSQTVSK RKEGSREFYD SKFVSNNQRS
QQPKWPKQAQ EVETAYSQFV ESKMSTGLKQ LTHSQSRNVG DPTWILPAPV PFDTVQLPQH
VLYRQPTQSK KMSVGDIDSS SPPTLRLLGK HAVSPGNHDF SRRSKVLPRS MADSKNGCLD
FTEFHSRNGC FLPTVPLLHH DQHLYSATRD DNERLPVLPS NTEMLVYSQL SVNMCDIETP
FNETTEAETT EYTRADQIML MAPLIPTPPK PTPPRMQRNL ANHCLPRLSP FVSPFQSLEN
SVTSTKINDT CKGLPLQHD
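
The sequences above are printed in blocks of ten characters, sixty per line, so they need to be cleaned before use in most tools. The following is a minimal sketch, assuming the only non-sequence characters are spaces and line breaks. The helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First displayed line of the gene sequence, copied from the record.
first_line = "CTGATATTTG TATATAGGCG ACTGTACTGA CAGTGGTCCT CATTATCATC TTGACATAAC"

dna = clean_sequence(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 3))
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length can be compared with the reported 1691 bp, and `gc_fraction` on that string can be compared with the reported GC content of 47%.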