Gene PHATRDRAFT_33827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_33827 
Symbol 
ID7198060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp445000 
End bp446238 
Gene Length1239 bp 
Protein Length412 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178512 
Protein GI219115433 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.373932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAAC GAACGGAAGA GTCTCCGCGA GTCCACGCAC CGTACCCTTC TTGGCATAAC 
CCAGTCGCCG GATCCGTGGC TGGTGCAGGT TCCCGGATGG CGACGGCACC ACTCGATCTC
ATTCGAATCC GTCGGCAGCT CAATGTTGTA TCGTACCCAC GCGAAAGCCT CTGGGGATCC
TGGAAGTCGA TCGTCAAGAA TGAAGGGGGA GTATCCGCGC TTTTTCGAGG AAACGTCGCG
GCCATATTTC TGTGGATCAG TTACTCGGCA GTGCAGTTTT CCTTGTACAC TCAAACACGA
GACTGGCTGA TTCAACACGC GCCAGCAGCA GATCCGGAGG ATTCCGATTC CGAGCCTGCA
AAGTACTACA GGTCTGGTTC CGCGTTTGTT GCTGGTGCAA CAGCTGGAGT GTGCGCTACT
ATAGCAACCT ACCCCTTCGA CGTATGCCGG ACAACATTTG CAGCCCGTGG AATCCAGACC
ACGGGTAGTG CATTGACGCC GTCCAAACCG CCACCCATTA CCCCCAAGGC CACCATTCAC
CACATGCCGT TTTCGTCGCT CGTGGAACCC ATGATTCATG ATCGCGGACG GTTTGCAAGC
TCACCACCAA AACCCATCTC GACGCCTCCT CTGCAGCCAC ACCCTTCCTT TACTGTAACA
CCACCGACGC GATTGTACGA TTTTGTGTGG TACCTTTATC GCCAAAAAGG CATTGCTGGG
TTTTATGCTG GTGCTGGACC AGCCGTGCTT CAGATCATTC CCTACATGGG TATCAGCTTT
TGGCTGTACG ATCAATTGAC CGCCGGGGAT CGACGAGTTG CGCTCTCAGC GTACGCGGGA
TCCATTTCGG GAGCTGTCAG TAAAATACTT GTGTACCCCA TGGACACGGT CAAACGACGG
CTCCAAGCCC AGGCCTTTTA CGATAATTCG AGTGCGACTG AAAGCAGGAC AGGAGGGAGC
GAGCGCCGGA GGTTGTACTC GGGTCTTCGC GATTGCTTTA CCCGAGTTAT CAAGGAAGAA
GGCTGGGCTA GTCTGTATCG AGGGGTTGTG CCGTCTGTTC TCAAGACTAC CATTTCTACA
GGACTATCGT TTGCGCTTTT CCGATCCACC AAAAATATCT TGGAAGGATT GCATGAGGAC
TGCCCCTCAA TTCAAACTTC AACATGGCGG GAAACGCCAC CCGATGCCAA AAGTTTGGCA
TCTACAGATG ATCGAATATC CGACGAGCGC AAGCGGTAG
 
Protein sequence
MEERTEESPR VHAPYPSWHN PVAGSVAGAG SRMATAPLDL IRIRRQLNVV SYPRESLWGS 
WKSIVKNEGG VSALFRGNVA AIFLWISYSA VQFSLYTQTR DWLIQHAPAA DPEDSDSEPA
KYYRSGSAFV AGATAGVCAT IATYPFDVCR TTFAARGIQT TGSALTPSKP PPITPKATIH
HMPFSSLVEP MIHDRGRFAS SPPKPISTPP LQPHPSFTVT PPTRLYDFVW YLYRQKGIAG
FYAGAGPAVL QIIPYMGISF WLYDQLTAGD RRVALSAYAG SISGAVSKIL VYPMDTVKRR
LQAQAFYDNS SATESRTGGS ERRRLYSGLR DCFTRVIKEE GWASLYRGVV PSVLKTTIST
GLSFALFRST KNILEGLHED CPSIQTSTWR ETPPDAKSLA STDDRISDER KR