Gene PHATRDRAFT_45551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_45551 
Symbol 
ID7200620 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011675 
Strand
Start bp521402 
End bp523159 
Gene Length1758 bp 
Protein Length535 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179665 
Protein GI219117752 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CACAAATCCA TGCAAGAAGT TCTGCGAACA CTGGTACGCT TATTGCCATG CGCAAAGGAT 
CACCCTTCAT TTCCGTAAAT GTTCGCCGCC ATGAACGGGA GGAGATTGAG CAACGGGACC
GATTCCACTC GAAGGCTGCT TCATCGTGAA GATTGTGGCG GTTCTAGTGG AGTCCGATCA
TCGCGACCAG TACCGAATTC ATCGAGACCG CAGACACAAA GAGTCCGCTT TCCGGATGAA
GATCGGGAAA GTGTCGATGG AACTCTGGAT AGCTTATTTG GTTTGATCAC CGCCGCCTTG
GCAACTGCTT GTTCAGCAGG TCTCTTTACT ATGTTACCAT TTTCCTTGGC TGCGTATCGG
AGATTGGCTT CACAATTGGG AGCGTCATCG ATCCTCGATG CATTGGCGCT GTTGCTCCCG
AATACGAGAA TCTGTTTATC TGGGGATTCT GATATACCGA GTCCAGTAGG AACTTCGATT
CTGGTGTCTA ACCATTTGAT GGATGGGGAC TGGTGGGCGC TATTAATGCT GGGGCGGTGC
GTTGGGCTTC GAGGGAGTAT TAAGTTCTTC CTTCGAAACG AGTATTTTAA TCTCAAACTT
CACAACTCAG ATTCCGCGAC AAGTCGATCT AACTCAACCA CAATTGCTAC GAGTAAAGCG
GTGGGAACTG CTGTTCACAT TCGGAACGAG AACGCCCAGG CCAGTTCATC GTTGCCGCGA
GTCTCGCACC TGCGCGAGGG CTCTACCTCT CACGGCATTG CCATCATGGC AAATCTTCTC
CACCAGTTTC TTGAGTTCCC GCTGCTGAGT GGGGACGACC ACACTGCTGA CAGAGAACAG
CTAGTTCGAC TGCTGAGGTC GTTTGCACAC GACAATGCAT CGGCCCCTGT TCATTTGCTG
TTCTTTCCAG AAGGATGGTC GCTCCACAAT GGTGCTGACC GAACAGCAAT ATTGGCCAAG
AGTAACGAAT TTGCTCAACG AGAAGGCCGC CCGCAATTAA AGCACCTGTT GCTGCCCCGT
GCTCGTGGTT TCAACGCAAG TCTTGAATGT TTACGAGAAT CCAGTCCAGT AGTCTACGAT
GTCACGATGG TACGTCATGC TGTTGTGGAG GACGTCCTGT CGTTTCTTTA CATTTTTCTA
ACACAGAAGC CTCTCCTCCA GGCCTACAGT GGGTACAATG GATCGCTTCC ACCTTCTATT
GAGCTTACCT TTCCCGCCTT GTGGAAACTG CTTCGTGGGT TCCCTCGTGA AATACACATC
CGAATCAAGC GATACAGCAT GGAAGAGGTT ACTCAGGACT CATCTTGGCT AGATCAAAAG
TGGGCAGAAA AGGATCGTCT TTTGAGTCAC TTTGCTCGGC ATCAAACCTT TCCTGCTGAT
AACCGAGGCT ACTGCCGACA TCGAGTCTTT GATACGAGAA CGCATGCGTT TGAATCTTCC
ATCATCGCAC TTGGACGCTT GCTGCTATTG CCATTGGCTG TTCCGTTGTT CGTTTTGGTA
TCTATCCCAA TATTTTGGGC CTTGATGTGG TTGTGGCTGG CACACTGGGC TTATCGGCAA
CTATTTGGTC GGGTAGAACA GTCGTCGTCC AACGGAGGCT CGTCTGGGAG TGTCGGAAGT
GCTGGTGCAG GTACCACACC TGGTACTTCG TCCGCTTCCG GAACACCCTT CTTCCCAGCT
ACTCCGTTTG CGTCTCCAAC GGTGACTTCC TGGCGTGACA TGTTCTCAAA GAGTGCCTCA
TCATCGTCGC CATCTTAA
 
Protein sequence
MFAAMNGRRL SNGTDSTRRL LHREDCGGSS GVRSSRPVPN SSRPQTQRVR FPDEDRESVD 
GTLDSLFGLI TAALATACSA GLFTMLPFSL AAYRRLASQL GASSILDALA LLLPNTRICL
SGDSDIPSPV GTSILVSNHL MDGDWWALLM LGRCVGLRGS IKFFLRNEYF NLKLHNSDSA
TSRSNSTTIA TSKAVGTAVH IRNENAQASS SLPRVSHLRE GSTSHGIAIM ANLLHQFLEF
PLLSGDDHTA DREQLVRLLR SFAHDNASAP VHLLFFPEGW SLHNGADRTA ILAKSNEFAQ
REGRPQLKHL LLPRARGFNA SLECLRESSP VVYDVTMAYS GYNGSLPPSI ELTFPALWKL
LRGFPREIHI RIKRYSMEEV TQDSSWLDQK WAEKDRLLSH FARHQTFPAD NRGYCRHRVF
DTRTHAFESS IIALGRLLLL PLAVPLFVLV SIPIFWALMW LWLAHWAYRQ LFGRVEQSSS
NGGSSGSVGS AGAGTTPGTS SASGTPFFPA TPFASPTVTS WRDMFSKSAS SSSPS