Gene PHATRDRAFT_40679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40679 
Symbol 
ID7198498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp130387 
End bp131517 
Gene Length1131 bp 
Protein Length376 aa 
Translation table 
GC content57% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184652 
Protein GI219128927 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGTTG GCACATACCG ATGGTTCCCG TCCGCGGGTC CTAACAATCG ACTACGTATG 
GACAAACTAC GCGCGCCGGT GGGTCGTTCA TCAAGCACAT CCCGGACGGC CACGAAGCGC
GGTTCCGTAA TTCCACCAAC TTCCGCTGAC ACCACCAGCA TCTTGTCCCC ATCGAGCACC
TCGTTTGCGG TATCGCTCGG CGGTCGGGCC AAGGAAGAAC TCCGCCGAAA TTCGATAGCC
TACACTTCGC GTCTCATGTC GGGGTCCGAT CTTACTTTGG CAGGAGCGGA AGCCTCGGCT
CTTCTGGTAA GCTGTGGATA CTGGGACCAT GGCGTTCGCG TTCACGGTTT GGACAACAAT
CTTAGAGTGT TGGCCACCGA GGCCGGTGGT CACCGTGGTC CCATACTATG CTTGGCCGTT
GCTCAGGATG ATGCACTCAT GGTGACGGGA GGTGAGGATT GTACGTGTCG CGTATGGGTG
GTTGACCATT CCGACCTGGC CGTGGCACTG TCTGACGGAT ACGTACAAAC CGCACTGGGA
TCTGCCAATA CTGGCGAAAG TGTTTTAAGT TGCTGTCACG TCCTGTGGGG CCACGAAACG
CCCATCACTT GTGTGGCTTT AGATTCTTCC CTAGACGTGG TGATTTCGGG TAGTAGAGAA
GGCAAGATTT GCGTGCATAC GTTGCGTCGG GGTGAATTCG TCCGTTTCTT CACGCCCCCC
GTATCCGGCG GCACCCCGCC GGCCATCGCA CGCGTGGCGC TGCACCCCAC AGGAACCGTC
GTGGTACACG CGCGGGACCA GAGTTTGCAC GCCTTTAGCG TCAACGGCGT GCGATTGGCG
AGCGTCAACG CTGGCGAAGA ACTGTACGAC CTGCAATTCT GTAACGAATT TGTCGTGACG
GGCGGGACGC GGGGTCAAGT GTGTGTGCGA TCTTTGAGCG ACCTCCAAAT CCAATCCGTG
GTGGACTTGT CTCGGCACGG ACCCGTACAC TGTTTGGCTT TGACGAATCC CGAACTCAAC
CCGATTCCTC AATTCCTTTT CGTTGGCAGT GCCGACGGAA TGTTAACCAT CGTGGACGTG
GATCCAACTC AAGAGCAACA ACACGTATCC GACGCAGTGG TGACTTTATA A
 
Protein sequence
MSVGTYRWFP SAGPNNRLRM DKLRAPVGRS SSTSRTATKR GSVIPPTSAD TTSILSPSST 
SFAVSLGGRA KEELRRNSIA YTSRLMSGSD LTLAGAEASA LLVSCGYWDH GVRVHGLDNN
LRVLATEAGG HRGPILCLAV AQDDALMVTG GEDCTCRVWV VDHSDLAVAL SDGYVQTALG
SANTGESVLS CCHVLWGHET PITCVALDSS LDVVISGSRE GKICVHTLRR GEFVRFFTPP
VSGGTPPAIA RVALHPTGTV VVHARDQSLH AFSVNGVRLA SVNAGEELYD LQFCNEFVVT
GGTRGQVCVR SLSDLQIQSV VDLSRHGPVH CLALTNPELN PIPQFLFVGS ADGMLTIVDV
DPTQEQQHVS DAVVTL