Gene PHATRDRAFT_39968 details


Gene Information

Locus tag: PHATRDRAFT_39968
Symbol:
ID: 7195466
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 482047
End bp: 483489
Gene Length: 1443 bp
Protein Length: 457 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002183883
Protein GI: 219127315
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAGC CGGAGCATCC TCTACCGCCT TCGACCACGG ATGATTACGT GGCGAACGGA 
GAATTTTCTG CGTCATCAAT GAATGCCGAC GACGCTAGTA TGGTCGCATC CGATGCTGCT
GCTTTACACT TGCCAGAATC GTCCGAGGTA CTGATGAAAA AGACTAGGAT TACAGCTCTC
AACAAAGCAC TTTTTGCAAC GTACTTTTGC AATGCCGTAT CGGTAACACT ACCAGTCATT
CTCATGCCTT TGATTGCTGC CGAGCAGACT TCTCTGGCTG GCTCTTCGCT CGCAACTGCC
GCATTTGTGG TATCAACTGC ATCCGTTTCT ACCTTGGGCG GGGGTTTCGG CAAGTTCATC
AACGGGTTTG TGTGTTAGGC ATTGGGCGGC CGAGTGTCGG CTTCGCTGTA CCTTACAGCC
ATGGCAGGCT TCCATTTGTG GTTGTCTTTT AACAAGACAG GCCCCATATT TGGATGGATT
CTTGCTGGTC TGGACTTTTG CGCTTCAATT CAATGGACAG CATGCTCCCT CATTTTGGCA
AATCACTACG ACACCAGTCC TGCCGAATTT GCAGCTGGGG TTACTGTTCT GAGTTTGGCC
AGCACGTTCG GAGTTCTTTT CTCCAAAATA GGAGGAATAG TATTGCTCCA GTATGTATCA
TCCTGGAGTA TTGTTGCTCG AGTTGGAGCG GTAGTGGCTG TGGTCGGAGC AATCCTCGTC
CGCTCTTTGG TTACCGAAAT GCCACTCCAG GCCGGAGGAA TTACTCCATC GTCGATCAAA
CGGTTTAACA TTAGAGGAGT TGTGCGGTCT CTAGGCAATG TTTTGGGAAG CAGAATATTT
TGGTTGGTGG GATTGGCACA CGCAACCACC TTCTTGGCTC GCACCAGCGA TCGCGTGTTG
GGGTCATTCT TTTTAGAGTC TACTTCTCTT CCTCGCCATC TGTGCGGGGG TCTTACGGCT
AGCGTGACCC TCGGTTTTGT TCATGGTCTG GGTAAGGGGA AAATGTTTTA CAGCCTGAAG
GATACGCAGT CCAAGACAAG ATTGTTACGG AAGAACTACG CCAAGGCGAC GTTATCTTGC
CTGGCTTTGG CTTTGCTGGC GAATCAGAAG GTAGCTACTG TCTTGTTCCC CTCCAAGTAT
GTTATTGCGG GGTTGGTCGC ATTGCTAACA GGAGTCATGG CGTCGTCGCT CTCCTTTCAA
TTTTACCAGA TCCCGCCTAT GACGTCTAAG ATGTTTGGTG AAGACAAAGC GGTATGTCTT
TCGTTTCTGG ACGGCATGGG TTTCTTCTTG TCAGCTCCCA TATGGGCTGT TACAAGCCAA
ATTGTTGGAG GTCTTGGAAT TTATGGGTGG TCAACTGCCT GGGTGATGTT GGCCTTTCTG
TTCGGCTCGG GAGGGGCGCT GATGCTAAGA ACGCTCCCGC AAGTCCTTGA TGAACAACGG
TAA
 
Protein sequence
MTEPEHPLPP STTDDYVANG EFSASSMNAD DASMVASDAA ALHLPESSEV LMKKTRITAL 
NKALFATYFC NAVSVTLPVI LMPLIAAEQT SLAGSSLATA AFVALGGRVS ASLYLTAMAG
FHLWLSFNKT GPIFGWILAG LDFCASIQWT ACSLILANHY DTSPAEFAAG VTVLSLASTF
GVLFSKIGGI VLLQYVSSWS IVARVGAVVA VVGAILVRSL VTEMPLQAGG ITPSSIKRFN
IRGVVRSLGN VLGSRIFWLV GLAHATTFLA RTSDRVLGSF FLESTSLPRH LCGGLTASVT
LGFVHGLGKG KMFYSLKDTQ SKTRLLRKNY AKATLSCLAL ALLANQKVAT VLFPSKYVIA
GLVALLTGVM ASSLSFQFYQ IPPMTSKMFG EDKAVCLSFL DGMGFFLSAP IWAVTSQIVG
GLGIYGWSTA WVMLAFLFGS GGALMLRTLP QVLDEQR
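
Consistency check (illustrative)

The fields in this record can be cross-checked with a short script. The following is a minimal sketch, not part of the original record. The helper names (join_blocks, gc_fraction, span_length) are hypothetical, and the script assumes that Start bp and End bp are 1-based, inclusive coordinates. That assumption is consistent with the listed Gene Length of 1443 bp. Only the first line of the gene sequence is used here to keep the example short. Note that the gene is marked as spliced, so the gene length includes intron sequence and is not simply three times the protein length.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and newlines from a sequence printed in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCGAGC CGGAGCATCC TCTACCGCCT TCGACCACGG ATGATTACGT GGCGAACGGA"

seq = join_blocks(first_line)
print(len(seq))                      # number of bases on that line
print(round(gc_fraction(seq), 2))    # GC fraction of that line only
print(span_length(482047, 483489))   # compare with the listed Gene Length
```

Pasting the full gene sequence into join_blocks in place of first_line would allow the same comparison against the listed Gene Length and GC content fields.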