Gene PHATRDRAFT_47036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47036 
Symbol 
ID7202137 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp244755 
End bp246292 
Gene Length1538 bp 
Protein Length430 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181336 
Protein GI219121985 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAAC CTGGACCTCA AGCATTTCAA TCATTCGAAG AAACCGAGAG CTCCGGCGAG 
GCTGTCCGTC CTCTGGATTT CCCACCACCT GCAGTTCGTG CCGGCGTCCG CGTAAACGCT
TTGGTGGTGC ACCCCGAGTC GGGAACGCGA CAGGTATGTT CCGGCGTAAT TCATCGTGAA
GACTTGTCGG AAGCGTCGGA ACCTATATTG CAAGGCAGCG GGCCTGCGGG AGTATCGCCT
ATGACGACGC ACGAACGAGA CCCCTCGGTA CAGGAAGAAG TGTTAGCCTA TTGGCCGCAA
CGCCGCTTAC AGGATGCCAT TTACGGATCC GTATGGGCCT GCCTGGTTCT GCGACGGCAC
CACGGGATTG CAGCCGATGA CGCCGCACGG GCAGCTGGTG TGGAACCAGG GTCTGCCAGT
GCTCCAATTG TATGGGAAAT ATCAGGCCAG CATGTTGCGA TCAAAATGGT AGAATGGGCA
CGCGTTCATC ACATGCGAGG ACGGCTTCTG GAAGATCCGG TGAAAGAAGT TGCTGCTATG
CAGCTACTGG GTGCGCGTCA TCCAAATGTG CTGGGAAGTA CTGAAGTATT ACAAGATGGT
GACTTCTTAT ACTCGATAAT GCCCTACTGT CGAGACGGTG ATTTGTTCGG CGTTGTTGTC
CAGTACGCTG AGGACAGCGG TGGCGAATCT GGCATGCCCG AGCCGGTCGC ACGCTTTTGG
TTTCGTCAAA TTTTATGGGT ACGTAAAATG TGATGCACTG AAAAATCAGC GGGCTTACAT
TGAGTCCGAA GGTCTAACAG AATTTGCTCT TGGCATTTTG GTCGTAGGGT CTTCATCATC
TTCAAACGCA AGGCGTATGT CACCGAGACC TTTCTCTCGA AAATATTTTA GTTGATGGTG
ATCGCTGCAT GATTATCGAC ATGGGCATGT GTCTACGAGT TCCCTACAAT GATCCTCACA
AGCCCGGAGC AGTTACCGAT GTCACGCGTG GAAGTACTCG GCGATTGATG CGACCACAGG
GAGTTTGTGG AAAGCACAAC TATATGTCCC CAGAAGTATT TGCCAACACC GACAGCTTTG
ATGGTTTTGC TATTGATCTG TGGGCAGCAG GTGTCATTCT TTATATCATG CTGACAGGAT
TCCCGCCTTA CGACCAAGCT AGTCGAACCG ACCAGCGATT CGAGCTGATT GCCACTGGTC
GCCTAATGGA GCAACTTCGA AACTGGAACA TCCAACTTTC TGAGGAAGCA GGAAATCTGT
TACAGCGAAT GCTGACATTA GATCCTCGTG AACGGCCGAC GCTTGCGGAA ATTCTTGCCG
ATCCATGGGT AACGAGCGAC GACGTACATG TTCCTCCTCC GCCGGAGCCG CTTCCGTTCT
AACGATGGGT ACGCCTTGTT GGTTTCATTT ATCTTTCCCC TTCGAAAATT GTTTTCAGGG
ATTTCGTGGA CACAACCGTC CTCTACGTTA CAATCTTGCT ACTAGGATGG TCAGAAGGCA
CGAATTACGG CTAACGCAAA AGAAAAACTC ACAGCTTG
 
Protein sequence
MEEPGPQAFQ SFEETESSGE AVRPLDFPPP AVRAGVRVNA LVVHPESGTR QVCSGVIHRE 
DLSEASEPIL QGSGPAGVSP MTTHERDPSV QEEVLAYWPQ RRLQDAIYGS VWACLVLRRH
HGIAADDAAR AAGVEPGSAS APIVWEISGQ HVAIKMVEWA RVHHMRGRLL EDPVKEVAAM
QLLGARHPNV LGSTEVLQDG DFLYSIMPYC RDGDLFGVVV QYAEDSGGES GMPEPVARFW
FRQILWGLHH LQTQGVCHRD LSLENILVDG DRCMIIDMGM CLRVPYNDPH KPGAVTDVTR
GSTRRLMRPQ GVCGKHNYMS PEVFANTDSF DGFAIDLWAA GVILYIMLTG FPPYDQASRT
DQRFELIATG RLMEQLRNWN IQLSEEAGNL LQRMLTLDPR ERPTLAEILA DPWVTSDDVH
VPPPPEPLPF