Gene PHATRDRAFT_49575 details


Gene Information

Locus tag: PHATRDRAFT_49575
Symbol:
ID: 7198239
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 89891
End bp: 91092
Gene Length: 1202 bp
Protein Length: 375 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002184391
Protein GI: 219128377
COG category:
COG ID:
TIGRFAM ID:
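
The lengths in this table can be cross-checked against the coordinates. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions on the replicon (the record does not state this explicitly) and that the unspliced coding region ends in a single stop codon. The function names are hypothetical and serve only as illustration.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode a protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


# Values copied from the Gene Information table.
gene_length = inclusive_length(89891, 91092)
cds_length = coding_length(375)

print(gene_length)               # 1202, matching the listed Gene Length
print(cds_length)                # 1128
print(gene_length - cds_length)  # 74 bp of the gene span not accounted for by the CDS
```

Under these assumptions the computed gene length agrees with the listed 1202 bp, and the coding region is shorter than the gene span by 74 bp.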


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.00359533
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTCGAAAGG CAGGAACTGA CTGTGACAGC AAGTTAAGTT CGGACTACCT GCGTATAATC 
CAAATGTAAC TTGCATGCTA CCGCTTTCGC TTTGGAATAA GGTATTCACC GCTACTGCTG
CGGCAGCCGC GAGCAGTGTC GGTTGGTGGA TGTTGGATTG CCCGCCATCG CTGTACATCT
ATGACAAGTT TCGGACTCTT TCTTCGCGAC CAGGTCGCGA AGGTTCCTTT CACGACAAGA
ATGTGTGGAT AGTGGGGGCT AGTAGTGGTA TTGGACGTGA GCTCTGCTTT CAGTTGGCGG
CCTCGGGCTG CACCAACGTC ATTGTATCGA GTCGCTCTAC CGACAAACTG GAACGAGTGG
CGTCGGAAAC TATCCGTCGG TATCCCCGCA CGACCTGTCA CGTACTGCCC TTGGATGTGT
GCGATGATAC GCAACTACAA CAGTGCGTAC AAATCTTGCC GTGTCCGGTC GATTTAGTAA
TTTTAAACGC CGGTAGTGGA CACCTGTCGC CGGCTCTGGA AACGTCTCCC CGTACGGTCC
GCAACATGCT CGAACAAAAC GTCGTTTGGC CCATGATTTT GATTCCTTTG TTACTCCACA
GTGACTTTGG AGTCTTCCGG ACTTCATCTT CACAAATATT CCCACGTATT GCCGTAACGA
GCAGCGTTGG TGCCGTTCTA CCGTTGCCGT TATCGTCCGC CTACGCCGCT TCCAAAGCCG
CCTTGAACCG CTATCTCGGC TCACTGCGAG CCGAACGACC CGATATTCGT ATCGACATTT
GGTGTCCTGG TCCTGTGGAT ACCGACTTTC ACGGATCCCA ATCAGCGGCA AACGTTGCAA
CGCTTACAAA AGGAACCTTG GCCGACGAAT CGGTGTCATC GGCATCCGTC TCCCGGTCCC
GGCTGAAAAT GCCAGTGGCT CGGTGCGTGT CACTGATGCT GTCGAGTTTG TTGCAAACTT
CTCGACGCGA AGTTTGGATC GTTCCACAAC CAACCTTGAC CGTCTTGTAC TTGCAGGGAT
TGTTTCCCGG TCTCGTGGAT TGGATGCTCT CCCTGATCGG TCCGAAGCGC GTTGCCCTGT
GGCGCGCCGG TCTCGATTTG TACGATCCCG CTTCGTGGAC CGGAAGAAGA CCGACAGCGT
CACTGGGCAC CTCCTCACAA AACAAAAACG AAAATAACGA GAGTGATTCC ACAACAAGGT
AG
 
Protein sequence
MLPLSLWNKV FTATAAAAAS SVGWWMLDCP PSLYIYDKFR TLSSRPGREG SFHDKNVWIV 
GASSGIGREL CFQLAASGCT NVIVSSRSTD KLERVASETI RRYPRTTCHV LPLDVCDDTQ
LQQCVQILPC PVDLVILNAG SGHLSPALET SPRTVRNMLE QNVVWPMILI PLLLHSDFGV
FRTSSSQIFP RIAVTSSVGA VLPLPLSSAY AASKAALNRY LGSLRAERPD IRIDIWCPGP
VDTDFHGSQS AANVATLTKG TLADESVSSA SVSRSRLKMP VARCVSLMLS SLLQTSRREV
WIVPQPTLTV LYLQGLFPGL VDWMLSLIGP KRVALWRAGL DLYDPASWTG RRPTASLGTS
SQNKNENNES DSTTR
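
The sequences above are displayed in blocks of ten characters. The following is a minimal sketch of how such a listing could be turned back into a plain string and how a GC fraction could be computed from it. The helper names are hypothetical, and the database may compute its GC content figure differently (for example over a different span), so the result is not guaranteed to match the 53% listed.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if the sequence is empty)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "CTTCGAAAGG CAGGAACTGA CTGTGACAGC AAGTTAAGTT CGGACTACCT GCGTATAATC"
seq = clean_sequence(first_line)

print(len(seq))                    # 60 bases per displayed line
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

To work with the whole gene, paste all sequence lines into one string before calling `clean_sequence`.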