Gene PHATRDRAFT_22117 details


Gene Information

Locus tag: PHATRDRAFT_22117
Symbol:
ID: 7203014
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 739291
End bp: 740609
Gene Length: 1319 bp
Protein Length: 407 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182448
Protein GI: 219124306
COG category:
COG ID:
TIGRFAM ID:
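
The listed gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state its convention); `span_length` is a hypothetical helper name, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
print(span_length(739291, 740609))  # 1319, matching the listed Gene Length
```

Under a 0-based, half-open convention the same coordinates would give 1318 bp, so the match with the listed 1319 bp is consistent with the inclusive assumption.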


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.701766
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGTGCCATT GGCAAGCCGA ATAGGTTTTT TTTTCATGAG TGATCATGGT TGTCCTGGAA 
AAGCTGTTTC TGTATCGGAG TATCTGCGTC GGATCCCGAA AGTCGAACTA CACGCGCATT
TGAACGGATG CATTCGGCAC GAAACCTTGA TGGATCTAGC TCACGAGAGA GGCGCGACGC
TGAGTAACAG GCACTTTTCT GCGGAACCGC TCCACGAGAA CCTCGCTTCA CCCCCAAACA
ATGGCGAGCA CCACAGCATG TACAATATCA TGCCACGATC TCTGCAGAAC TGCTTCGATA
TATTTGCCGA AATTCCGGCT TGCGTTAACG ACTTGTCGGC ACTGCGAAGA ATAACGCAGG
AAGCTCTGGA AGATTTCGCA GCACATCACG TTGCCTATCT CGAATTGCGT TCTACACCGA
AGCGCTTACT GCGGTCACAT CAAGATGATC AATCGCAAAA GGTTGACAAA CAGGTGTACA
TTGAAACAGT GTTGGAGGGT ATACGCGACT TCCAGAGCAA AGAAAAGGAA CGCTTCAGTC
ACGATCCAGT ATTGTCATCG TCTCGGTTAC CTATCGTGTG TAACTTCATT GTCGCTATCG
ACCGATCGCA GTCCCTGGAA GAAGCAACGG ATACTGTACA TATTGCAATC GACATGTTCC
AACGCCAGCA GAGTCGGCCT TCCAATCTCT CGCCGTCAAT TGTCGGAATC GACTTGGGGG
GCAATCCGAC CAAAAATGAT TTTCGGACTT TTCAGACCCT CTTTCAAAAG GCGAGACAGG
CCGGACTCAA GGTGACGATC CATTGTGGTG AAATCCCATG TGCAGAAGAT GATAACAGCA
AACACGAGCG TCGCGTTGCC ACCGAATCGA AACGGAAAGC CCGGGACGAA GCCGTGGCCA
TTTTGGCTTT CCGACCGGAC CGTTTGGGAC ACGCCTTGTT GCTCCCATCC TCGCTTCAAA
AAGTGCTGGA AGACACCAAG ATCCCCGTGG AAACCTGCCC CACAAGCAAT GTCATGACGT
TGGAACTCGC CAGATCCTCG AACGGGAATC TCGTGCACGG ACTATCCCAG CATCCCTGTT
TGGCACAATG GCTCCAGAAC AATCATCCAT TGTCTATTGG TACAGATGAC CCGGGTGTCT
TCCATACCAA CGCAACTAAA GAACTGGTGT TACTGGTCAA TACCTTTTCT TTGGATCCTT
GTGCAATGGC AGAAAAGGTT GCTGATTCTG TCAACTACGC GTTTTGCAAT GAGACTCTCA
GGCAAGAGAT AAACGCCAAG ATGCGTGAAA TCATGAAAGA GATTCATCAT TCTTCCTGA
 
Protein sequence
MSDHGCPGKA VSVSEYLRRI PKVELHAHLN GCIRHETLMD LAHERGATLS NRHFSAEPLH 
ENLASPPNNG EHHSMYNIMP RSLQNCFDIF AEIPACVNDL SALRRITQEA LEDFAAHHVA
YLELRSTPKR LLRSHQDDQS QKVDKQVYIE TVLEGIRDFQ SKEKERFSHD PVLSSSRLPI
VCNFIVAIDR SQSLEEATDT VHIAIDMFQR QQSRPSNLSP SIVGIDLGGN PTKNDFRTFQ
TLFQKARQAG LKVTIHCGEI PSRDEAVAIL AFRPDRLGHA LLLPSSLQKV LEDTKIPVET
CPTSNVMTLE LARSSNGNLV HGLSQHPCLA QWLQNNHPLS IGTDDPGVFH TNATKELVLL
VNTFSLDPCA MAEKVADSVN YAFCNETLRQ EINAKMREIM KEIHHSS
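
The sequences above are displayed in space-separated blocks of 10 residues, so the blocks have to be joined before any length or composition check. The following is a minimal sketch, not a definitive implementation: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line of the gene sequence is used as input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace; uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence from this record.
first_line = "CGGTGCCATT GGCAAGCCGA ATAGGTTTTT TTTTCATGAG TGATCATGGT TGTCCTGGAA"
print(len(clean_sequence(first_line)))  # 60 bases per full display line
print(gc_fraction(first_line))          # GC fraction of this line only
```

Applying `clean_sequence` to the full gene and protein blocks should give lengths that can be compared with the listed 1319 bp and 407 aa. Because the record marks the gene as spliced, the gene sequence is longer than the 1224 bp needed to encode 407 residues plus a stop codon; the remainder is presumably non-coding sequence.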