Gene PHATRDRAFT_54749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54749 
Symbol 
ID7202602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp802419 
End bp804622 
Gene Length2204 bp 
Protein Length734 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181807 
Protein GI219122968 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.104715 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTTTT GTAGAATGAA TGCAAGGAGA GGGCACCTAC TGTTTATATT TCGCGTTTTC 
GTGGTAGGCG TGAAAACCAG CTTGCTCAGA CTCATAGACG CTTGCTCCGT GTGTCCCGAC
AGCAGAAATT TCACCCTACC CGAAAAAGCT ATAAGCATTC CCGGCTTGGA AAGTATTACG
ACTTGTCAGC AGCTAGGATC TACCGTATCG ATTCTCCTGT CTGACAACGA TCCTTTGTGC
AAAACAGTAC AGTCACTTGG CACTTTTTGC GGATGCCCGA AGGCAGAAAC AGCGTGTGAG
ATTTGTCAAA GCGGCCAGAG ACTTAGCCGA CCTGCCCGAG AATTGTTTTT TCTCTCTCTA
GAAGGCATCA TGCCATCGTG CGAGCTTTTC GAAGCGTACC TGCACAGTCT GGCCAGCGAC
GATGAAGTAT GTATCGCGTC TATCTTTTTG CTTGCAGATT ATTGTGGCTG TACGTCAGAA
GCGGATGATA TATTGTCAGG AGACGACATG TCTAGCATTG GCGGCTGCTC CCTATGTCAA
AATATGGACG ACTTAGAAAT GGAAACTGAA GTTTATTTTC CTGGGTTTCC GCTGAACACT
TGCGGCCAGT TCGCTAGCAC AGCAAACGCT GTCTCATCAC CAGAAGCAGA CACATGTCTT
TATCTTCAAG CTTCCATAGG CACACTTTGT GGGTGTTCTG TGAGAAACCC AGCGACGCAG
CCCTGCACTC TGTGCAGTGA TAGATCAGCC CCATCACGTC CATCTCAAAT GTTGCCGTTG
CTGTCGAGCC AATTCGCCGG GTTGACGCCT ACCTGTGAAA TCCTGGAAGC ATCATTGCTA
TCCGTTGAAT TGAGCTCAGA CGAGTGCAAA GTCGGACATT TGCTTGGAGG TGTCTGCGGT
TGCGCACCGG TTGAGGATCA TTGTGTTTTC TGTCCTGAAG GAGATCCGAT ACCTGAATAT
CTGCGAGAGC GCGAACTTCC ACAATTCGCT AGGTTTCTGG ATGGGCTAAT TGCAACCTGT
GCAGATATTG AACAAGCTCA AACACAAATA CCCACGGAAT CGTTAATCTG CAGCAGAGGC
AAGCATCGCA AAGATCTATG TTGCGGGGGA CACTTTCAGT ATTTAGGAAC GTCGACAGTC
CGAAAGCAGG CTGTTTTGGC CTGGCTGCCT CGGGGCGTCG CTCTTCTTTC GATTTTGGGA
TCTACTTACG TTCTCTCAAA TATTGTCCGC CACAAGGAAC GTCGCGCAAC GATTTTCCAT
CGCATAATGA TTGGTCTGAC CGTATCTGAT ATAGTGAGTG CAGTGGCTTG GGGTTTCACA
ACTTTGCCTG TACCGGCAAA CGATGATTTC GGTGCCCCTT CGAAGATTTA CGGCGCAAGA
GGGAATGGCG TGACTTGTGC AATTCAGGGC TTTTTTATTC AATTGGGCTT CACGTCGATC
TTTTTCAATG TAGCACTGAC AACCTACTAC GTCCTAGTTA TTGTCTACAA CTGGAGAGAA
ACGCGACTGA TCAAGCTGCA GTATTGGTTC TATACAATAC CCGTTCTTTT GGGCTCCACT
TTGGCTTTCG CTGGAATACC GTTTTATACG AACAACATTA TGGCTTGCTA CGTTAACGCA
CCTCCTTTGG CGGAGAAATA CACTGTCATC GCTTTGCTGG CCGTGGTGCC TGTGTGCTTT
GTTGTCCTCT TTTGCACCAC AGCAATGGCG AGAGTTTATC TGCATGTACG TCACCAGCAA
AAAAGGGCCA GTAATTGGAG AATGGGCGGG TCTGGAAAGT GTATTGAGCA GCAAGTACTA
TGGCAATCGG TCTTTTACGT GGGGGCCTTT TACATTTCGT GGCCAATTCA AGTTGTTGGA
ATCTTCATGT CCGAGCCACC GTTTCGAGAG CATACGCCTT ACGCATTTTG GCTTCTTATG
GTGTCATTGG CACCAATCCA AGGCCTTTTG AACTTTTTCG TTTACATAAG GCCTCGCTTG
TCCAAAAAAC GTGATTCCGC TGGCTCCTCA CCGGCGGATT CCGGCATTCA AGGATCGTCT
TGGCTGTTGT CGACACGTTT CGCCAAGGTA CTCAGTATTT TTGAACGTCA ACGAGAAGCA
AGAGACGAGA CGTCGTTATT GAATGAGCAA GTTTCTCACA AAAGGAATGA GGCAAATGAT
TTGGACCCCA AGGATTTTCA AGAAGATCCC TTGTCAGGTT TTTG
 
Protein sequence
MTFCRMNARR GHLLFIFRVF VVGVKTSLLR LIDACSVCPD SRNFTLPEKA ISIPGLESIT 
TCQQLGSTVS ILLSDNDPLC KTVQSLGTFC GCPKAETACE ICQSGQRLSR PARELFFLSL
EGIMPSCELF EAYLHSLASD DEVCIASIFL LADYCGCTSE ADDILSGDDM SSIGGCSLCQ
NMDDLEMETE VYFPGFPLNT CGQFASTANA VSSPEADTCL YLQASIGTLC GCSVRNPATQ
PCTLCSDRSA PSRPSQMLPL LSSQFAGLTP TCEILEASLL SVELSSDECK VGHLLGGVCG
CAPVEDHCVF CPEGDPIPEY LRERELPQFA RFLDGLIATC ADIEQAQTQI PTESLICSRG
KHRKDLCCGG HFQYLGTSTV RKQAVLAWLP RGVALLSILG STYVLSNIVR HKERRATIFH
RIMIGLTVSD IVSAVAWGFT TLPVPANDDF GAPSKIYGAR GNGVTCAIQG FFIQLGFTSI
FFNVALTTYY VLVIVYNWRE TRLIKLQYWF YTIPVLLGST LAFAGIPFYT NNIMACYVNA
PPLAEKYTVI ALLAVVPVCF VVLFCTTAMA RVYLHVRHQQ KRASNWRMGG SGKCIEQQVL
WQSVFYVGAF YISWPIQVVG IFMSEPPFRE HTPYAFWLLM VSLAPIQGLL NFFVYIRPRL
SKKRDSAGSS PADSGIQGSS WLLSTRFAKV LSIFERQREA RDETSLLNEQ VSHKRNEAND
LDPKDFQEDP LSGF