Gene PHATRDRAFT_47599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47599 
Symbol 
ID7202656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp237527 
End bp239251 
Gene Length1725 bp 
Protein Length567 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181869 
Protein GI219123100 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCCGA GGTACGCGTA CGGGATCGCG GTCCTCCTCG GGGCGGATCA AGCCATCGCC 
AGCACCACGC CAACCGGCAT TCGTGCCGTA CCATGGGCGA CGCCACACTC TCACAACAGA
ATGATTGATC AAGCTCACCA TTTGCTGATG CAGGCACGTG TCCGTGATCT CTGCGAGTAC
GCGACGAGTA GTTATCCCAT TTTCCAAACG GGCTGTCGCA AAGAGGCCCG CGTCGCACCG
GACAACACGG CTCGGCCGAT CCCATACGGA ACGGTGCGAC CTTCCGCCCA CGCAGATACA
ACCCCAAAGC TCTCCCGTCG ACTACTCGAC TCCGACGAGG CCTTTCGTTG GGATTGGACC
AGTCTAATTC TCGCCTTTCT CTGTGTCTTG TGTGCCGCCT TTTGCGCTGG TCTCATTATG
GGTATTCTCA GTCTCGACGA GCTCCAACTC CACATTAAAA TTCGGGCGGG ATCCGATCCG
GAAGAACAAC GCTACGCCAA CCGGCTTTTG CCACTCGTGC AACAACGTCA TTTGGTTCTG
GTGTCGCTCT TGCTTCTCAA CTTTCTGGCC GACGAAGTTT TGCCACTCTG TCTCGACAAC
GTCATGCCGA CCTGGATGGC CGTCTTGACG TCCGTCGTCC TCGTCGTTTT TGTTTCGGAA
ATCATTCCCT CCGCCGTCTT TATCGGACCC GATCAGTTGC GCTTGGCGAG TCAGATTTCA
CCCTTCGCCT ACGCCGTCAT TTATCTATTC TATCCCATTG CCTATCCTAT AGCACTGCTC
CTCGACTATC TCCTCAAAGG TGAAGACGAA CTCGGCAACC AGTACAATCG GGGCGAACTC
TCCGCACTGG TACGAATCCA GTACGAAGGC CGCCTGGCGG CCAAGCGCCG GGAACTCAAG
GAACGACGCA TGGAACAGGG GATTGCGGGG CTGGACGACG ACGAGTCCCA ATTATCCGAT
ATTCCACCCT CGATTACCTT CCAGCACGAC ACGCATTCTA TTCAGACTAC CGAAGTCAAC
ATGATGCAAG GCGCACTGGC CTTGAAAACC ACCAACGCTC GCGACGTGTG TACCAAAATT
CGGAAAGCTT ACACCGTCAT TGACAGTATG GTGTTGGACA GTGGCAATGT GGCCCGTATT
TATGGAGTGG GTTATAGCCG CGTCCCCGTC TATCAACGCA ACCAGCGGAG ACCGAGAGAT
ATCACCGGCA TTGTTGGTAT TCTACTAACC CGACAACTAA TCTTGATTCA GCCCGAACAC
CGCCGACCCG TCTCGTCGTT GCCTCTGTAC CAGCCCGTGT GTGTTGGACC GGAAGCCAAC
ATGATCGAAT TGCTACAAAT GTTTCAGGGG GGCAGTGCCG GGAACAAAGG TGGGCACATG
GCCCTCGTGT GTGAGCGTCC GGGGATCGCG ACAACCGCCC TGGACCAGAA AAAGGCCATT
CCTCCGGAAG CCGGCGTCAT TGGTATCATT ACCATGGAAG ACGTAATTGA AGAATTGTTG
CAGGAACCAA TTTACGATGA AGGCGACCGA GAAGAACGGG AAGAAATGGA AAGAGCCGAG
TGGGCCTTTC GCAAATGGCG TTTGTTTGTC AAACTCCGGC GTCGCCAGCG CGAGCTCTTG
ACAGAATTAG AAAGCACTGA AGGCACGCCC TTACTGACCA ACCACAAAAT GTACAATACT
TCGACCATTT TCCGGCCGGA ATAACCTAAA CCTAGTTGCT AGAGA
 
Protein sequence
MIPRYAYGIA VLLGADQAIA STTPTGIRAV PWATPHSHNR MIDQAHHLLM QARVRDLCEY 
ATSSYPIFQT GCRKEARVAP DNTARPIPYG TVRPSAHADT TPKLSRRLLD SDEAFRWDWT
SLILAFLCVL CAAFCAGLIM GILSLDELQL HIKIRAGSDP EEQRYANRLL PLVQQRHLVL
VSLLLLNFLA DEVLPLCLDN VMPTWMAVLT SVVLVVFVSE IIPSAVFIGP DQLRLASQIS
PFAYAVIYLF YPIAYPIALL LDYLLKGEDE LGNQYNRGEL SALVRIQYEG RLAAKRRELK
ERRMEQGIAG LDDDESQLSD IPPSITFQHD THSIQTTEVN MMQGALALKT TNARDVCTKI
RKAYTVIDSM VLDSGNVARI YGVGYSRVPV YQRNQRRPRD ITGIVGILLT RQLILIQPEH
RRPVSSLPLY QPVCVGPEAN MIELLQMFQG GSAGNKGGHM ALVCERPGIA TTALDQKKAI
PPEAGVIGII TMEDVIEELL QEPIYDEGDR EEREEMERAE WAFRKWRLFV KLRRRQRELL
TELESTEGTP LLTNHKMYNT STIFRPE