Gene PHATRDRAFT_39093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39093 
Symbol 
ID7194811 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp314386 
End bp316141 
Gene Length1756 bp 
Protein Length552 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183201 
Protein GI219125885 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACGC CAGAGCCACA AGAATCCCAC GCTGCGGCAA AGAATCGCAC TCATCGCCGG 
CGTTCCAGCG TTTCGGAAGC GCATCAGATT TATTTGCAGT ATCATCGTGA AGCTCATCAC
GATGACGGCA GCGACTCGCG CAAAGCGACG CAGATCTCTA CTGACGGAAG CACCAACAAA
GCCCCCAAGC GCCACGACAG CACCGAAAGC CGTGATCACA AGCCGGTCAC TAGCTTTCCA
ATGTTCCACC GGGTCCAGAA AACCGGCGTT ATTTACGCGA CCTCGCGGGC CGCAGCCCGC
GGCTTTCAAG GAGACGGTGA CCCGTCGTCG GAGTGGGCCA ACATGGGTCA GGGTGCTCCA
GAAACAGGTC CTTTGCCAAA CGCCCCGTCG CGGAATTTCA CCATGCACAT CCCGGACGCC
GAGCTCGAAT ACGCGCCCGT GACGGGACTC ACGCCGCTGC GCGAAAAGGT AGCCGACTAC
TATAACTTTT TGTACCGCCA GGGGAAACAG TCGCAGTATA CGGCAGAGAA TGTGTGCATC
GTACCGGGAG GTCGAGCTGG CATTACGCGT ATCATGGTAC GGGTACAACT GTATGGATCG
CATTGCGTAC AAGTTGTCTC CAATGTATCG TGTGCGTGTT CTGACCGCAC ATTGCTGTTC
TCTATTTCTG CAGGCGGTGC TGGGAACGGT TCAGGTTGGG TATTTCACCC CTGATTACAC
AGCCTACGAG CAAGCCTTGG GTCTCTTTTT GCGGGTTTCG CCGAGTCCTT TGCTTCACCG
GGATGTGACC GAGGCTTGCA TGTCTCCAGA GGAATTTGAC TTTCAGTGCG GTGGGCGGGG
ATGTGGTGCC ATTTTGATGT CCAATCCGGC CAATCCCACT GGACAGTCTA TTGAGGGAGA
CGACTTGCGC CGCTATGTGC AGACGGCGCG GGATCACCAG ACGGCCATTA TCATGGACGA
GTTCTATTCG TACGTTATTT CTCTCGTTCT GGCTTGCGCA ACCGAAGCTG GTTGTCTACC
GGGTACTCAC GAAGATTTTT TGTTCGAAAA TGCCATTAGT CACTATTACT ATGACGGCCC
AGATCAGCCA TTGGACGGTA ATACGAATGA TCTGCACAGC TGGCCCAAGA CCGTGAGCAG
CGCCGCCTAT GTGGACGATG TCGACGAAGA TCCGATTCTC ATTGTCAATG GATTGACCAA
AAACTGGCGT TGCCCAGGTT TTCGAGTTTG CTGGATTGTC GCTCCTAAAC CAATTGTGAA
AATGCTAGGA TCGGCCGGGA GCTACTTGGA CGGTGGAGCA AATGCTCCGT TACAGCGATT
GGCCTTGCCC CTTATGGAGT TGGCATTCAT TCGTCGGGAC GCAATTGCAC TCCAACAGCA
CTTTCGACAG AAACGGGACT TTTTGCTACG CAAACTCGAG GAACTCGGAA TCAAAGTCAA
ATTCAAGCCG ACGTCGACTT TCTACGTATG GGCCGATTTA TCAGGTCTGC CGCCGCCGTT
GAACGATTGT CTCGTTTTTT TAGAGGAATG TACCAAGCAC AAGTGCATAT GTGTCCCGGG
TGTGTTTTTC GATATCAATC CGAGAGGGAT TCGGAACATT CGCATGAGCA AGTGCCTGCA
TCACGTACGT TTCAGCTACG GTCCGCCTAT GGAAAATCTT ACCAAGGGAA TGGAGTTGAT
TTGCCAAATG ATTCAGTATT GGAAAAAGTG TCCCGAGCCG CCTGACGCGT ACGCGACCGA
GTCGTTTGGG GAGTGA
 
Protein sequence
MTTPEPQESH AAAKNRTHRR RSSVSEAHQI YLQYHREAHH DDGSDSRKAT QISTDGSTNK 
APKRHDSTES RDHKPVTSFP MFHRVQKTGV IYATSRAAAR GFQGDGDPSS EWANMGQGAP
ETGPLPNAPS RNFTMHIPDA ELEYAPVTGL TPLREKVADY YNFLYRQGKQ SQYTAENVCI
VPGGRAGITR IMAVLGTVQV GYFTPDYTAY EQALGLFLRV SPSPLLHRDV TEACMSPEEF
DFQCGGRGCG AILMSNPANP TGQSIEGDDL RRYVQTARDH QTAIIMDEFY SYVISLVLAC
ATEAGCLPGT HEDFLFENAI SHYYYDGPDQ PLDGNTNDLH SWPKTVSSAA YVDDVDEDPI
LIVNGLTKNW RCPGFRVCWI VAPKPIVKML GSAGSYLDGG ANAPLQRLAL PLMELAFIRR
DAIALQQHFR QKRDFLLRKL EELGIKVKFK PTSTFYVWAD LSGLPPPLND CLVFLEECTK
HKCICVPGVF FDINPRGIRN IRMSKCLHHV RFSYGPPMEN LTKGMELICQ MIQYWKKCPE
PPDAYATESF GE