Gene PHATRDRAFT_35058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35058 
Symbol 
ID7199996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp976209 
End bp978227 
Gene Length2019 bp 
Protein Length672 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179330 
Protein GI219117071 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATAG AATCACTGGC GGAAGAATCG TCTTCCGTGT CAGTGACTGA TTTGCGCATC 
GTTTGCTTGA TTCCTTCCGC CACGGATATT TGCGTCGCTC TCGGATTGCA GGCTGCCATT
GTCGCCGTCA CGCACGAATG TGATACAAAC CTTTGGTCGC GAGCGTCGTC GCGTACGGCA
GTCAAGATCA TTACAAGAGA CGGGGTGAAC GGGAACGAGA CGTCGCAGGG TGCCATACAC
GATCAAGTTG TGGCGAGCTG TCGGGCAAAA GATGAGGCCG TCGGTGATGG GGACATTCCC
GCACTGGCGG ACGTGCCTTC CTTGTATCCC ATTTGGGAGG ACGAATTCCG CGAGGCTCTA
CAGCTCAACG ACGTTCACGA CAGTAGTAAG CGCCTTGTGA TTACCCAGGA CCTGTGTGAA
GTGTGTGCTC CCTCGTCCGA AACCGTTCGT CGGCTTGTGG GCAAGGATGC CTCACAGCCG
CCACCGCCAC ACGAGGTTCA CGTTGTGTCC CTCACGCCAC AATCGCTGTG GGACGTTGCC
GCTAATATTC TCACCGTCGG CCATGCCTGC GGGGTCCCAC GACGTGCCAA AATCGTTCAC
GATGCGTTTC TGAGTAATTT ACAGACACTG GAAACGACCG TTACCGAAGT TCGTTCCCAT
GATGGTGCAC CCAAGCTCTT CCTATTGGAA TGGTTGGATC CACCCTTTGA CGGTGGCCAT
TGGATTTTGG ATATGATGCA GTTCGCCGGC GTGCAACCCG CCCAACACAA GCACACCCAG
AAATCGACAT CGACGACCTG GGCTCAAGTC CGTCAGGCCG ATGCCGATGT GATTCTGGTC
GCCTGTTGCG GTTTTGGTCT GGAACGGAAC GTTCGTGATA CTTTCGGTGC ACGCAACCAG
TTGCAACAAC TGCGTGCCGC TCGCAATCGT CGCATCTACG CCACCAACGG TGACCACTAC
TTTGCCCGTC CCGGTCCTAA ACTACTGCAT GGTGCAATCA TAATGGCGTT GACAGCTTAC
GCGGATCAGC CGGAGGTGGT GCAAGCGATT CAGGCTTTGG ACTTTGTCGA CGCGGAACTG
GGTGGATATC AAATGGTCGA TGTTCTGGAC CCTACCATTG TACAAGCAAA CAACGATGTT
CCCGACATGG AAGACTTTGA CCGCTTGCAC CGCGAAGCCT GCAGCGCCGG TTCGTTATCC
TACCCGGACC CCGTTACGGG CTATAAGGTC TTCACCGAGC TCGCCCACCG TCAACGTGGC
AAATGCTGTG GTTCGGGCTG TCGGCACTGC CCGTACAATC ACGAAAACGT CAAGAATAAG
GCGGGGAAAA TACAACAGCC GGCCATGCTC ACCGCTGGCG ACCAGACGGG TCCACTGGCA
CTGTCCAACG GAAACCTGCA CGTGCTGTTC TTTAGTGGCG GTAAGGATTC CTTCCTGGCT
ATTCGGGCAT TGACTCGACA AGCCAAACAG ACTGCCCCGT TTGGGTTAGC CCTGTTAACC
ACGTTTGATG CCACGTCGCG TATTATTGCG CATCAGGATA TGCCGATTGA TACCGTTGTG
GAACAAGCGA CACATCTGGG TTTGGCATTG ATTGGTGTCC CCATACACCG GGGGAGTGCC
GAAGGATACG TGACACGAGT TCGCAAGGGG TTAGAGGTAT TGCAGAGCAG CGTTAAACCC
CCAAGCAAGG TCACAACCTT GGCCTTTGGC GATTTGCATT TGGAAAATCT GGTGGAATGG
CGAAATTCCC AAATCGGATC GCTCGGCTAT AAATTGCAAT ATCCCGTATT CCAGACCGAG
TACGAAATAC TGTGGCAGGA TTTGGAAGCG TCCAAGGTAC CGTGTGTCGT ATCGTCATCA
ACGGTTGATC ACATCCGTGT TGGGGATGTC TACAGCCGAG AGTTTGCCCA ACGGTTACCG
GAATGCGTCG ATCGCTTCGG TGAGAATGGT GAATTTCACA CAATCGCACA AGTTTGGGAA
GTAGACCGCA TCACGGCTTT GGGATTTATA GATAGTTAA
 
Protein sequence
MSIESLAEES SSVSVTDLRI VCLIPSATDI CVALGLQAAI VAVTHECDTN LWSRASSRTA 
VKIITRDGVN GNETSQGAIH DQVVASCRAK DEAVGDGDIP ALADVPSLYP IWEDEFREAL
QLNDVHDSSK RLVITQDLCE VCAPSSETVR RLVGKDASQP PPPHEVHVVS LTPQSLWDVA
ANILTVGHAC GVPRRAKIVH DAFLSNLQTL ETTVTEVRSH DGAPKLFLLE WLDPPFDGGH
WILDMMQFAG VQPAQHKHTQ KSTSTTWAQV RQADADVILV ACCGFGLERN VRDTFGARNQ
LQQLRAARNR RIYATNGDHY FARPGPKLLH GAIIMALTAY ADQPEVVQAI QALDFVDAEL
GGYQMVDVLD PTIVQANNDV PDMEDFDRLH REACSAGSLS YPDPVTGYKV FTELAHRQRG
KCCGSGCRHC PYNHENVKNK AGKIQQPAML TAGDQTGPLA LSNGNLHVLF FSGGKDSFLA
IRALTRQAKQ TAPFGLALLT TFDATSRIIA HQDMPIDTVV EQATHLGLAL IGVPIHRGSA
EGYVTRVRKG LEVLQSSVKP PSKVTTLAFG DLHLENLVEW RNSQIGSLGY KLQYPVFQTE
YEILWQDLEA SKVPCVVSSS TVDHIRVGDV YSREFAQRLP ECVDRFGENG EFHTIAQVWE
VDRITALGFI DS