Gene PHATRDRAFT_37956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37956 
Symbol 
ID7202694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp509184 
End bp510332 
Gene Length1149 bp 
Protein Length382 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182079 
Protein GI219123537 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.834363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCACGA GGAGAGGAGA GATGTTGCTT GGTCTCAATT GGATTTTACT GCTTTACCCA 
ACAACGGAGT TCCAGGTTGC ACATTCTCGT TGTATTGGAC GTTTGTTTTC GACGGTCCAC
GATGATGATA CCAATCACAA TCAATTGAGT AACAACGATC AAGATAGACC CCTTCCGTGG
ATGAACTGTG GCGTGCTCAT TTCATCTTTT AGTGACGGTG TGATGCCAAA TGAAGATGCA
CAGATATTTT TGCAGCGCGG ACTAGTAAAT GCTCTTTTAC TGGAAGAACG GCATCGACTG
GAACATGCAG TTAAAGCCTC GGCGATCCAA AGTCCATGTT GCGGACCCGA TGTTACGGTA
TTGGACCGTT TGCAAGATGT TGACAGACGT ATAGAGCAAG TCGAGAAGTA CGCCACTCCT
CTCGATCTCT TGAATGCGCA CGAGCCGGTC TCCATTCGCC TCCTTTATAT TCCTACCGCT
ATGTATGCTA TACGATCAAA TTCTGAGAAT ACGCCTGGCA AACAACGGCA ACGCGCTCGG
GCAGACGGAA AGAAGCGAAG GACGCGCATA GTGGATGTTT TGAAAGAGCT AATTCCGACT
GAAAATACAA CGATCTTGGC AGCGACTCTC GATTTCGACG ACGGCTCGGT CAAACAAACG
GAAGGAGCGG CTAGTCAAGC GGTGTTTCCA CAAAGCGGGA AAGACGCAAT GCGTGATTGG
GAACCTCATA TTATATATGT GGAAGGAGGG AACACCTTCT GGCTTTATCA TTGTATTGAA
AAAGGACACT GGAACGAAGA TCTGGTGAGA TATTGTACCG GCCCGCGACA AGGCGTATAT
TGTGGCTCTA GTGCTGGTGC TATAGTAGCG GGGGCGTCCA TTGAAACGGC TTGCTGGAAA
GGATTGGACG ATCCAACTGT CGTTCCGGGT AGGAATGGTT ACAAAGATTG GAAAAACGTT
ACGGGTTTGC GCTTAGTCGG CGCTACTTCG ATCTTTCCAC ACATGGAAGA CCGGTGGGCA
GATACCGTAC GGGAAAAACA AGAAAAGCTG CGCGAACCAG TTCTTTGTTT ACGCGACGAT
GAGGCGCTTT GTGTGTCTGG CCATAAGCAA TTGGCATACG TTACAAAGGG AGCGCAAATA
GCAAGCTGA
 
Protein sequence
MITRRGEMLL GLNWILLLYP TTEFQVAHSR CIGRLFSTVH DDDTNHNQLS NNDQDRPLPW 
MNCGVLISSF SDGVMPNEDA QIFLQRGLVN ALLLEERHRL EHAVKASAIQ SPCCGPDVTV
LDRLQDVDRR IEQVEKYATP LDLLNAHEPV SIRLLYIPTA MYAIRSNSEN TPGKQRQRAR
ADGKKRRTRI VDVLKELIPT ENTTILAATL DFDDGSVKQT EGAASQAVFP QSGKDAMRDW
EPHIIYVEGG NTFWLYHCIE KGHWNEDLVR YCTGPRQGVY CGSSAGAIVA GASIETACWK
GLDDPTVVPG RNGYKDWKNV TGLRLVGATS IFPHMEDRWA DTVREKQEKL REPVLCLRDD
EALCVSGHKQ LAYVTKGAQI AS