Gene PHATRDRAFT_35048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35048 
Symbol 
ID7200044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp952486 
End bp953574 
Gene Length1089 bp 
Protein Length362 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179545 
Protein GI219117501 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.541966 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGG CTAGCAAGAA CAAGACGCGA ATGCGAGAGG AAAGCAAGGG GGCCTTCCGA 
GTAGCCAAAG AGGAAGAAGA AGCATCTATT GGTGGCTGTC CTGAGAACAA TAATCCTGTG
CACAAGACTG TCGACGAGGA CAAAACGCCA CAGGAAAAAG GGGGATCGAA CAAGTCTTCG
ACAGTAACGG GGAAGAGCAT CATTAACAGT GGAGACGCCG AGTCATCGGA TGAAGACGAC
TTGCTCGAAG CAGCAGCCGC CTGGGCGGAG GGAGACGACG ATGACAAGCA AATGAAAAAC
CTTAAGGTGC ACCAATCGAA CACCAAAAAG CCGCCCCAGA ACAAGAGCAA ACAATCAAAG
GACAAGTTGT CAACTTTGAA TGATAATACG GCGACTGCCG CAAGCTCGCT TTTATCGGAG
ACTTCGCCAG ACAGAATGTG GTCGCTACAC ATTACGCAGT TGGACTTTGA CACCACCGAG
TTCGACCTGC GTCAACATTT TGTCACGCGA GGATGCGCGC TTTCGTCCAT TCGACTCGTT
TGCGATCGCG GATTGAACGG CAAGAAACTG TTTCGGGGAG TTGCCTTTGT CGATGTCCTG
GACGAGGAGT CGTACAAGAC GGCATTGGCC TTAGATAAAA GTGATATGTT GGGACGCAGA
ATCAATGTGC GACCAACCAA GACCAAGTCC GAGTTGGCTG ATATTGTGCA ACGTACCAAA
GAGATTGTAA AGGAAAAGAT AAAATTGAAT TTAGAAGAAA TGGATGAGCG AGAGGCTAGC
GAAAAGTCGC ATACGTCGCC AAATACGGAT AAGAAGAGAT CGCGAAAGGA CAAGCAAAGA
GATGGAAAGG AGCGCAAACC GAAACGTCGC AAAACCGAAA TGCTAAAGTC CACAGACGAC
AATACGAAAG GCAATGCAAA GACAGAAGCG GTAGTTCCCA AGGATGCAAA GCAGGCAACA
AGAGGGGTCA AGAATCAATC TCCCACGGGT TCTAAAACGA TTGGCAACAT AGATCCGAAC
CGAAAATTGT CGAAAAAGGA ACGCAATCGC AAAGCAGCCA TCTTGATACA AATGCGAAGA
AGAAGATAG
 
Protein sequence
MAKASKNKTR MREESKGAFR VAKEEEEASI GGCPENNNPV HKTVDEDKTP QEKGGSNKSS 
TVTGKSIINS GDAESSDEDD LLEAAAAWAE GDDDDKQMKN LKVHQSNTKK PPQNKSKQSK
DKLSTLNDNT ATAASSLLSE TSPDRMWSLH ITQLDFDTTE FDLRQHFVTR GCALSSIRLV
CDRGLNGKKL FRGVAFVDVL DEESYKTALA LDKSDMLGRR INVRPTKTKS ELADIVQRTK
EIVKEKIKLN LEEMDEREAS EKSHTSPNTD KKRSRKDKQR DGKERKPKRR KTEMLKSTDD
NTKGNAKTEA VVPKDAKQAT RGVKNQSPTG SKTIGNIDPN RKLSKKERNR KAAILIQMRR
RR