Gene PHATRDRAFT_41473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41473 
Symbol 
ID7199233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp253785 
End bp254954 
Gene Length1170 bp 
Protein Length389 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185405 
Protein GI219130507 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.0185557 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTATC GACTACCGCC ACCGTTATTT TCTCTTATGA GACAGGGTGT AGTCCTAGAG 
ATCATTAACG ATAATCCTCG AAGTCCGCAA AAGTCGGCGA CGGCAGCTTC ATATCTCAGA
AGAAAACTTT TGCCGACTTC TGTTCGCTCC TGTGAAGGAG CTTACATTCC AACACGGCTC
CCTCGACGAA GCAATAGAAG TAGCGCGAGC GATACCATGT TGTGTGGCGC TCCTCGCAAT
AGCAAGGATC GGAACAAGCA ATCTCGTTGG GCCGCCTGCG CAATACATCC TCTTCTCCTG
GAAGAACGTC GATTGTTACT GTTGAGGGAA GTTCAAAGAA CAGACACAGC TTTTCCATCC
GATCACGACG ATGGAGGTTC TAATTCCTCG AGTAGTCTAG CATCTGTGTT CCAAATAGAA
AGTGGTTGTA TAGACATGCG ATTCCGCCGA CGATCACTGC GACACACCAG CGCTTTGATT
GAGCACATAC TTTCCCAACG AAATACGGCG CAAGCCCAAA GTCCGAAGGT TCCCACCCGG
CGTCGGTCGG TAGAGCAAGA AAAAGAGCGA CAGCAAAGTC CACACAAAAA CGAAGCCCAT
CAAATTACTG CGGATTTTAT TCTTAATAAG ATGCAGGTTC TGGATTTTGC GGGACGACGC
GAAAGCCCAC CCTGTAAACC TACCCGCCGT TGTAGCGACG AACATCAACG ATATATGACG
CAGCAAGCGA TTGCCCAAGT GTTGGACGAA ATGGAGGACG ACAATGACTG CAAAAAAGAC
GACGATGACG ACGAAAATGA CAACGAGGAA GGAATACTCG AGAAAAAGTC GGCGAATGAA
AATCGGCGTC TCAGCGTACA ACGAAACAAG GAGACTGCCG CTATCGCGGA AGCCTTGTCG
GAAATGAATC TGGATAGTGA TGAAAACGCA GCGCAGGAAG ACTCTACCTT TACCAGGGGA
GGTCAACTCC ATCACACCGC AGGATCCGAT ATACAAACAG GCCACACCTC TTTGAGAACA
GCGCTTCGTA GAGCCAGCTT TCGTCCCAGT TTTACCCCTG AAATAACCGA AACCGCCTTA
GCACAAGCCC TGGTAGAGCT TGATGACCAA TCGGAAAGTG GCGGAGACAC CAGCCCCGAC
TGCGTTGCGC TCGTCCCCGC AGCAATCTAA
 
Protein sequence
MAYRLPPPLF SLMRQGVVLE IINDNPRSPQ KSATAASYLR RKLLPTSVRS CEGAYIPTRL 
PRRSNRSSAS DTMLCGAPRN SKDRNKQSRW AACAIHPLLL EERRLLLLRE VQRTDTAFPS
DHDDGGSNSS SSLASVFQIE SGCIDMRFRR RSLRHTSALI EHILSQRNTA QAQSPKVPTR
RRSVEQEKER QQSPHKNEAH QITADFILNK MQVLDFAGRR ESPPCKPTRR CSDEHQRYMT
QQAIAQVLDE MEDDNDCKKD DDDDENDNEE GILEKKSANE NRRLSVQRNK ETAAIAEALS
EMNLDSDENA AQEDSTFTRG GQLHHTAGSD IQTGHTSLRT ALRRASFRPS FTPEITETAL
AQALVELDDQ SESGGDTSPD CVALVPAAI