Gene PHATRDRAFT_40447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40447 
Symbol 
ID7198167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp413402 
End bp414727 
Gene Length1326 bp 
Protein Length441 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184366 
Protein GI219128325 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.12237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCGT CCCTTCAAGA CGAATCGGCC CTGGAAGCAG AAATTCATAA ACTGGAACAA 
GAGATTGAGA AAAAACGCCT CGAAGAACAG ATATCTCTGT TGGAAATGCA ACTCCATCGG
GCCAAGCCTG AGGCGCAGCG CCCGAACGCA CCATCCAACC CATCGGTGGT AGGACAATCT
ACCTCCACTT CTCGAACTTC CAGTCCCCGC TCGCTGGAAC ATATGCTTCC CACCCAAAAC
GTCAGTTGGT CTGGGGCCAA CGGACAAAGT CCAGAAGACG ACAGTATTAT CGCGTTGGCG
ACTCAAGATT CCGAAGGCGT GGAATACGAC GAGGAAGACT ATGACGAAGA AGAATGCGAC
GAAAATGAGT ACGAGGAAGT GTTCGAAGAA TTCGTCGAGT ACGTTACCGA CGACGAAGAA
GAAGAAGTTG GAGAGCAACC TCTCGAAAGC GTTGAAGAAT ACGAAGAGGA AGAGGAAATC
CAACAAGCCC CGGTTTCACC CTCCCGAGCT CCTCCTCGTC AGTGGCCTCC TCCGGCACGT
CCAGTGAACG ACGAGCATGT AGTTCAATAC GTGGCACCAA AAAAGAACAC AGAAGCTCCC
GAAGAACCGA AGCAAGTTGC CAAGCCTCGA AAGAAGTGGG TGCCCTTGAG TCAACGGGAT
CCCAAAAAGT ACCAAGCAGC CAAAGAAGCA ACCCCTACAC CTCCCAAGTC ACCAGGACTG
CCCGCTTTGC CGTTCACGCG ACGTAAGCTC CCCGACGTTA CGACGTCGCC TCCCGGTGAG
GAAACAGTTT GGGAACAACT GTTGGGTCCC AAACTCATTG TCAATGAAAA GTTGGTCAAA
TGCACAACCA ACTGTGCTGC TCAGGGACAA GAACTTATTC TGCTCCTGTT TGGCGCCAAG
TGGCGTGCAG AATGCAAGAT CTTCTACCCA CTCATGATCG ACTTCTTCAA ACTAATGGCT
CACCAGCACA AAATGGAATG CGTGTACATC TCGAATGATC GTACCTTGAT GGAGTTTAAG
GATATTTTTG TCAAAATGCC CTTTTTAAGT TTGCCAACAG GTACGGTGGA AATCAAGAAT
ATCTTGGCGC AACGACTGAA AGTGAACGAC TTGCCTGTAT TGGTCGTCAT GACCGCCGAC
GGTCGTGTCA TCACAACGGA AGGATACCGC ATGGTGGCAG CCCTGGAGCG TCGGAACGAG
GACCAGGCTA ACAAACTGGT TGATGTCTGG AAAAAGGCGC AGACGTACAA CATCGATCAA
GTACCAGCCG ATACCAGTCT CAAACATGGC AATTTGGCGC GGGGAACAGT CTACTGGCAA
GCATAA
 
Protein sequence
MASSLQDESA LEAEIHKLEQ EIEKKRLEEQ ISLLEMQLHR AKPEAQRPNA PSNPSVVGQS 
TSTSRTSSPR SLEHMLPTQN VSWSGANGQS PEDDSIIALA TQDSEGVEYD EEDYDEEECD
ENEYEEVFEE FVEYVTDDEE EEVGEQPLES VEEYEEEEEI QQAPVSPSRA PPRQWPPPAR
PVNDEHVVQY VAPKKNTEAP EEPKQVAKPR KKWVPLSQRD PKKYQAAKEA TPTPPKSPGL
PALPFTRRKL PDVTTSPPGE ETVWEQLLGP KLIVNEKLVK CTTNCAAQGQ ELILLLFGAK
WRAECKIFYP LMIDFFKLMA HQHKMECVYI SNDRTLMEFK DIFVKMPFLS LPTGTVEIKN
ILAQRLKVND LPVLVVMTAD GRVITTEGYR MVAALERRNE DQANKLVDVW KKAQTYNIDQ
VPADTSLKHG NLARGTVYWQ A