Gene PHATRDRAFT_39457 details

Gene Information

Locus tag: PHATRDRAFT_39457
Symbol:
ID: 7195162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 564018
End bp: 565184
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002183380
Protein GI: 219126262
COG category:
COG ID:
TIGRFAM ID:
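
The start, end, gene length and protein length fields above can be cross-checked with a short Python sketch. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(564018, 565184)
print(length)                  # 1167
print(protein_length(length))  # 388
```

Under these assumptions the computed values match the listed Gene Length (1167 bp) and Protein Length (388 aa).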


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCGAC GAACATACCA GCCTCGTCGC GAATGCTTGA CGATCCTCCC TACTAACACG 
AAGTGGAATA CCTATCGTAT TGTCATAATT TTGCTCGTAC TCGCGCTTGA CTTCAATGGA
TCGGTCGTCG TACGCGCGAA ACTCTTAGCC GCCGTGATCC TTCCGCACGG CGACTTTGCC
TACGACCCGA CGTTGCTGCC GACATCCCAT CCCGGTCGAC CGATAGCCGA TCGATTGGCT
AGTACATCGC GTGCCGTCGG TCACTGGCTT GTCCAAAAGA ATGTCGCTCC CGACGTTCTT
TTCTTTTCAA CGCCACACGG GATTGCCCTG TCCAACGATT TCGCCCTGTA CCTGGGCTCC
ATGGCCAGTG GGACCGCGCG GATTGGAAAA GATCTACGCA ATGCCAGTTT CATTCCCTAC
AATGTGCGTA TTGCCAACGT AACATTGGCT CCCACAATGG TGGCAGATCT TATACATTAT
TTGCGAGTAC TGCGGCAACA AAATGTGTCG GGCGTTTCCA CCTCGCCCGA TGATGCCGAT
GACGTTCCTT TGCATTGGGC CGAAGTCATT CCACTTTCGT TTCTGGATTC CAACAAAAAA
GGCAGCGTTG TAGGAAGAGA ACAATCCGCT CTCAGAAAAT CTCATCGGCG ACGTCGCTTA
CCCACAGGCG CCGGCAAAAC GAGGCAGCAT TTGATATGGT CCCATCCGTT GCAGCGGTAC
AACGCAGCAC CAGCCATGGT TCCGGAATTG CTGCACGTAG GATGTCTACT GCGGACTTGG
CTTGAACAGC GCCCTGAAAC ATTTGCAGTC GTCGTCTCCG CCGATTTGTC TCATACACAT
CGACAGGACG GTCCCTACGG ATATTCCAAC ACTTCAATAG CCTTTGATGC CGCCCTAGTG
GAATGGGCGA GCGGCAATCC TTGTCGGAAC CAAGCTGCTC TACTGGAGCG GGCTCGACAC
TTACAAGCAG GAGCAAAATC TTGTGGATAC ACCGGCCTGG TCCTTTTGCA CGGAATACTT
TGCTCAAGCA ACGAAAATTG GGAATCCCAG TCGCCCCACA ATTCGCAATT GTGGGAATCC
AAGGTATGGG CAAACGGAAA CGCGACCTAT TTTGGCATGA TGGCCGTCAG TATCGCAAAG
AATTTTGAGG ATATATCCTC ACCATGA
 
Protein sequence
MARRTYQPRR ECLTILPTNT KWNTYRIVII LLVLALDFNG SVVVRAKLLA AVILPHGDFA 
YDPTLLPTSH PGRPIADRLA STSRAVGHWL VQKNVAPDVL FFSTPHGIAL SNDFALYLGS
MASGTARIGK DLRNASFIPY NVRIANVTLA PTMVADLIHY LRVLRQQNVS GVSTSPDDAD
DVPLHWAEVI PLSFLDSNKK GSVVGREQSA LRKSHRRRRL PTGAGKTRQH LIWSHPLQRY
NAAPAMVPEL LHVGCLLRTW LEQRPETFAV VVSADLSHTH RQDGPYGYSN TSIAFDAALV
EWASGNPCRN QAALLERARH LQAGAKSCGY TGLVLLHGIL CSSNENWESQ SPHNSQLWES
KVWANGNATY FGMMAVSIAK NFEDISSP
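
The sequences above are laid out in space-separated groups of 10 characters, 60 per row. As a minimal sketch, the layout can be stripped and the GC percentage computed in Python; the helper names are hypothetical, and only the first row of the gene sequence is used here as sample input, so the printed percentage applies to that row alone and not to the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGGCGCGAC GAACATACCA GCCTCGTCGC GAATGCTTGA CGATCCTCCC TACTAACACG"
seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this row only
```

Passing the full gene sequence block to `clean_sequence` would let the result be compared with the Gene Length and GC content fields of the record.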