Gene PHATRDRAFT_40831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40831 
Symbol 
ID7198773 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp48114 
End bp49495 
Gene Length1382 bp 
Protein Length374 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184882 
Protein GI219129409 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.353784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACCGA ATCAGCACGG TCAATTTCTA CGCTCAGTGT CCGAACTCTT CGACAATTTG 
CGAGCTGCGA TACCTGGTAT AACGCAAACA GCAAAGAAAG CAAGTAAGTA GCTACCCGAT
TGGCTGTCTT TACTCTAAAA GTGAATGGAG CCGAGCATTC AAGGGGGCGT GGGAGGGGCG
GGGTAGCGAT TGGGGATGCG GAGACTGTGC CGGAATCGGT AAGGGCAACA ATAACGACTT
CGCAGTGGGG CGTGTCCGGA TAAAGAACGT TGAGATCTTC TTATTGGGCA CGGCACACGT
TTCCAGCGAT TCTAGCGAGG AAGTGAAACT TCTGCTCCGT CATGTGCATC CCGACGCCAT
TTTCGTTGAG CTTTGTGAAG CTCGCATACC TCTCCTTGAA GGAACGGCGA AGGACGAACA
CGAAGAAGAA GCATTGGCAC ACCAGAATCG CACGATGTGT GAAAAAATAC GGCAGGTACA
GTCCACACAG GGAGGCTCCC GTCTTCAAGC TCTTTCCACA GTTTTGTTGA CTTCTGTCCA
AGAAGACTAT GCATCCGAGT TGGGAGTAGA GCTGGGAGGC GAATTTCGGG CCGCATACCA
ATACTGGCAA GCGCAACAAT CCATACCGAC TGGAACAAGT TCTCAATCTT GTGCTTTGAT
TTTGGGCGAT CGTCCTCTAC AATTGACACT TGTACGTGCC TGGGAGTCTC TCGGGTTTTG
GCCCAAGGTA AAGGTTTTGC TAGGTCTGCT TTGGAGCTCA TGGCAAAAGC CGAAAAAGGA
GGAAATCCAG GAGTGGCTAC AGTCTGTGCT TCGGGACGAA ACAGATGTTC TCACGGAAAG
TCTGAAAGAA CTGCGCCGTC ATTTCCCTAC CCTTTTCACA GTAATTATTG CAGAACGTGA
TGCATGGCTA GCTGCCAAGC TTGTACAAAG CTGTCGAGTA TTATCAGCCT CAGCAACAGC
AGCTTCTCCT GTATGCACGG TCGTGGCCAT CGTTGGTGCT GGACATATCC CAGGAATTGT
AGCCTGGCCC CACATTGTTG CGCACTGTCA AAGACAGTGT CGTTACGAGG AACATCCTTG
TCGAAGAGTC GTTGATTTCC GTTCGGAGGC GTTGTCAGTT GAGGTAGTCC AACCGACATG
GTGCTTGTGC AAGTTCCAAA GAATTGCACC CCTCACTCAC TTTCATATCA ACCTTCTGTT
CTACAGATCC ATACAGCGCA GCGTAAGTGA GAACAAAGCA ATAGGTTTTT GATGCGATAA
CAGTTTCTCG GGATCACGAG GATGACACTT TTTTGGCTTT GTACAAGAAC AAAGCAGGCG
CAAAGATAAG CGGTCGGAAG TCAAAGTCAA AGTGTACAGG TACGACGGCC TTTACCGAAT
AG
 
Protein sequence
MGPNQHGQFL RSVSELFDNL RAAIPGITQT AKKAMGRVRI KNVEIFLLGT AHVSSDSSEE 
VKLLLRHVHP DAIFVELCEA RIPLLEGTAK DEHEEEALAH QNRTMCEKIR QVQSTQGGSR
LQALSTVLLT SVQEDYASEL GVELGGEFRA AYQYWQAQQS IPTGTSSQSC ALILGDRPLQ
LTLVRAWESL GFWPKVKVLL GLLWSSWQKP KKEEIQEWLQ SVLRDETDVL TESLKELRRH
FPTLFTVIIA ERDAWLAAKL VQSCRVLSAS ATAASPVCTV VAIVGAGHIP GIVAWPHIVA
HCQRQCRYEE HPCRRVVDFR SEALSIQRSV FDAITVSRDH EDDTFLALYK NKAGAKISGR
KSKSKCTGTT AFTE