Gene PHATRDRAFT_48494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48494 
Symbol 
ID7203773 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp683306 
End bp685898 
Gene Length2593 bp 
Protein Length582 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183009 
Protein GI219125480 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.078765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GCGGACAAAC GAGAGCAACG ATGACCATGG AGTTCTTTTA CCTGATTGTC GTCATGTTGA 
CTTTTCTGTT GCTGCCTTCA GCTTCGGCAA GTTTGGGCTC TCACCGAAAA CTCAACCGAG
CTATAGGGAG TGAAGCAATC CCAGGACAAT ATATCATCGC ATTGGATTCA AACATCCCAA
ATTCAAGAGG GTTTGCCACC CACATACTCA AACGCGCGTT CCGAAATAAC ATCATCGCGA
CCTACGACTA TGCCATGAAA GGGTTTGCTG TCAAGGATCT CCCCGATATG TTGTTGCACT
TCATTCTCAA CATGGATGAC GTAATCTCTG TATCGGAGGA TGCGGTTGTG AAGGCGGAAA
CGGTACAAAC AGATCCTACG TGGGGCCTCG ACATCATTGA TGGACAAGCT GACAACCTCT
ACAGTTACGC GTTTACGGGA CAAGGCGTCG ATGTTTACGT TCTTGACACG GGTATTCAAG
CAAATCATCC GGACCTTGAA GGTCGCGTCG AGAGCTGCGT CTCCTACAGT GGAGAAGGTG
CGTTTTGCGT CAGTGGCGAA TGTCGCTGTA TTTGATGATT CACTTAGATT CTTACCACCG
TCTACGTCAA AAAAGAAAAT GACGAAAAGG TAGTCAACTC ACAAACGTTG GTCTCTCCTT
TGTGTAGAGT GCGGATCCGA TCTCAACGGG CACGGTACCC ATGTGACCGG AACTGTCGGT
TCGAAAACAT ACGGTGTAGC CAAGAAAGCA TCTTTGCACG ACGTCAAGGT GCTCAACCAA
AATGGTAGTG GATCCTACGT CGGAGTTATT GCTGGCATCG ACTACGTAGC CCAGATCAAT
AATGCAGATC CCAGCCGCAA CATTGTTATG AATTTGAGTC TCGGGGGAGG GTTTAGTCCA
GCCTTAAACA GCGCAATCAA TTCCGCAGCA GATTCAGGCG TGGTAGTGGT AGTCGCTGCC
GGAAACAGTA ACGAAGATGC CTGCAATTCC TCTCCTGCAT CTGCAGAGAG AGCATTAGTG
GTTGGTTCCA TCAACAACAG TAACCAACGT TCGGTTTGGT CTAATTGGGG CAGCTGTGTC
GACATCTTTG CGCCAGGTTC CGGGATTCTA TCGCTGTCCC CAAACAGTGG CACGTCTACA
AAGTGGGGCA CGTCCATGGC CTCCCCACAC GTAGCTGGTG TCGCCGCACT GTACTTGCAA
GCGCGTAAAA GCACCGATTT CATTACGTCC GATGGGTTGG AGGATCAGCT CTCTTCGAAT
CTGGAAGGCT CCCCCAACCT CCGAATAAGC ACTGCAAAGC TGCCGCTGGT AGCTCTACCA
ACCCAGTCGC CCAGTCGTGC ACCCACACGT CAACCCACTC CCGCACCAAC CCTCCAACCC
ACTCGTGAAC CTACCGGTCG CCCGACGCCG TCGCCTTCAA ATGAACCCTC GGAAGACGCC
GTTCCCAACC TTCCCACTAT CGACGCCATA CCTTTGATTG AGCCAGATTC GTCGCCTCCG
ACAAAGACTC CAGCTTCTTT TCCGACCAAC GCTCCAGTGC CTCCACCAAC TCGTGCTCCG
ACCAAGGCGC CGAGCAGTGC TCCAAGCAAT ACTCGTGCTC CGACCAAGGC ACCAACTGGT
GCTTCAGTGA CTCCACCAAC ATCTTCTCCG GTATTGCCAC AATGCCAGTC TAGTGGACAG
GTGTGTACTG CTTTCCGGCG ATGCTGTAGA GGATTGAGAT GTCTTCGATC GTGGTCACCA
CAGCGTGGCC GACACCGAGC GTGTCGTCCT CGCTGGTGAG GATGGGCTTC CAAGACGGTC
GCCCACTCCA CTATCCGCCC AGTCCGGCAT TCACATTAAG CCGGGTAGGC CATATCCAAA
CACTTTTGCA AGATTTCGCG CGGAATGGGC GCCTCCCTGG AGGTACCCCT TCTGAGAAGT
GATTGTCTTT CCGTAGTACA TAAGCGCTAG TATTGCGCAA ACGTCGTTAT AGAAACTGAT
TCTATGATTA TCGGTACCGT GTTGGTAGCT TATTGAAACG GGACCATGCA GAATATCCAG
CAGTATTAGG GATCATCCCA TGAAATACTA CCAACGGAAA GAACAGGATG GTCTGTGAAA
TGGTGATCGA AGTGTTGCGC GACGGTCAAA GCTCGATCAA CAGTCTTTGC CCCAACGGGT
TACTGTTGCG CAGTAAATTC CTCCGGCGGC CCAGCAGCTG ACAGACTTGC GTTTCTCAAA
ATCCCGGGAG ATTGCTTGAT CACCAAAAAT TTGGAAATAC CGCCAGAAAA ACACTTATAT
CAACAATTAG AGAGAAGTTG GCCAGTAGCA TATTTATAAT TCGACCCAAG TCATTTTTCT
TCGACTAGTG AACGCCGCTG ACTTCTTTTC CCAGCCTACT GCCGGTCTAG CCAAAGAAGT
CGGTGACCAC TACTTCGAAG CCTTTTCCTT CGACATGCTG CAGAAACTGT CTAGTCCATG
CGACTGCAAG GGAGTTTCAA AGACAGTCAT CAATTTAATG CTTCTGTTGG AAGAAAGAAC
AATGAATGCC ACAACACATC AAACGACAAA TCTGGGAAGC TCGCGAAAGC AGAAAATGGA
TAAGCAAGCC TGA
 
Protein sequence
MTMEFFYLIV VMLTFLLLPS ASASLGSHRK LNRAIGSEAI PGQYIIALDS NIPNSRGFAT 
HILKRAFRNN IIATYDYAMK GFAVKDLPDM LLHFILNMDD VISVSEDAVV KAETVQTDPT
WGLDIIDGQA DNLYSYAFTG QGVDVYVLDT GIQANHPDLE GRVESCVSYS GEECGSDLNG
HGTHVTGTVG SKTYGVAKKA SLHDVKVLNQ NGSGSYVGVI AGIDYVAQIN NADPSRNIVM
NLSLGGGFSP ALNSAINSAA DSGVVVVVAA GNSNEDACNS SPASAERALV VGSINNSNQR
SVWSNWGSCV DIFAPGSGIL SLSPNSGTST KWGTSMASPH VAGVAALYLQ ARKSTDFITS
DGLEDQLSSN LEGSPNLRIS TAKLPLVALP TQSPSRAPTR QPTPAPTLQP TREPTGRPTP
SPSNEPSEDA VPNLPTIDAI PLIEPDSSPP TKTPASFPTN APVPPPTRAP TKAPSSAPSN
TRAPTKAPTG ASVTPPTSSP VLPQCQSSGQ PTAGLAKEVG DHYFEAFSFD MLQKLSSPCD
CKGVSKTVIN LMLLLEERTM NATTHQTTNL GSSRKQKMDK QA