Gene PHATRDRAFT_48701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48701 
Symbol 
ID7194685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011686 
Strand
Start bp671356 
End bp673313 
Gene Length1958 bp 
Protein Length615 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183137 
Protein GI219125751 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCAAG CTGCGTCTTC GACAATCAAC CGCATTGCTG AACTCGAAAA CGCGTCCGGA 
AGGGACAGAA TCAGAGAACA CGATGGCAAA CCCACAAGCG AGCCACCTAT TGTGCACAAA
AACGAGGCCA AAAGCTCTAC CAGCTGCTCA CAGATTGTGC CCCACACTTC CCCTTCGGCC
CTGCACCTTC TCAAATCAGC TGTGCAACGA GTGCAACAAG AAGAAATATT GGCTGCCGAG
GCATTACCTA CACCTTTTAT AGGCGTTTTG CGATTTGGCC CTGGCAAAGC TGTCCCTAGG
AAAAGAATTA GTCCTTCTGG TAAGCGGCCG AGACATACAA ATGACCGTTC ATTTTCTGAA
GCATGGATGC TCGAATTACT CGACAAAGTC GATCCTGAAG CTATCAATCC CTTGGTAGCA
TTTGCCGCAG CCTTAAAAGA CATCCCGGAT GAGGAGAAGA CTGGATACTA CAACACCTTG
CGGGAAGCAC CCGACTTGAT TGCTTGTGAG TCGAATCCAC TCATTTTCTT AAAATTTCAT
TCGGACAATG TCGAGGCAGC TGCTCAGCGC TTTGCACTGT TCTGGCAGGA ACGGTATCGG
ATCTTTGGGG AAAGAGCTTA TCTACCGATG GATTCCACTG GAAATGGTGC GATGTCACGC
CAAGATCTGA TCGTCTACAA CTCACAGTAC CTCTTTGTGT TGCCTCGAGA CGATGCCGAG
GAGTCTGTTG TTTGCTATAC GCCGTCCTTA GTGGATGTGG ACTGTAAAGA GCGCATGAGA
TGCGTTTTCT ATACATTTTT CCAAGTATTT AAGAATCCGA TGTCATCGAA AACAGGATTC
ACTGCCATAA CAGTCTTCGA CAAGATTGGT ATGGAACGTG CTTACGGTAG AAAGCACCCC
GCGGATCTGC TGCGAGATGC ATTTCCAATG CGAATGAAAA AAATCCATGC TGTTCTGTTG
CTGGATAGCG AGCACGTGAC CTACTTCAAA CAAAGATCAG AGCCTCTATT TATACAGCTT
TACTCTTGCC CCCTGAACAC GCATTTCGAG GGAACAAAAG AGGAGATAGC AAGTTCATTG
GTTTCATCCG GGTTCAGAAG TGCTACTCTC CCGGAATCTG TCGGCGGCAT GCTCCGATAC
TGCGATGTCG AAGAGGCTTT TACAAAACAA CGACAAAAAG AAACAGTAGA AGAAATAGAA
CGCATGAAAA TGAGAAAACG CCTATCGGCA ACTGCTTCTG ACATTTCCGA GGCGGCCTAC
GACCTTTCAG CCACGGAAGC AGATATCACA CGCTCGGATA GCGACGACAG CGCATCCCGA
TGGAAGATGA GGCGACGACG TCAACTGAAC AACGAAGCGT CAAAGCGAAA GAGAAGAAAA
ATCAAGCAAC ACGAACACGA TTTAGAGAAA CAATGTGCGG CATTGAGAGA GCAAAATGCA
AGGCTCAAAA GCGGCAATTA TCAATTGCAA CGTCTTTTGA TACAGGCGCA AGAAATTGTC
ACTGTGTATG AGGAACGCAT TCTCCACCCG AATCGTGCGA TTCAAAACAC TAACCAGCTC
AATTCCATGA TAAGCAACGA AAATCATCAG GTAGTAGTCC AGCCCAGCAC GACTGCGCTT
TCGACACCTG TACCCGTCAC TCGTCTTGCC GATGCTATAG GACAGCCTTC TGGTTTTTCT
GCACACTTTC TACTCTCACC TGAAGATCCA TTTAAATCCA ACGCAGTGAG CCTGGCTGAC
GTTCGTTTGC TCCCGGGAGG CTTGTACGAG CGACCAATCC AATCAGATAC CATCGGACAA
AGCGAAACAA AAATGCCGGA TTGTGAACGA AAGGCCCGGG TACCTTAGTT TCATCGGCCC
TAACATGCGG ATATGTCCAG CGCTATTTAC GAATAGAGTT CCTCTTGTGA AACGCGCTCT
TTCAGTTTAT AGTACTAACA GTCAAATACT TGATGGAT
 
Protein sequence
MNQAASSTIN RIAELENASG RDRIREHDGK PTSEPPIVHK NEAKSSTSCS QIVPHTSPSA 
LHLLKSAVQR VQQEEILAAE ALPTPFIGVL RFGPGKAVPR KRISPSGKRP RHTNDRSFSE
AWMLELLDKV DPEAINPLVA FAAALKDIPD EEKTGYYNTL REAPDLIACE SNPLIFLKFH
SDNVEAAAQR FALFWQERYR IFGERAYLPM DSTGNGAMSR QDLIVYNSQY LFVLPRDDAE
ESVVCYTPSL VDVDCKERMR CVFYTFFQVF KNPMSSKTGF TAITVFDKIG MERAYGRKHP
ADLLRDAFPM RMKKIHAVLL LDSEHVTYFK QRSEPLFIQL YSCPLNTHFE GTKEEIASSL
VSSGFRSATL PESVGGMLRY CDVEEAFTKQ RQKETVEEIE RMKMRKRLSA TASDISEAAY
DLSATEADIT RSDSDDSASR WKMRRRRQLN NEASKRKRRK IKQHEHDLEK QCAALREQNA
RLKSGNYQLQ RLLIQAQEIV TVYEERILHP NRAIQNTNQL NSMISNENHQ VVVQPSTTAL
STPVPVTRLA DAIGQPSGFS AHFLLSPEDP FKSNAVSLAD VRLLPGGLYE RPIQSDTIGQ
SETKMPDCER KARVP