Gene PHATRDRAFT_49410 details

Gene Information

Locus tag: PHATRDRAFT_49410
Symbol:
ID: 7195904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011690
Strand:
Start bp: 182862
End bp: 184838
Gene Length: 1977 bp
Protein Length: 658 aa
Translation table:
GC content: 45%
IMG OID:
Product: predicted protein
Protein accession: XP_002184200
Protein GI: 219127975
COG category:
COG ID:
TIGRFAM ID:
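
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions the record does not state explicitly: that the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 182862
end_bp = 184838

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1977
print(protein_length)  # 658
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed in the record.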


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCAAAC AAAGGAATCG CAAAGCAAAG CCTCTCAAAT CAATACTACC TGATTGTCTC 
GATCCGATTG ACGAATCAAT TGAGGGATCA GAGTCATCGT CCATTCATTC GGAGAATTCG
GAACCTGGAG TAGACTATTT AAAACCAGGA AAGGAGACGG CTCAACATAC GCAAAAAGAT
AGCAACCAGC CAGAGATCAC TAAACAGTTG GACGAAGAAA AACAGAAGTA TTGTGATAAT
CCTAGCTTTC GTTCTGCAGG CATTGTGTCA GGCGAACTCA TAAAACAAGA ACTTTTAGTA
GCACCTTTCT TTCTACAGGG GACTCCTTCA CGATTCGAAG AGAATTTACA CTCAACGGAA
GAATCAGGTT CTGTTACTCC AAGTATTCCG AACATCAGGA ACGAGCCTGT AAACAGCGAT
ATCACGAAAC ATACTGTTCT CGCAAGTATG ATCACTCCGA CGGAAAACGT CTCGAACAAA
TCTATTTTAG CGGTTGTCCG CGACGACTCC TCAGCCCCTT TGTTCAACTA TGACGACGAC
TTCGACACCG ACGCTGTCGA CCCCGAAACG GAAGAAGAGC AGTTGACCTG GCGGATGGAC
CCGTCCAAGA GCTTGAGTGA CTGGAAAATC AAGGTAACCA ACAAGGAGAC ACGGCAGAAC
GAATTATATC ACGTTCACAA AAATTTACTT GCTGTTGGGC CCAAAAAATC TGAGTACTTC
GTCCGTATTT TTCGAACCAA TAATCGTCTT GATGTGGGAA CCAGTACCAC CGACATCTTT
ATGGAGAGCG TGGCTGCCCA CGTGATTCCA CAATGGCTGG ACTTTCTCTA TTCGCCCGAC
GATGAGCTGG TCATTGACAC ACAAAGTGCA ACTGGCCTCC GTCATCTGGC TCAGTTCTTC
GGAATGCGTT CCATGCACAA GAAGGCCATG GAGTTCATCG TACAAGATTT GTCCATGACA
AACGTGATTG TCTACTACAA GGATAGTGTT GTACTAGCGG ATGACAAGAT TTCCGAGCTT
GCTGCCAACC ATTGTAGCAA CAACATACTA TCAATTGATA GCAAGCACGA GCTACTCACG
ACGGTCGACC CTTTCTTCTT CCGAAGATTG ATGACAGGTC CAGAGATAGA CAGTAGAAAG
AAGCAATATC ATATCAGTTC TCTCCTGGCT GAGTACTGCG CACTGAATTC GAACGTACTT
GATGAACAGT CATTTGAACG ATTGACGGAT GAGAAATATT TGCCACTCGT GGACCGAAAT
GCTGCGTTGA CTTTATTGGA ATTGGAGGCT GATCTTGTGT TAATCAATTC TTCTGAAGAA
GAGAAAAGCG AGTTGACGAG TTTACAATTA AGGTGCATCA AGGACTTGAC GCTGTATTGG
CAAGAGCTAG AGGTTATGGA ACATGATAGG ATTATGCGTG TTTGTCGCAA ATTACCATCC
ACAGTTGTTG CTGACCTTTT GGTCAAATCG CTAACCCAGG CCAAAAAGAA AGTTGATGAG
GTTGAAGCTC AGTCTGCAGC TCAGACAGCA GCGGTAAAGC TAACGCGATC AGGATCGGCG
AAATCTCTTG CCTCAGAAGA AAACAGCAAA ACGGATTATA GAGACATGTC TTCAAAATCA
GACAATGGAA AAATGAAGGA AGTTCGTAAA GAGTACGACG CCAAGATGTC GAGTCTGAAA
CGAGAACATC AAAAATCAAT CGACAAGATC CAGCGAGACT TTGAAAGCAA GCTTTTGAAG
CTCCGAGATA TTTGTGTTGA AAAAGATAAA CACATCGCAA ACTACTGGGA CGAGCTAAAG
CGTTTTCAGC GTTTGCCAAA CCAGCCTGAA GGAAAGATCA TTCCGTCTGG TCTAATGGCA
AAAGCGACCA AGATGCCGGA AATTGGGAAT CAGCCACCAG ATGGATATTT GCTCGTCGGA
AAGGGCAAAA CTCCATCAAA ATACCCTGTA TTCTTCTACA ACGGCGATCA AGTCTAA
 
Protein sequence
MGKQRNRKAK PLKSILPDCL DPIDESIEGS ESSSIHSENS EPGVDYLKPG KETAQHTQKD 
SNQPEITKQL DEEKQKYCDN PSFRSAGIVS GELIKQELLV APFFLQGTPS RFEENLHSTE
ESGSVTPSIP NIRNEPVNSD ITKHTVLASM ITPTENVSNK SILAVVRDDS SAPLFNYDDD
FDTDAVDPET EEEQLTWRMD PSKSLSDWKI KVTNKETRQN ELYHVHKNLL AVGPKKSEYF
VRIFRTNNRL DVGTSTTDIF MESVAAHVIP QWLDFLYSPD DELVIDTQSA TGLRHLAQFF
GMRSMHKKAM EFIVQDLSMT NVIVYYKDSV VLADDKISEL AANHCSNNIL SIDSKHELLT
TVDPFFFRRL MTGPEIDSRK KQYHISSLLA EYCALNSNVL DEQSFERLTD EKYLPLVDRN
AALTLLELEA DLVLINSSEE EKSELTSLQL RCIKDLTLYW QELEVMEHDR IMRVCRKLPS
TVVADLLVKS LTQAKKKVDE VEAQSAAQTA AVKLTRSGSA KSLASEENSK TDYRDMSSKS
DNGKMKEVRK EYDAKMSSLK REHQKSIDKI QRDFESKLLK LRDICVEKDK HIANYWDELK
RFQRLPNQPE GKIIPSGLMA KATKMPEIGN QPPDGYLLVG KGKTPSKYPV FFYNGDQV
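
The sequences above are laid out in blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip the spacing, estimate GC content, and translate the gene sequence. The Translation table field in this record is empty, so the use of the standard genetic code here is an assumption; the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Assumption: standard genetic code, with codons enumerated in T, C, A, G order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = clean_sequence("ATGGGCAAAC AAAGGAATCG CAAAGCAAAG")
print(translate(prefix))  # MGKQRNRKAK
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.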