Gene PHATRDRAFT_14345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_14345 
Symbol 
ID7202676 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp341927 
End bp343136 
Gene Length1210 bp 
Protein Length367 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181887 
Protein GI219123137 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTATGA ATGTGCGGCG TGATCACTTA TTGGAAGATT CGGTGGATGC CGTCATGAGT 
CTGAGCCGCA AGGACATGCG CAAGTTGTGG AGATTCGAAT TTATCGGCGA GGCAGGCATT
GACGCAGGTG GCTTGGCTCG GGAATGGTTT CAACTAGTGA CGGACGAAAT ATTCGATCCC
GACATGGGAT TCTGGCAAAG CTCCGAAACG AACCAAATGT GTATGCAGAT CAATCCGGCA
TCCCGTAAGT GCAAGACGAT AGATTTTTGG CCTTATACGG ACGTGGATGC TTGACGTTGT
GTAGGGCTCA CGCTTGCTAT TTCTTTCTTG GTTCACGCAC GTCTATACAG AAATGATCCA
CGAGGATTTC AAGGTCTACT ACAGGTTCCT GGGACGCGTC ATGGGTAAAG CCTTGTTTGA
TCGTCAATTA GTGGCCGGTC ATATGGTTCA ATTTATTTAC AAGCACATGC TAGGTTGGCC
TATTCAGTTC AAGGATATTC GAGATTCCGA TGAAGAGTTA TATTTTAATC TGAAACAACT
CACGGAGCTG GCAGCAAACG GAGAAGATTT GGAGATGCTT TGTTTAGACT TTACTACCAC
CGTTGAAGTT ATGGGCGCGA AGCAAGCCAT TGAACTGGTG GATGGTGGTG CCGACATCGA
AGTAACGAAC GATAATTTCC CAGAATATTT GGAAGCCTGT ATGAAGTACC GAATGATAGG
CCGTGTCAAG GAGCAGTTGA ATGAACTGTT ACTGGGCTTT TTTGACGTCA TCCCGGAACC
GCTCTTGACC ATCTTTGATT TTCAAGAACT CGAGCTGCTC ATGTGCGGTT TGCCGGAGAT
CGACATGCAG GACTGGCAGG ATCATACGGA ATATTCAGGC GATTACGAAA ATATCGGTGG
CGAGTATCCG ACGTGTCAGT GGTTCTGGGA GGTTGTCGGG GAATTCGATC AGGAAACCAA
GGCTCGACTT TTGCAGTTTG TGACGGGTAC CTCGGGTGTC CCTTCGCGAG GATTTGGCGT
GCTGCAAGGG AACGATGGAA ACGTGCGAAA GTTTACCATT CACGGTGTTT CGGTCGGAAC
ATGCCTGTAC CCTCGAGCGC ACACTTGTTT CAACCGTATT GATCTTCCCA TGTACGAAAC
CAAAGAAGAG CTCAAGGAGA AACTAAAGCT CGCCGTCACC ATGTGCGCTT CCGGATTCGA
CATTGAATAA
 
Protein sequence
MRMNVRRDHL LEDSVDAVMS LSRKDMRKLW RFEFIGEAGI DAGGLAREWF QLVTDEIFDP 
DMGFWQSSET NQMCMQINPA SQMIHEDFKV YYRFLGRVMG KALFDRQLVA GHMVQFIYKH
MLGWPIQFKD IRDSDEELYF NLKQLTELAA NGEDLEMLCL DFTTTVEVMG AKQAIELVDG
GADIEVTNDN FPEYLEACMK YRMIGRVKEQ LNELLLGFFD VIPEPLLTIF DFQELELLMC
GLPEIDMQDW QDHTEYSGDY ENIGGEYPTC QWFWEVVGEF DQETKARLLQ FVTGTSGVPS
RGFGVLQGND GNVRKFTIHG VSVGTCLYPR AHTCFNRIDL PMYETKEELK EKLKLAVTMC
ASGFDIE