Gene PHATRDRAFT_46346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46346 
Symbol 
ID7201616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp61754 
End bp63904 
Gene Length2151 bp 
Protein Length716 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180745 
Protein GI219119993 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0737232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAAG ATAATCTTTA CGAAAAATCC CAGTCTCATT ATGAGCGCAA GGTTGCTCGC 
CAGCGTCAGT TAGAGATGAT GGAAGCCGCA GCTTCGGGTA CGGCAGCGGA AATGGAGCTC
GACACGAGCA GTTTTGCCGA AGCCGACGGC TCCTTCATGC GCCAACGCAA CCAAGAGACG
GCTCGTCGGT TTCTGCACGA CGAAGACGAC TCGCAAGAAC AAACAACCCC GATGCGAGGA
AATTCTACCA CCGGTCACCA CACATTATTT GGGCAACGAC AATCTTCCGG TCCGTCACTC
AATTTAATGG ACCATTCCGC TGGCGTTTAT ACCGAGCAGG TGTCGGGACT GTTCAGTGAT
GCGGTTCATT CCGTCGGAAA GTTCATTTTC AAGGTGTCGG GTTCTGAGAT AAGAGGAAAT
GAGCACGATG CGATAGCAGC CACGGATGGA GATGTGTTTC TGGGTGATGA TTACAAGCGT
CGGTCGTCTA TGCGTGCGAC TCAATGCAGC ACGATCTTAG CGTCGCGTGT CATGTTTTTC
AAATCTCTTC TGCGCGACAA GAAACGATTG GGACTGTTGG CCATGGTCTT TTTAAGTCTT
TTTGCGATTG TCTACGCCAC CAATATGGCA CTGCAACGCA GACCAACGGA AAAGATACTT
CGGGAAAATA ATTCTTTCCG CTTCAACGCT GTTTTGGATC ACATTGTAAC GGAAAGTGTG
AGCCACACCG AAGTGTTCCT AAACTACGAC AGTGCCGAGT ACCACGCACT CCGCTGGGTG
GCGTATTCGG ATCCGGCCCG TCTCGACCCC ATGGATCCCC TCATTCTCCA ACGGTACGCT
CTGGCGGTCT TTTACTACGG TTCTTACCTG GCCTTCTCGG AGCAATACGG AGAGCAAAAG
CCTATTCAAT TCGAAGGACA ACAGTTTGAG GGTGTCCCTA ATCCCGGCTG GACGCGCAAG
GATTACTGGA TGACGGGCAA GGGAGTTTGC AAATGGTACG GCGTGCTCTG TGAAGGCGAA
ATGGTCAACG GAGTGGAAAT AGCGGACTTT GACAAGAATG CGCCGATTAT TGCACTCAAC
TTGACCGCGA ACCAGCTGCA CGGTACACTA CCAATGGAAT TCAAAGCTCT CGATTCGTTG
CTGGTTCTGG ACTTTTCCCG CAACAGTATC GGAGGAACCA TTCCTCTGCA ACTCGGGCGT
AGCATGATCG AACTCCAGTA CTTGTTCTTG AACCGTAACG AAATTACGGG GACGCTGCCT
AGCGAAATGG GTTTCATGGA GAGTGTTCGC TCGATCCGTC TGGGCAACAA CCGCTTACAG
GGATCCATCC CGTCTACTTT TAATCGTATG TACCAGTTAC GAGATCTAGG CTTGGACAAT
AACTACATTA CTGCCGACAT TCCCGATCTA GAAGGCTGTC AGCACTTGAT TGGCCTGTAC
CTCAACAACA ATCGCTTAAA CGGCCGACTG GACACCTCAA TTGGCAAGCT GACGAACTTG
TTCGAACTAC GGGTAGACCA AAACTATCTA CAGGGCACGA TTCCTCCAGA AGTATCGAAT
ATGCGTCGAT TGGAACACTT CAGAGCCAAC AACAATCTCT TGACGGGTAC TATTCCGAAC
GAGATATTCC TCAAAACCTT ACGCATGAAG GAAGTCGATT TGCAGCGTAA CAATTTTGAA
GGGCCGTTGC CAGTCTCCCT CGGTGGTTTG TCACATTTGC GCCTACTCAA GCTCAACAAC
AATGCTTTTC AAGGTGGTAT TCCCCAAGCT TGGAATCGCA TGCACGGTTT GCAAATTCTG
CATCTCATGA GGAACAAGCT GACGGGCACA ATCCCTACGG GCATCGCAGG TTTGCGGGAG
CTCCGGGATC TGTCTGTGGC CAACAACCGC TTGGCCGGTA CGATTCCCCG CGAAATTGCC
GCGTGCGAAG TTCTTACGGA AGCCTATTTC GACTACAATC AGTTCACGGG GACGATTCCG
GCCGAATTTG GCTTTTTGAA ACACTTGGAG ACGTTACGGG TATCCGGCAA CTTGTTTCAT
GGTGACGTAC CGGTCCAAGT GTGTGCGCTC ACCCGCGAAC ACGTGTTGAC CAATTTCATG
GCGGATTGCA AAAGCAAGGT GACCTGCGAG TGCTGTCACC GGTGTGCGTA A
 
Protein sequence
MSQDNLYEKS QSHYERKVAR QRQLEMMEAA ASGTAAEMEL DTSSFAEADG SFMRQRNQET 
ARRFLHDEDD SQEQTTPMRG NSTTGHHTLF GQRQSSGPSL NLMDHSAGVY TEQVSGLFSD
AVHSVGKFIF KVSGSEIRGN EHDAIAATDG DVFLGDDYKR RSSMRATQCS TILASRVMFF
KSLLRDKKRL GLLAMVFLSL FAIVYATNMA LQRRPTEKIL RENNSFRFNA VLDHIVTESV
SHTEVFLNYD SAEYHALRWV AYSDPARLDP MDPLILQRYA LAVFYYGSYL AFSEQYGEQK
PIQFEGQQFE GVPNPGWTRK DYWMTGKGVC KWYGVLCEGE MVNGVEIADF DKNAPIIALN
LTANQLHGTL PMEFKALDSL LVLDFSRNSI GGTIPLQLGR SMIELQYLFL NRNEITGTLP
SEMGFMESVR SIRLGNNRLQ GSIPSTFNRM YQLRDLGLDN NYITADIPDL EGCQHLIGLY
LNNNRLNGRL DTSIGKLTNL FELRVDQNYL QGTIPPEVSN MRRLEHFRAN NNLLTGTIPN
EIFLKTLRMK EVDLQRNNFE GPLPVSLGGL SHLRLLKLNN NAFQGGIPQA WNRMHGLQIL
HLMRNKLTGT IPTGIAGLRE LRDLSVANNR LAGTIPREIA ACEVLTEAYF DYNQFTGTIP
AEFGFLKHLE TLRVSGNLFH GDVPVQVCAL TREHVLTNFM ADCKSKVTCE CCHRCA