Gene PHATRDRAFT_46994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46994 
Symbol 
ID7202231 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp126464 
End bp128773 
Gene Length2310 bp 
Protein Length730 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181306 
Protein GI219121923 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00183734 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTGATAGTAA GGCGTTCAAC TTCACGAGCT GACCATAGCG TGCATCGCTT GTGGCCTACA 
GATAGTTTTA ATACAATGAA TCTGCGAGAT GATTCAATAT CAGATTTTCT GGAATTTTTG
TCTCGTCCAG AATCTTCAGC TGCAAGCCTG CTTTGGAAAG ATGAACCCAA AGCGACCGAT
ATGCTCCGTC GCACCTGCAA GGTTCTTTTC CAACGCACTG AACATCTCGC AAAGAAGCAC
CCCACCATAA TGAAAGATAA CAGCATAAGT TCGGGGCTTT CCGCTCTACC TGAGCTTTAC
ATGGGCAGCG TAGACGACTC GGTTGATGCC GAAACTCTTT GGGGTCAGGT TGAACTGCAA
AATGAAGCTT TGCAGAAGCT GCTGAAGAGA TCAGTGGCAC AGCTCGCTAA GATCGCAGAG
GAAGGAGGAT CGTCTATCAA ACTTCTTGAT GACGTTTTGA GTGGAGATAA TGACGGCGAA
ATTTCTGTTG AGGATCACAT CGAAAGGCAA GACCATGAAG CATTGAACGT GAATGTTGAT
GACGCAACCC GACGGGTTTG GGAACGTATG GAGCGTGCTA TGGACGACGT GGACGAAGAG
GATCCATCGG ATAACAGTGT AAACAACATA GTGGTCGGAG AAGAAGACGA GCGTAGATCT
GTTGATGCAA GTTCGATCGA AGACCCTGCA GCTGACGAGC TCAATGACGG GTTCTTTGAC
ATAAATGATA TGGAGGCTTT CGCCGACGAA GAGGAAGAGT ATCTTCCCGA CGAAGCTTTT
GGTTCTATAC CACCTGATTC ATCAGAGACA GCTGATAATC GATCGTTCCA CCAGAAGCAG
CGCGACGGAT ATTTTGACAA CAACTCCGAG GAAGTTCTCG ACGATGAGCT TCGTCGTGGA
AAGGAATCTC AAGGAAATCG AAAGAAATAT CGAGAAGATG ACGAGATTGG GGCACTTTAT
AAACTTTACG ATACTCCTCG AGACGACGAT ACAAATGGCG AAGATGCTGA TCCCGTGAAC
GTGAAAGCTG TTGATGTTTT TGGAAAGCCA AAGGAGAAGG ATTTTAAGAA ATGGAATTCT
CGAGTGAGAA ATAAAGCAAA CGACAAAGGC AACGACGGCG ACGACGATGC ATGGAACGAA
GATGGTGTTG ATCAAGCTAT AGCTAGCAAA ACGACAGGTT GGAATAATGA CGAAGAAGAG
TCCGATGCGG TTATTGAATT TAGCCGTGGT GATACATCAT CATTGTACAA AGAACATGGC
GAAAGCAAAA TCGAAATCGA TCAGAGATTA AAAGCGGGAA GTTCTACTTT TACGAAACAA
CAAGAGAGGC TTCGTCGTCA GACAGAAGAA CTAGAAAGGG AAATGATAGC TGAAAAGCCT
TGGCAAATGA CTGGTGAGTC TACATCGACA TCTCGTCCTG TGAATTCATT GTTAGAGTCG
ACGCCGGAAT TTGAGCGCGC TGCCAAATTG GCACCGGTTA TCACAATAGA GCACACAGCG
GATCTCGAGG AAATTATCAA AAATCGCATC ATTGCTGACG ATTGGGATGA CTTGGTGCCT
CGAGAGCTGC CTGATATTGG CTTTGGTCAA AAGAAAGGCG AGTTACCGGA AGTTAGCCAA
GAAAAGTCCA AGCTGGGCCT AGGTGAGCTT TACGAGCGCG AATATCTCAA GAAGGCTATA
GGCTACGATG TGTCTGCTGC TGAAAAAGAA TCAGAAGAAG AGAAAGCAAA AAGTGAAATG
AAGACACTCT TCGCAAACCT TTGTAGCAAG CTTGATGCTT TATCCAACTA TCACTTCGCG
CCACGCCCTA TTGCAGAGGA GGCTGAGGTG CGACCTGTCA CTAAGCCAGC GATTGCGATG
GAGGAAGTTT TGCCTTTGCA TGTAAGTAAT GCTCGTGGTG TCGCGGCCGA AGAGGTGTAC
GGCGCGAAAC GTGGTAGGGA AGCCATTCTC CGAAACGAAA CCGAGCTTGA CCAAAAGGAT
CGGAAGCGCG CCAGAAGCTT AAAAAAGACG GCTAGGCGGA AAGCAAGAAA GGAGAAACAA
GCGGACGAGA AACTCATCTC TCGGCTTCAA CCCGGGCTTG GGCTTAATAA CCCTTACGAA
CGCAGGAAAA TGCGTGAAGA ACTATCTGAG GCAAGGGCAC GAGGCAAGGT CACAACTGGC
GAAACTGACA TGAACGAGTA CGGCGGTAGT GGGACTTTCT TTAAGCGCAT GCAGGAAGAA
GCTGAGCAGT CTATTAATGA TCGTAAGACT GATGGGTCAG GGAAGAAAAA TGTGCGCTTG
CACCCCAAGT CAAGTTCACT GAAGCTTTAA
 
Protein sequence
MNLRDDSISD FLEFLSRPES SAASLLWKDE PKATDMLRRT CKVLFQRTEH LAKKHPTIMK 
DNSISSGLSA LPELYMGSVD DSVDAETLWG QVELQNEALQ KLLKRSVAQL AKIAEEGGSS
IKLLDDVLSG DNDGEISVED HIERQDHEAL NVNVDDATRR VWERMERAMD DVDEEDPSDN
SVNNIVVGEE DERRSVDASS IEDPAADELN DGFFDINDME AFADEEEEYL PDEAFGSIPP
DSSETADNRS FHQKQRDGYF DNNSEEVLDD ELRRGKESQG NRKKYREDDE IGALYKLYDT
PRDDDTNGED ADPVNVKAVD VFGKPKEKDF KKWNSRVRNK ANDKGNDGDD DAWNEDGVDQ
AIASKTTGWN NDEEESDAVI EFSRGDTSSL YKEHGESKIE IDQRLKAGSS TFTKQQERLR
RQTEELEREM IAEKPWQMTE STPEFERAAK LAPVITIEHT ADLEEIIKNR IIADDWDDLV
PRELPDIGFG QKKGELPEVS QEKSKLGLGE LYEREYLKKA IGYDVSAAEK ESEEEKAKSE
MKTLFANLCS KLDALSNYHF APRPIAEEAE VRPVTKPAIA MEEVLPLHVS NARGVAAEEV
YGAKRGREAI LRNETELDQK DRKRARSLKK TARRKARKEK QADEKLISRL QPGLGLNNPY
ERRKMREELS EARARGKVTT GETDMNEYGG SGTFFKRMQE EAEQSINDRK TDGSGKKNVR
LHPKSSSLKL