Gene PHATRDRAFT_42995 details

Gene Information

Locus tag: PHATRDRAFT_42995
Symbol:
ID: 7196220
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1733373
End bp: 1735149
Gene Length: 1777 bp
Protein Length: 549 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002177360
Protein GI: 219111217
COG category:
COG ID:
TIGRFAM ID:
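
The Gene Length value is consistent with the Start bp and End bp fields if both coordinates are read as 1-based and inclusive. That convention is an assumption, not something the record states. A minimal Python sketch of the arithmetic follows (the function name `span_length` is hypothetical and used only for illustration):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a genomic span, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of this record.
gene_length = span_length(1733373, 1735149)
print(gene_length)  # 1777, matching the Gene Length field
```

Since the gene is marked as spliced, the 1777 bp span is longer than the 1650 bases (549 codons plus a stop codon) needed to encode a 549 aa protein. The difference would be intronic sequence, assuming the span covers only the coding exons and the introns between them.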


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGACCA AACCGACGGC AACGGCAGCG CCGTCGCTGA ATATGTATCA GAAAATCGCG 
GTGATTCGAC ACTCCAGCGA TAGCTTGACT TACGATGACA GCGACGGAAA CTCAAAATCC
GTGTCCGTCC AGCAACTCGG GGACGCGATT GCCACCGCGG TACGTAGCAA AACCCCGTCC
GTAGCGCTGA TCCTTCGAAT CGTGAGATAG CGAAAGGAAC ACAGTACGGA ATGGATTTTT
GCATTCCATA GCCATCTGAC GCAATGCTTT ACTTTTTCCT CTCCAGACAC CACTGGTTTT
GCCAAAGTCC AAGCTCGACA TTCCGGTTCC CCGAATCAAC AACGTAGAAA GTTACGAGCG
TGACGTACCG GCAACGTATC AAAAGCCAAT TTCGTACGTG CGATGTCACA GACCTTCACG
GGCGGAATTG AAAGCAATGG TAGAATACGT CGCCGACCGC GAGGACCAGG AATGGCTGAC
GAACAACACT AAATTTGGCG GTGCGGTCGT GTGGGACGAG GGATTGGACA CGTTGCAACA
GCGAAAGCCT CAGCTACCGT TAGCTCTCTT GGAACGCATT CTAGATTTGT TCGAGAAGGA
AACTGGCTTT GACGCCATCA TGACATCAAA TCAAGCAGAA GCTATGGTAT TTAAGAACAT
TCCGCTTATT TATCAAATCT TCCCGAATAA ACCTCGGAAT GGGGTGGTGA CAACCAAAAC
AGTTCTCCTG GAAGTATACA ACTACTGGCT TCACAAGCGT TCCAAGCTCA AACGCCCCTT
ACTGCGACGC TTTTGGCCGG TCACTAGTAG CGACGACACC AACCCTCATC TTGTTTTCCG
ACCTCGGGAA AAAGAGAAAT ACAAACTACG TAAAAAACGT CAAAATGACA TGGACGCGTA
CCGAAAAATG AAACAGCTTC GCAACGATTC GGACAATCTG CGTGCGGTGC TGGAGTTGGT
CCGTCGACGA GAAGAGCTTG CGCGTGCCCA CATCAAGACT CAAATGGAAT TATTCGAACA
GCGTATGTAC GACATCGTCG ACACGACCGG ACTACCCCGG GAATTGAAGC ATGTAGACAA
AGATCAGCTT AAGCGGGTGT TGGACACGCC ATCCTTTTTC GACATCTACT ACGGAGGGCG
GAAAAAACAG ACCGCTCGGT CCCCTGTTTT CCCTAGTGAT ATTACAGCGC GTGAAGCTCG
CCCTCTCTTG AGTAAGACCC TCCACGACAA CGCTAGTAGT GCCTCCCAAG AAACACCAGC
GATTGTCGCT GGACAAAATA GTGGTGAACC CGCTCCCTTG TTCCTTGATC CATTACAGAC
TCGAGAGACG TATGCCACTT CTTGGCAAAA TGCTGTGCCT CACGTAACTT CGTACATTGA
ATCTCATGCC GAACCGACTT TTCGGTTTAG ACATCGACCG CGGGTTGGTC GCGGTGGTCG
ACTTTGTATC GATAGAATGC CCCGCCCGCC GAATCCGACC GGTCCGACCA CTACTGTCGT
CACCGCCGGT CGCGGTATGC CCCAGTCACT GACCCATAAG GACCGCCTAC TCGACCTGCT
CCCCAAACCA CTCGATCATA TTTCATTGAG TCGAAAAATC GAATCGATGT CTGTCGAAGC
TATCAAAGAA GACCAAGAAG CCAACGTGTT AGCGGCGGCA ACCAATGGTG ATTTAGACGA
AAACGATGCG GACGAAGTGC TGGTGAAGCT CGACGACTGG CTGGAGACGG ATGATCAACC
ATGGGGAAAC GAACGGTTTG CAATTGGTCC GCTTTGA
 
Protein sequence
MATKPTATAA PSLNMYQKIA VIRHSSDSLT YDDSDGNSKS VSVQQLGDAI ATATPLVLPK 
SKLDIPVPRI NNVESYERDV PATYQKPISY VRCHRPSRAE LKAMVEYVAD REDQEWLTNN
TKFGGAVVWD EGLDTLQQRK PQLPLALLER ILDLFEKETG FDAIMTSNQA EAMVFKNIPL
IYQIFPNKPR NGVVTTKTVL LEVYNYWLHK RSKLKRPLLR RFWPVTSSDD TNPHLVFRPR
EKEKYKLRKK RQNDMDAYRK MKQLRNDSDN LRAVLELVRR REELARAHIK TQMELFEQRM
YDIVDTTGLP RELKHVDKDQ LKRVLDTPSF FDIYYGGRKK QTARSPVFPS DITAREARPL
LSKTLHDNAS SASQETPAIV AGQNSGEPAP LFLDPLQTRE TYATSWQNAV PHVTSYIESH
AEPTFRFRHR PRVGRGGRLC IDRMPRPPNP TGPTTTVVTA GRGMPQSLTH KDRLLDLLPK
PLDHISLSRK IESMSVEAIK EDQEANVLAA ATNGDLDEND ADEVLVKLDD WLETDDQPWG
NERFAIGPL
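
Both sequences above are printed in space-separated blocks of ten residues. The following sketch, in plain Python with hypothetical helper names (`clean_sequence`, `gc_percent`), shows one way to join such blocks back into a continuous string and to compute a GC percentage of the kind reported in the GC content field. The input used here is only the first line of the gene sequence, not the whole gene, so the printed percentage is not expected to equal the record's value.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCGACCA AACCGACGGC AACGGCAGCG CCGTCGCTGA ATATGTATCA GAAAATCGCG"
dna = clean_sequence(first_line)
print(len(dna))         # 60 bases on a full line
print(gc_percent(dna))  # GC percentage of this one line only
```

Applied to every line of the gene sequence, the cleaned string should be 1777 characters long, and the cleaned protein sequence should be 549 characters long, in line with the Gene Length and Protein Length fields.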