Gene PHATRDRAFT_37218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37218 
Symbol 
ID7202017 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp414596 
End bp417528 
Gene Length2933 bp 
Protein Length671 aa 
Translation table 
GC content58% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181375 
Protein GI219122066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000635734 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCCGT CCGCGACACT TAAAACTAGT AATAACGATC TAATCAATCA TATTACTAGT 
CTTAATTTGT CTGTAGTCCC CTCCCCGCCT AGTACCATGA CCTCGACCAT TGCCGACACC
TTTTGCACAG GACACTACAT CACCGTGTCC TGCCCCCACT TCAACCGACA ACCTGCCGCC
TCTCCACTCT CCGTCCGTGT TCCCAACGGC GCTACCCTCC GTTCCAGCCA CACGGCAACT
CTCGACCTCC CTGGTTTTTC CCCTGCTGCT TGCCAAGCTT ACATCTTTCC TGGCCTCGCT
TCCCACCCCC TCATTTCCAT TGGCCAACTC TGTGACGACG GCTGCACCGC CACATTCTCC
GCCACCCGAC TCGACATCTA CCGAGACACC ACCCTGCTTC TTACCGGCGC TCGAGCACCC
GCCACCGGCC TCTGGCACCT TGACCTCACC CCAGCCAAGA CTGCCCATGC CCTCATTCCC
GACAGCTCCC TGGCAGACCG CATCGCTTTC GTACATGCCT CCCATTTCTC CCCTGCTCTC
TCCACTTGGT GTACCGCTAT TGATGCCGGG CGCCTCCCCA CCTTCCCCGA CATCACCTCC
AAACAAGGGC GCAAGTACCC TCCCCTCTCT ATGGCCACCA TCAAGGGCCA CTTAGACCAA
CAACGCGCCA ATCTTCGGTC CACCAAACCT TCCTCCGTTC CTCCAGTGGC CTTGCCCAAC
CCTCTCCATG AATCCCAGCT AGACTTCTGC CCGGCCCCGG CCACTCCTCC CGCTGGCCGA
ACCCACCATG TCTTCGCCGC GCACCAACGA GTCACCGGCC AGATCTACAC AGATCAACCG
GGCCGTTTCC TCACTCCCTC GAGTACAGGC CACACGGACA TGCTTGTGCT GTATGACTAC
GATAGCAATG CCATCCATGT TGAACTCATG AAGAGCAAGT CCGGCGCCGA GATCCTAGCA
GCCTACCAAC GTGCCCACTC ACTCTTCACT CAACGGGGCC TTCAGCCGCA ACTCCAGCGT
CTAGACAACG AGGCGTCTCC CGCTCTCGAG TCCTTTATGA CGGCCAACCA GGTCGACTTC
CAGTTGGCAC CACCCAATCT GCACCGTCGC AACGCCGCCG AACGCGCCAT ACATACCTTC
AAGAATCACT TTATTGCCGG TCTCTGCAGT ACGACCCCGG ATTTTCCGCT TCACCTTTGG
GACCGCCTCA TTCCCCACGC TCTGCTTAGT CTCAATCTCC TCCGTGGCTC TCGCATCAAC
CCCACCCTCT CGGCCCACGC ACAGCTCCAT GGCGCGTTTG ACTACAACCG CACCCCGCTC
GCCCCTCCCG GCACTCGCAT CCTCGTCCAC GAAAAGCCCG CCGTTCGGGA AACTTGGGCA
CCCCATGCTG TTGAAGGCTG GTACCTCGGC CCCGCTCTGC ACCACTACCG CTGCCATCGC
GTTTGGATCA CAGAGACGCG TGCCGAACGT GTTGCCAACA CTCTTGCGTG GTTTCCCAGT
CGCATTCCTA TGCCCACTGC CTCCTCCACC GATCGCGCCC TGGCCACCGC CCGTGATTTA
GTGCGCGCCC TCCAAAATCC CTCTCCCGCT TCGCCGTTTG CACCATTGGA CGCCACCCAA
CACCAGGCCC TTCTACATCT TGCCGATCTC TTTGCTTCGG TCGCTGCTCC GGCCTCTCCG
ACCGCTGCAC CGACTCCCGC GCCCCCGGTC CCAGCACCTC CCCCTGCTCA AGTCCGCTTT
GCTGTTCACA TTGTCACGGC CGAGCATGCT CCTGCACTTC CGAGGGTGCC CATCCTTGCG
CCGCCAGCTC CGAGGGTGCT CTCTCGGACC CGCAATCCCG GCCGCCGCCG TCGCAAAGCA
CGCAAGCAAC CGCCAACCCC AACCTTAGTT CCGGCTCATC CACACAACAC CCGCACCCGA
CCCTTTCTTG TCCCAGCCTC CGCCAACGCA GTTGTCGACC CCGCAACCGG CGCCTCTTTA
GAGTACCGCC ACCTACGCAC CGGTCCCAAT GCTCCCGATT GGATTCAAGC CGCGGCTAAC
GAAATTGGCC GTCTCACCAG CGGTAACCCG CCTCACAGCA CTCACGGTAG CCAAACTATG
CACTTCATCG CGCATACCGC CATTCTTCCC GGACACAGGG CCACCTACTT ACGCATTGTT
GCCAGCATTC GCCCGCAGAA AGCAGAACCC AAATGCATAC GTTTTACCGT CGGTGGCAAC
TTAGTTCAGT ACCCCGGCAA GGTTAGCACC CCAACTGCGG ACATCACCAC AGCCAAGCTC
CTCTTCAACA GTGTCCTCTC AACTCCTGCG GCAAAGTTCA TGTGCATTGA TATCAAAGAC
TTCAATCTTG GCACCCCCAT GGCATGCTAC GAATACATGC ATATCCCGGT CCCAGATATT
CCTCCCATCA TTTTGGCTCA GTACCAGTTG GCCCCTCTTA TCCACAACAA TTCAGTTACC
GTCGAAATTC GCAAAGGTAT GTACAGCCTT CCCCAAGCCG GCATTCTCGC CCATGACCGC
CTTGTTGAAC ACCTCGCTCG CCACGGCTAC GTCAAGACCG CGCATACTGC GGGCCTTTTT
CGACACGTCA CACGCCCGAT TCAATTTACC CTAGTTGTCG ACGACTTTGG CGTAAAATAC
ACCGGCACCA ACAACGCTCA GCACCTCATT GACACATTGC AAGCGCTCTA CACTATCACA
ATTGATTGGG ATGGTACGCG TTACCTAGGT CTTACACTCG CTTGGAATTA TGAACATCGA
ACCCTTGACA TGTCCATGCC CGACTACATT GATCAAGCCC TAACCCGCTT CCAACGTTCG
CCTCCTACCA AGCCGCAACA TGCGCCTCAT CGCAAAATGC GCTCGCACTA CCTTTTCGAA
CAGTCATCAA CCAATGACAT AGCAACTTCT CATTTGCAGC AAGGGTGTGT TGA
 
Protein sequence
MSPSATLKTS NNDLINHITS LNLSVVPSPP STMTSTIADT FCTGHYITVS CPHFNRQPAA 
SPLSVRVPNG ATLRSSHTAT LDLPGFSPAA CQAYIFPGLA SHPLISIGQL CDDGCTATFS
ATRLDIYRDT TLLLTGARAP ATGLWHLDLT PAKTAHALIP DSSLADRIAF VHASHFSPAL
STWCTAIDAG RLPTFPDITS KQGRKYPPLS MATIKGHLDQ QRANLRSTKP SSVPPVALPN
PLHESQLDFC PAPATPPAGR THHVFAAHQR VTGQIYTDQP GRFLTPSSTG HTDMLVLYDY
DSNAIHVELM KSKSGAEILA AYQRAHSLFT QRGLQPQLQR LDNEASPALE SFMTANQVDF
QLAPPNLHRR NAAERAIHTF KNHFIAGLCS TTPDFPLHLW DRLIPHALLS LNLLRGSRIN
PTLSAHAQLH GAFDYNRTPL APPGTRILVH EKPAVRETWA PHAVEGWYLG PALHHYRCHR
VWITETRAER VANTLAWFPS RIPMPTASST DRALATARDL VRALQNPSPA SPFAPLDATQ
HQALLHLADL FASVAAPASP TAAPTPAPPV PAPPPAQVRF AVHIVTAEHA PALPRVPILA
PPAPRVLSRT RNPGRRRRKA RKQPPTPTLV PAHPHNTRTR PFLVPASANA VVDPATGASL
EYRHLRTARV C