Gene PHATRDRAFT_40080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40080 
Symbol 
ID7195774 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp105934 
End bp108242 
Gene Length2309 bp 
Protein Length743 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184184 
Protein GI219127942 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTTCG ATCAACGGAC GATTGTAGTG TTGCTGGTGT CAACGGCGTA CGCGAGCTGG 
CCTACTACTG GCGGTGGTGG TACGTGCGGA GTCGAGGCGT CGGCACCATT CAATCCCGGT
ACTTACTACG GCAATGCGTA CGCCCAGGGA AACGAACGAA ACGTCAACCG CAATGGCAAC
AACGCGTGGC CAGCTCAAAC GTCTCCGCAA GAATCCGCGT ACGACAGTTA CGGGCAATCG
TCCACCGCGG AGCCTGTAAG CGACTTGCCA CCGCTACCGG AAGGCTGGAG TGAACACTTG
GACCCTGCCT CGGGACAACT ATACTACTAC AACGCCAACG ATGGAACAAC CACTTGGGAC
CGCCCCTTGC GTTTGGAAGA CGAGGCCAAA GCGGAAGAGG TTCAACCCCC AGAGACCAAT
ACGAGACAAA ATCTGAGTGG AGCGGACGAG TCACGAGACA ACGACAACAG GGCGACACGG
ATGGCAGAAT CGGACGTGAA GGAAACACAC GCCGACAAAA TGCTGGACGC AGAGTCTCCG
CAGGAATCAG ATCATGAACA TAAGCAGCAC TCATCGGAAG ACGCCTGGCG GAACTCTGCA
TCATGGGATT CGCCAGGGGA ACATGATCCT GAATCTTCAC CAAAGGAAGC CCCAGCAGAA
CCAGAACGCG GCGCAATACC GGAACTTGAC CAACATGGTT GGGAAAATCA GAACATGGCT
ACTGCAGATA ATTCAGCGAC CGAGGATTTC CAGCAAGAAG GACCAGGACG GGATACTGGT
CATTTTCCTG CTCAGCGACA GCCGGTGGTT GACAATGAGC AGCGTTTGGA GGGCTTTTCT
GAAAGTCAGT TGAATGTAGA TCGGCCCGAC CAAGGACCTC CCACGAGTCA TACGGGTAGC
TGGGGAGCAC CGCGCTCTGT CGAGCAACCT CGCAGAGAGC AAGACGTTCA TGAACGGCCG
TACGAACCAA GGTCCGATCA CGTGTCGGCG GACACATTTG GACGCCATGT ACCGGATGCC
AACCGCCCGC AGCCAGAAAA GCAATTACTA CCGGTACAGC ATCAAGACCA GTATGGCGTC
AATCCTCGGA GTTCCGTATT CGGACTTGTG CCGGAACCCC ATAATCCACA GCAATATCAA
CGCACTTCGC CCCAGCAATC CATTCATCGT GATCCAGAGC AGAGTGTTCC TATCCAAGAA
AGGGAACAAT ATCAGGAACG GCCAGGGACT GCGATGCCAC CGCGGCAGCA AATCATGCAG
CAACATCCGT ACGGACAGCA GCGTCCAACT CATCCACAAC AATTGCAGCA ACAGGGACCG
CCTTCACAGT CACGGGTACT ACCTCCTCAG TACCAACAGC AGCAACCCCA TCCACAACAA
CAGCAGCAGC AGTCTCATCA CGGTCAGTAT GGTGGTCCCT ACGGACAACC TTACGGAGGT
CCTTACGGTC AATACGGGCA GCCATCTCAC GGTGTCAATA CTCCTCGAGG GTACGGAACG
CAACAACAAT CTCAGCAGCC ACAACGACAG TTGATTTCAG AGGACACGAC ACGTCCTGTC
AAGGAAGCTT TGGGCCGGAC TTGGCAGAAC ATTCTAGGCT TGAGTAACCG TACGAAGGAA
GCCGTCGATC ATGCACGGGA ATCGGTGGTC ACGGGAGCCA AGGGAGCCAG TCAGTCCCTA
AGTACCACTA GTGCGAGTAA GTGAAAGTAT CATGTAAAGC ATGCACGCGA TTGCCAAGAT
TGTTCTTAAC GTTGATGTTT GTAATTCCCT CAGGTTGGTG GGGACAAGCA AAGAATACGT
TCGGATCCGT GTTTGAAAAC GAAAATGGAC AGCCATCGCA ATATTCTCTC TCTGGACAAT
ATGGTGGACA AAGAGAGCAA GTTATTCGTG GTCCGCCACC CGGTTATCCG CCACAACAGG
GTGGCCATCC TGCTCACGGG CAGCCGCAGT ATCCTCCGCC TATGCACCAC CAAGGTGGAG
GGTATCCGAC GTATCCTGGA CAGCAGTACC CTCCAGGCTA CGGGCCACCG CAGCCTCGGT
CGGATCCTTC GTATCCGCAC GATCCTCAAT ATCGTCCACC CGCGCAGAGC CAGCAGCCTC
CACTCCAGGC ACAATGGGGC CAGCAACAAC ATTATCCTCC TCAATACCAG CAAGGTGGGC
CAGGCGGACC AAGAGGGGGG CCGGGTTCTC CACAACAACG GCAGCCGCAG CAAAACGAAC
AACGACCACC GCCACGGCCG CAAGGTGGAC CTAGCCCGGA TACAGACGAT CCATGGCAGC
ACCCTGGACT CGGAACAGAC GGCTATTAG
 
Protein sequence
MRFDQRTIVV LLVSTAYASW PTTGGGGTCG VEASAPFNPG TYYGNAYAQG NERNVNRNGN 
NAWPAQTSPQ ESAYDSYGQS STAEPVSDLP PLPEGWSEHL DPASGQLYYY NANDGTTTWD
RPLRLEDEAK AEEVQPPETN TRQNLSGADE SRDNDNRATR MAESDVKETH ADKMLDAESP
QESDHEHKQH SSEDAWRNSA SWDSPGEHDP ESSPKEAPAE PERGAIPELD QHGWENQNMA
TADNSATEDF QQEGPGRDTG HFPAQRQPVV DNEQRLEGFS ESQLNVDRPD QGPPTSHTGS
WGAPRSVEQP RREQDVHERP YEPRSDHVSA DTFGRHVPDA NRPQPEKQLL PVQHQDQYGV
NPRSSVFGLV PEPHNPQQYQ RTSPQQSIHR DPEQSVPIQE REQYQERPGT AMPPRQQIMQ
QHPYGQQRPT HPQQLQQQGP PSQSRVLPPQ YQQQQPHPQQ QQQQSHHGQY GGPYGQPYGG
PYGQYGQPSH GVNTPRGYGT QQQSQQPQRQ LISEDTTRPV KEALGRTWQN ILGLSNRTKE
AVDHARESVV TGAKGASQSL STTSASWWGQ AKNTFGSVFE NENGQPSQYS LSGQYGGQRE
QVIRGPPPGY PPQQGGHPAH GQPQYPPPMH HQGGGYPTYP GQQYPPGYGP PQPRSDPSYP
HDPQYRPPAQ SQQPPLQAQW GQQQHYPPQY QQGGPGGPRG GPGSPQQRQP QQNEQRPPPR
PQGGPSPDTD DPWQHPGLGT DGY