Gene PHATRDRAFT_49099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49099 
Symbol 
ID7195452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp584270 
End bp585867 
Gene Length1598 bp 
Protein Length499 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183773 
Protein GI219127083 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0142086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GACAAGAGTA GAAGCATTTT TGACTGCTTA CGAGTGCGTT TATTGGAGTC TTTACTCATA 
GACAGCAACT AACTGTAAAT CTTCGTTCAT TCTTTGCAAT GGAGGGCAAA TTGCCAAGGG
AGCGGCTGTC AGAACACAAA CGCTCCGTTC TAACCAACGA TGCCGTCATC AAGCAAGGAA
AGCAATCGAG TCTGCAAGAA GTTATCGAGT GGCGAGGCCG TGCTAGCAAT GTGGCCAGCG
CCACGCATAG CAACGATTCT TCCAGTCATC ACGATACGTC AAACAAGTCC AAAAGTCGTC
CGTCTGAGGT TGATGACGAA AAAGCTTCCG GCAGCGCCGG TATCGTCAAA ACCGGTGCTA
TAGCAAGTTC GGATGAAATG GCTTTCCAGA AGGCGCAAGA ATCATCCAAA AAGATGGCTG
CTAGACTTCC AACTGACCAG AAACGGCCTT CAATGTCCGA CCTGTCGTCC GCATCGTCAA
TACCCGTAAG CACGGAACAG ACTGACGTTA AAAAAAGGAA AAAGAGGCGT TCCCGGGGAT
GGACCTTCAA TGATGTCGAA AGCGCCACAG AACGCTCAAA CAAAAATATC AAGATTACAA
AAGCAGGGCC ATTTCGGGAT CATAAGGACA ATGCTGATAA ACGCGGTAAT TTCGAAGATG
TTGACGGTAA ACTGATATCT TATGATCAAG CCAGCACAGG TTCCTTAGCT GTGGAAGGAA
GGCTTGCCAA GAACTCTGGA CTGAGGAGAG AAGATGCCAA GGCCGCTGAG AAACGAGAGT
ACAATCGAGT GAATGCTGCC CGAGCTCGGC TCCGCAATAA AGAAATGGTC GAGGAATTGC
AAAAGAACGT TATTGATCTG AATGCCCATA TTGTCGAATT GGAACGATCA AACGAAATCC
TCCGGGCTCA AGTCGAGGTT TTGGGCAGCA GGAGTCAGAG TCTCCTCACG ACGAGCCAAG
TACCGACAGC TGCGGCCCCT GAACAAGAGT TAGACCATGT GCAGTCTTCC ACAGCTGCAA
TTGTTGCACC TAGTTTTAAC ACTCCAATTT TTTCTGTCCA GCAAAGTGGT ACAGGTACAG
GGCAACCTTC CCCTCATCAA AATGTGGTTG CAGTAGAGCA ACTGTTGGCT TCGATCCTAG
GAAGAAGCCT TCCCCAGTTA GAACAACCTC CATCACCGCC AGCGCTTGAC AATCTATCTT
TGTTGCTAAC GCTCGTGCAA GGGAGTAATG GAGCCAGTAT AAATTCCCAG CTGCAAAGCG
GTCCTCCACA ATTCAGTGGT GCCGCGATAG CTCCGCCCAC AGCACAACCA CTGCCTTCGA
CGCTCTTCGC GATGAATCCT TCCTTGCACC AGCAGCAACA GACTATTCAT TCTCGGGCAA
GACTCTTAAA TATGCAGCAG CCGTTTGATC CGTATGCTCA TTTGTCGAGC GCTAATTTAC
AATCTGTTCT GCAAAACCTT CCCGCCGGGA CCCTTTATGC TGCTTTACAG CAACAAAGGC
AATTACAGCA AGGCGACGGA CCTGGTTATC CAATCGATGA CAGCCTCCGC AACAAGAACG
ATAACAAATC CTCATCATTA GGGAAAGATG GACGATGA
 
Protein sequence
MEGKLPRERL SEHKRSVLTN DAVIKQGKQS SLQEVIEWRG RASNVASATH SNDSSSHHDT 
SNKSKSRPSE VDDEKASGSA GIVKTGAIAS SDEMAFQKAQ ESSKKMAARL PTDQKRPSMS
DLSSASSIPV STEQTDVKKR KKRRSRGWTF NDVESATERS NKNIKITKAG PFRDHKDNAD
KRGNFEDVDG KLISYDQAST GSLAVEGRLA KNSGLRREDA KAAEKREYNR VNAARARLRN
KEMVEELQKN VIDLNAHIVE LERSNEILRA QVEVLGSRSQ SLLTTSQVPT AAAPEQELDH
VQSSTAAIVA PSFNTPIFSV QQSGTGTGQP SPHQNVVAVE QLLASILGRS LPQLEQPPSP
PALDNLSLLL TLVQGSNGAS INSQLQSGPP QFSGAAIAPP TAQPLPSTLF AMNPSLHQQQ
QTIHSRARLL NMQQPFDPYA HLSSANLQSV LQNLPAGTLY AALQQQRQLQ QGDGPGYPID
DSLRNKNDNK SSSLGKDGR