Gene PHATRDRAFT_41483 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41483 
Symbol 
ID7199291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011698 
Strand
Start bp279997 
End bp281484 
Gene Length1488 bp 
Protein Length495 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185462 
Protein GI219130626 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCAAA AGGCTGTGGA TGACCAAAAT TACACGTATT CGTCATTTTG GTCGGTGAGG 
ATAGTATCGA TCGAAGGATT GTTGAAATTG ATCAAGCTGG ACAAGGTCGG CCAGCTTCGT
CAACTCAAAC GCTTGCTAAG CAGGGTAGTA CTGATCACTA TGAGCAATTT AATAGTACCG
TCATCACAAA CGCTCACACC GAGTAGACGT CGGCGGCCGA AGGACAACCT CTATCCTGTA
TTCCTCGTCG TCTGTGGATT CTTTGGCTTG ATCCAGTTCT TCGATCCATT CGCAACTTTC
GAAGGGAGCC CAGGTCATAG TAGCGAGAAT GCGAGTGCTT CAATTTCGCT TGCTCTGGCT
CGGAGCAAGG TCCGCGACCA TGCTGAGAAG TTCCGGCAAA GGAAGGAGCG AGGCTATCCA
CAGAGGATAC CCTCCTCGGT GACTGAACAA GCATTTCGCC AAAGTGATGG TATTAAGACC
GGCGCCGTCA ACGCAACCAC GAGCCATGAA TGGCGGAGAA AATGGGACCA GCTTAAGGTG
GGACGCGAGC CCTTATTTCA GATGCTCTTT GAGGATGCCA AAATGGGCGT TGATTCCGTT
TCTTTACCAT CTCTAGAGGC CTTGCCGACT ACCGACGCTT TGCGACAACT GTATGGCGAC
CGAGTAATTG TACGAGGACT AGAAACCTGT CAAAAGTACA GAGATACAGT GGCATTGGAA
GATCGATACG TTGCCGTAGC AGGGACCTTC AACGTGGGCA CCAATCTCTT GGCCTTTCAT
TTGGAAAACA ATTTGCGTTT TCCGAATCGC ACGGATGCAG GCAGTGGGAG GAAGGCACAC
TGGCGATGGC AGGTGCGCTG GGGGAAACAT CAGCCAGCCA CAGTTCGCAA CCAAAACGTG
GCCCGTGGCT TTGAAGCCGA CAATATCGAT CACGTGCTAC CCATTATTAT GATTCGGGAT
CCACTCTTTG TGCTGCAGTC GCTATGTGCG CATCCGTACG GTGCACGATG GCGCCACGTA
GATGGTCATT GTCCCAATTT AGTACCTAAC GAAGTGGATC GCGCCTACTT CAAAGGTGTG
CCCGATATTT TCAAAGTTAC AATAGTGTAC GATAAGAGCC GGCAAACAAG ACACAATTCG
CTGATTCACT TTTGGAATGA ATGGTACCGC GAATACTTGG ATCAATTTGA CTACCCCGCG
CTCTGGGTTC GCTTTGAAGA TTTGGTCTAT AATCCACAGG CCATGCTGCA GCAAATTGCC
ACTTGTATTG GAGGTGCTGC ACCCACACAC CAGAACTTTC AATACCTGAC GAAAACAGCC
AAATCTCACG GTAGTGGAAC CAACATGCTG AAAGCTCTGA CGAAAACCGG TGACGCGGCG
GCCCGCGTGC GGAATATGAC CGTTGCAGAC CTAGACTACC TCCGAGATCA CGCCGATCAC
CAACTACTCC AACTCTTTGG CTACCGCATA CCAAATCCGG GGCGATAG
 
Protein sequence
MHQKAVDDQN YTYSSFWSVR IVSIEGLLKL IKLDKVGQLR QLKRLLSRVV LITMSNLIVP 
SSQTLTPSRR RRPKDNLYPV FLVVCGFFGL IQFFDPFATF EGSPGHSSEN ASASISLALA
RSKVRDHAEK FRQRKERGYP QRIPSSVTEQ AFRQSDGIKT GAVNATTSHE WRRKWDQLKV
GREPLFQMLF EDAKMGVDSV SLPSLEALPT TDALRQLYGD RVIVRGLETC QKYRDTVALE
DRYVAVAGTF NVGTNLLAFH LENNLRFPNR TDAGSGRKAH WRWQVRWGKH QPATVRNQNV
ARGFEADNID HVLPIIMIRD PLFVLQSLCA HPYGARWRHV DGHCPNLVPN EVDRAYFKGV
PDIFKVTIVY DKSRQTRHNS LIHFWNEWYR EYLDQFDYPA LWVRFEDLVY NPQAMLQQIA
TCIGGAAPTH QNFQYLTKTA KSHGSGTNML KALTKTGDAA ARVRNMTVAD LDYLRDHADH
QLLQLFGYRI PNPGR