Gene PHATRDRAFT_45441 details

Gene Information

Locus tag: PHATRDRAFT_45441
Symbol:
ID: 7200681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011675
Strand:
Start bp: 164401
End bp: 165549
Gene Length: 1149 bp
Protein Length: 379 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002179593
Protein GI: 219117602
COG category:
COG ID:
TIGRFAM ID:
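
The start and end coordinates in this record agree with the listed gene length if they are read as 1-based, inclusive positions. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic, using a hypothetical helper named `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp copied from the record above.
print(span_length(164401, 165549))  # 1149
```

The result matches the Gene Length field (1149 bp).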


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAGAGCACCA TGGCGGATAT AAGCAAATCT ACGACCGATG CCACCAAGAA GAATCACAAC 
AAATATCGTC GTGACAAGCC ATGGGACAAT GATGACATCG ATCACTGGAA GTTGGCATCA
TGGAACGCGG ACGATGGTGG CGACACATTG CCCGGCGGCC GTCTCCTGGA AGAGAGTTCC
TTCGCAACTC TTTTTCCAAA GTATCGTGAA GCCTATTTGC GTCAAATTTG GCCCGTAGTG
ACGCGGCATT TAGATCAGCA CGGAGTAGCT TGTGAATTGA ATCTTGTGGA AGGAAGTATG
ACGGTACGGA CAACGAAAAG GACGAAGGAC CCATACGTGA TCCTGAAGGC GAGAGACCTG
CTGAAATTGC TAGCACGGTC ACTGCCGGTG GCACAAGCGG TCAAGATCCT TCAAGACGAC
TACCAGTGTG ATATTGTCAA AATTGGAGGC CTCGTTCGAA ACAAGGAGCG TTTTGTAAAG
CGACGCCAGC GCCTTCTTGG TCCAGATGGA AGTACATTGA AGGCCCTGGA GTTGCTTACG
GGCTGCTATA TTCTGGTGCA GGGCAATACC GTTAGCATCA TGGGTGATTC GTGGAAAGGT
TTGAAGCAGG CTCGTCGTGT GGTTTTGGAT TGCTTGAAGA ATATCCACCC TGTATACCAT
CTGAAACGCC TTATGATTCA GAAAGAGCTC GCGAAGGACC CGGCACTTCA GAACGAGGAC
TGGTCGCGAT TTTTGCCGCA ATTCCAAAAG AAAAACGTAC AAACTAAAAA ACCTTCCGTT
CGGAAGACAA AGAAATCATA CACACCGTTC CCGCCCGCCC AACAACCGAG CAAGATCGAT
CTACAACTCG AATCTGGAGA ATACTTCGCA ACGGAGTTTG AGCGCAAAAC CAAAAAGGTG
GCCGATCGAA AAGAGGCTTC CAAGAGCAAA TCAATTGCCA AAAGGAAAGC TCGCGAGTTG
GAAGAAGAAA CACCCGTGCT GGCATCGCGA AAAGCCAAAA CGACCACAAC GAAGGAAGAT
ATTCCTGTGG AACAGCGCAT CAAGGACAAG TTTCAAAAGG CTGCACAGGC TTCACAATCT
TCGACCTCCG ACCCTGCCGA CTTTATTCAA GGATCGGGCT CGGGATCCAA GAGGGGTAAA
AAATCGTAG
 
Protein sequence
MADISKSTTD ATKKNHNKYR RDKPWDNDDI DHWKLASWNA DDGGDTLPGG RLLEESSFAT 
LFPKYREAYL RQIWPVVTRH LDQHGVACEL NLVEGSMTVR TTKRTKDPYV ILKARDLLKL
LARSLPVAQA VKILQDDYQC DIVKIGGLVR NKERFVKRRQ RLLGPDGSTL KALELLTGCY
ILVQGNTVSI MGDSWKGLKQ ARRVVLDCLK NIHPVYHLKR LMIQKELAKD PALQNEDWSR
FLPQFQKKNV QTKKPSVRKT KKSYTPFPPA QQPSKIDLQL ESGEYFATEF ERKTKKVADR
KEASKSKSIA KRKARELEEE TPVLASRKAK TTTTKEDIPV EQRIKDKFQK AAQASQSSTS
DPADFIQGSG SGSKRGKKS
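
The gene and protein sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any length or reading-frame check. The sketch below shows one way to do that in Python. The function names `clean` and `check_cds` are hypothetical, and treating the first `ATG` as the start codon is an assumption, not something the record states.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stops (assumed table)


def clean(seq: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(seq.split()).upper()


def check_cds(gene_seq: str, protein_seq: str) -> dict:
    """Compare a gene sequence with its protein using simple length checks."""
    gene = clean(gene_seq)
    protein = clean(protein_seq)
    start = gene.find("ATG")  # assumption: the first ATG is the start codon
    if start < 0:
        raise ValueError("no ATG found in gene sequence")
    coding = gene[start:]
    return {
        "gene_length": len(gene),
        "protein_length": len(protein),
        "leader_length": start,  # bases before the assumed start codon
        "in_frame": len(coding) % 3 == 0,
        "ends_with_stop": coding[-3:] in STOP_CODONS,
        # One codon per residue, plus one stop codon.
        "codons_match": len(coding) // 3 - 1 == len(protein),
    }


# Toy input in the same block layout as the record (not real data).
print(check_cds("CCATGGCG GATTAG", "MAD"))
```

Applied to the full sequences in this record, the check should report a gene length of 1149, a protein length of 379, and a 9-base leader (`CAGAGCACC`) before the first `ATG`, with the remaining 1140 bases giving 379 codons plus a `TAG` stop.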