Gene PHATRDRAFT_46359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46359 
Symbol 
ID7201628 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp111219 
End bp112565 
Gene Length1347 bp 
Protein Length357 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180944 
Protein GI219120410 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAATA TTCGTGTCCT GCGATTTCGT CAAATGCTGT TGGATCCTTT TGTACTGGGC 
GTTTGCCTTC TCGCGTTCGC CGGTACTACC ATCAATACGG TACACGCGCT CGTACCTAGC
CAAGGTGGCG AACTCCAAAA GGCAGCCCAG CAGTTGCAGC ACCAGTACGT TTCACAGTCC
CCGAGTGCTC CAGTGAGTAC GAGCCGAGCG GGACTCTCTC CTTCTGTCGC TCCTCCATTG
TCGTCACGCA TCCCTGGGCA TCCTCCGCTG GCTTCGACTA CGAGTCGCGC GGCGGCAGCC
AGGATGACCG AAGAGGAGAA CGAGTGGTAC ACTCCACCTC CGGCTCCCGT CCAATCCACC
CAGATTCCTA CCGAAATCGC CGTCGTGAAC TCCGACGAGG CTTGGCGGAG TTTCTGGCTC
TCGATGACGA CCGGCTCTGC GTAATTTCCT TTCACGCGTC CTGGTGCAAG AGTTGCCAAA
AGTTCGGACT CCTCTATAAA TCGCTAGCGC ACAAACTCGG AGACAAGCGT GACCGCAAGA
CGCAGAACAT CGTTGAACGC GGTTCCGTGC GTTTTGCCTC GGTCGAATGG GGCGCCAACA
CGGCACTGTG CCGATCCCTC GGTATTAAGC GACTGCCTAC CACTCAAATA TACCACGCAG
GGACCCTGCT CACCAGCTTT GCCTGTGCGC CGGCCAAATT TCAAACCCTC AAGGATCAAA
TCAAGTACAT GACCCGGACG CTGCAAAACA GCGATCGACT CGCGGCGATC AAGGCAAATC
CCTTGTTTCC GGACGAGCAA GCCGTCCAAG ATAAGGTTCA AGCGCTGCTA GCGGCCCATA
CCGAACAGGA ATTTTCCAAG ACCCTAGACA TTGGCGCGGC GCTCATTGAC GCTACCGTTA
TGGTGGCACC CCCATCATCC GATCAGTCGG ACGGCAGCAA TACGGAAGAC CATGGACAGG
GGCTCTCCCG TGCCGAGCGT ATGGCGGCGT TTGCGGAACG GATCCGCTCC AAACAAAACC
GAGCAGATCA TGTTGCAACA ACCGCTGAGA GTGTAGCCGA TTCCTCCCAC AAGGTGTGGT
GGCGCTTTCG ACGAGCACCC TAGACAAACC CAACACTTAC CCACTCACAA AGTCCACGTT
TGGGCTTTGT GGACATAGAT GTGTACCCTA GTACGAATGT GTGAAGCAGC AAACAGAAAA
GAATAGAAAA AGACAACTTG ATCTGGAAGG GACCTTGTTC CAGACCATCA AAAAAAGAAT
AGAGACTACA GAAAATGAGA CGCAGTGTTA GCAACGTAAC TATATATGGA TACCGCTCAT
TTAAATAGCT CTGCTGCCAT AGTCCAT
 
Protein sequence
MTNIRVLRFR QMLLDPFVLG VCLLAFAGTT INTVHALVPS QGGELQKAAQ QLQHHEYEPS 
GTLSFCRSSI VVTHPWASSA GFDYESRGGS QDDRRGERVV HSTSGSRPIH PDSYRNRRRE
LRRGLAEFLA LDDDRLCVIS FHASWCKSCQ KFGLLYKSLA HKLGDKRDRK TQNIVERGSV
RFASVEWGAN TALCRSLGIK RLPTTQIYHA GTLLTSFACA PAKFQTLKDQ IKYMTRTLQN
SDRLAAIKAN PLFPDEQAVQ DKVQALLAAH TEQEFSKTLD IGAALIDATV MVAPPSSDQS
DGSNTEDHGQ GLSRAERMAA FAERIRSKQN RADHVATTAE SVADSSHKVW WRFRRAP