Gene PHATRDRAFT_42828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42828 
Symbol 
ID7196487 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1255253 
End bp1257168 
Gene Length1916 bp 
Protein Length499 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176749 
Protein GI219109993 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACGAGA GTACCCTTGC GAATAGTAGC ACTTTGGCCA CCATTGACAA GATCAAGTTC 
ACTTGCAACC AGGATACATG ATTGTGGGAG TCACAATGCC ACAGGATCCC GGCATGGCAA
TACCTCCTTA TAGTCAGAGT GGGCAAGCGA ATGACGCATA TCCGGAAGGC GGCTCGGGCG
AGAAATTCGA CACGAGCCAG CCACAGGCCG TCGCAATGCC GGCCCTCCAA CCAGCAGTAG
ATGGTAATCT ACCGCCCCAA CAAGTCTTTG TGAAATCCCA TATTGACCCT ATGGGAAGCC
CGGAAATGCT TGGCTTGGAA GGTCAGTTCC AGTCTATCGG ATTCGCGCAA GAAGGATTCG
ATCACAATTC CAGCGCGCAC TACAATGGTG GCGATAGTAG TGGCAATGCC AGCGCCGACA
ACGATGATGG CGAAGACGAT CCCATGAAGC TTTTTGTTGG ACAGGTATGT AATAAGATAC
GGCTGGTCTC GCTTGGATGC ATCGAGTAGT GAGGGCTGGT TGGGTTCTGT GTATGAATCA
TTGTTCTACA TTGCTTGGAT GAAAAGCGGA ATGTCGTACT GTATTTCCCG TAAGCGAGAG
AATTTCGTGT CTCCTCAATA TCCGCTCATG TCTTGTACTT TCCACACAGG TTCCGAAGGC
AATGAGCGAG GAGGATGTGT TTCCAACGTT TGATTCGTTC GGTCCGCTCA AAGATGTCGC
TATCATTCGC GACAAGCACA CTGGTTTGCA CCGTGGGTGC GCGTTCGTCA CCTACTGGTC
GGCTGCCGAC GCAGAACGCG CGCAAGAAGC GCTCCACGAC ACCTTTACCT TCCCCGGAGC
GCGGAGAGCA GCACAAGTTA AACCTGCCGA ACCATCCGTC CCCGAGAACA AACTCTTTGT
GGGAATGCTT TCACGCAAGG CAACCGAAGT GGAAATTCGC GAGCTATTTG AACCGTTCGG
TGAAATTCGA GAAGTATACA TGATCCGCAA TGCTGACGGA TCTAGCAAGT GCGCAGCATT
TTTGAGATAC ATGAAACGAG GCGCGGCTGT TCAAGCCATT GAAACTCTTA ACAATATTTA
CATGATGGAA GGTGCAGCCA GGCCCCTCAT CGTTCGATTT GCGGACAATA AACATCAGCG
CCATCAGCGC CAGATACGAA ACATCAGAAG ACATGAAATG ATTGCAGCAA TGGGTGGCGG
CTATGCAACA TACCCCCCAC ATGTGCAGGT TCAGATGGGA ATGCCCGGGC ATCCCGGTGC
TAGTCCACAG TACACTGTAC CCGTACCTCC TCATTACGTT GAAGCTGCAT ACGGGCCACC
GAACGGTGCC CCAATGCCAG GGCATCCGTA CATGTACCCC CCTCAGCAAT ACGCTCCTAC
ACCAGCATAT ATTTACCCAG AACACACGTC TGAAGAAACT AAACCGACCA ATAACCGTCC
GCGTGAAGGC CCCGCTGGAG CGAATTTATT TGTATATCAT CTCCCTCATG ATTTAACCGA
CGCCGATCTA GCAACAGCCT TCAATCCTTT CGGGAACGTT ATTAGCGCCA AGGTGTATGT
CGACAAATAT TCAGGCGAAA GCAAGGGTTT CGGTAAGTTG CAGCACGCCT TCTGGTCTTT
GTACCCCTAT AAATGTCATT CTCACGCTCT GAAAATCCTT GCCAGGCTTT GTGTCGTATG
ACTCAGTTAT TGCGGCAGAA GCAGCAATCG AGCAGATGAA CGGATTTCAG ATCGGCAACA
AACGACTGAA GGTACAACAT AAGCGTGTTC ACGGAAACCA TCCGAATGCT CCGTCCTTAA
GCGATTCCCA AGACCCTCCC GAAGATTTGC TTTAAACGAC GCTTTCAACT TTTTCCCCAT
GTTGATATAT CCTTCCAGCC CTTACCACTG AAACGATCCA AACGAAACAA CAACCC
 
Protein sequence
MYESTLANSS TLATIDKINQ SGQANDAYPE GGSGEKFDTS QPQAVAMPAL QPAVDGNLPP 
QQVFVKSHID PMGSPEMLGL EGQFQSIGFA QEGFDHNSSA HYNGGDSSGN ASADNDDGED
DPMKLFVGQV PKAMSEEDVF PTFDSFGPLK DVAIIRDKHT GLHRGCAFVT YWSAADAERA
QEALHDTFTF PGARRAAQVK PAEPSVPENK LFVGMLSRKA TEVEIRELFE PFGEIREVYM
IRNADGSSKC AAFLRYMKRG AAVQAIETLN NIYMMEGAAR PLIVRFADNK HQRHQRQIRN
IRRHEMIAAM GGGYATYPPH VQVQMGMPGH PGASPQYTVP VPPHYVEAAY GPPNGAPMPG
HPYMYPPQQY APTPAYIYPE HTSEETKPTN NRPREGPAGA NLFVYHLPHD LTDADLATAF
NPFGNVISAK VYVDKYSGES KGFGFVSYDS VIAAEAAIEQ MNGFQIGNKR LKVQHKRVHG
NHPNAPSLSD SQDPPEDLL