Gene PHATRDRAFT_17728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_17728 
Symbol 
ID7196813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1748345 
End bp1750131 
Gene Length1787 bp 
Protein Length509 aa 
Translation table 
GC content45% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176843 
Protein GI219110183 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.304466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTATCCCCAT CTAGTAGTAG CCCCAGCGTC AGTTCTCTCA AACTGGGAAC GAGAATTTGA 
AAGGTTTGCC CCTCATTTGA ATGTTGTAAA ATACCACGGG AGTATGAACG AACGTTCAGA
GTTGCAAGAG GATCTACGCA TTTATTTACC AAGCAATCGA GCAGCTCGCA AAAAGCACAA
AGTGACACCA CTAGACGTTA TACTAGTGCC CATCACCTAC TTTCAAAAAG AGAAATCGGA
CGATCGCTCA TTTCTTCGTC GCTTCAACTA TCACTACATG GTGGTGGATG AAGCACATCT
ACTAAAGAAT GCCAAAGGAC TGCGATACAA GTCACTGGAT CGCTTCACAA CGCTCCATAG
ACTGTTATTG ACAGGTACAC CCGTTCAGAA TTCCCCCAAG GAGCTGATGT GTTTGTTATG
CTTTCTCATG CCGTTGTTCT CGCGAAAGGG AGGAAGCGAT TTTGATGATG AACAAGGGAA
TGATGGTGGA GAAAGTATGC TACAGCACTT CGTGTCGATG GAAGGAGGGA ACACTCTGCA
TGACGAGACA GCGTACAAAA AGCTGAAGCA GCTATTTGCC CCCTTTGTCT TGAGACGTAG
AAAGCAAGAT GTCCTTAGTC AGATCATGCC TCCAAAAGAG CACGCTGTGG AGATTGTGCA
GCTTGACGAG TCGTCTCGTT GCCTCTACGA TAAAATCATT TCCGACCATA TTCGTTCCAA
GAAAAAAGGC GACGCCTCGT CGAGAGAGCA TTTGTTTACT CAACTTCGAA AATGCGCTCA
TCATCCGCTA CTTCTTCGAG CTCGGTATAC TTCTCCGACC GAGAAGGAAC ATTTGGCGAA
ATGGTTTTAT CAGTACGGTG CCTTTCGTGG AGAAGGGTGT ACAATGGTCA AAGTTCGCGA
GGAATTGGAT CGATTCAACG ACTTTGAAAT TCATTTGACT GCTTTAGAAT TGCTGGAGGA
GAATCGACTT CGTCACGAGC AACTTGGTCG TTATGTTTTG CAAGAGAAAG ACTTGTTTTC
TTCAGCAAAA TGCAAGCGGC TTCGGGCCAT TCTACCGGAT TTGGTTGGTA AAGGACACCG
TATCTTAATT TTTTCCGTTT GGACAAGTTG CCTGGATCTG CTAAGTTGTT TGATGGAACA
AATGGGTCTA GGGTATCTAC GTATGGAAGG CAGCACACCT GTCAACGAGC GACAGGCCCT
GATCGATCGA TTTACGAGCG AGACCAGTAT TCCGGTTTTT CTGCTCTCCA CGAAGGCGTG
TGGGTTGGGT ATCAATTTGA CTTGCGCGGA TACCTGCATT ATGCACGATC TCGACTTCAA
TCCTTTCAAC GATTTGCAAG CGGAAGATCG TTGTCATCGT ATTGGGCAAA AGAAACCTGT
CAAAATTATC AAAATGATAA CGGAGGATAC AGTTGACGAG GATATTTACA AAATGCAGCA
ACGAAAGGCT CGAATGAATG CTGCCATCAT GGATACAGAT TCTAGGGAAT GGAACAACGT
TGCCGCCAAT GAAAAGGGAA ACATGCTGAA ACATGCAGTG GATCGCTTTT TGCGTTCACC
CACCCAGTCA AGGTCTTCGA AAGAAAGAGG TGACAAGGAA AATAGCGGCA ATATTGACAT
GACGGATGTA TAAACTGTGC AGCAACACAG ACTGGATGCC AGAAAAAAGT GGCCAGTTCC
ACTTTTTCTT TTTCAAATTG TACATAGCTT TCCGACAGCT ATTCAGTCAT CTCAATGGCA
TGGCAGTCCA GTTAACTCGA GAAATATTTA GTCAGCTTTA TGAACCC
 
Protein sequence
MNERSELQED LRIYLPSNRA ARKKHKVTPL DVILVPITYF QKEKSDDRSF LRRFNYHYMV 
VDEAHLLKNA KGLRYKSLDR FTTLHRLLLT GTPVQNSPKE LMCLLCFLMP LFSRKGGSDF
DDEQGNDGGE SMLQHFVSME GGNTLHDETA YKKLKQLFAP FVLRRRKQDV LSQIMPPKEH
AVEIVQLDES SRCLYDKIIS DHIRSKKKGD ASSREHLFTQ LRKCAHHPLL LRARYTSPTE
KEHLAKWFYQ YGAFRGEGCT MVKVREELDR FNDFEIHLTA LELLEENRLR HEQLGRYVLQ
EKDLFSSAKC KRLRAILPDL VGKGHRILIF SVWTSCLDLL SCLMEQMGLG YLRMEGSTPV
NERQALIDRF TSETSIPVFL LSTKACGLGI NLTCADTCIM HDLDFNPFND LQAEDRCHRI
GQKKPVKIIK MITEDTVDED IYKMQQRKAR MNAAIMDTDS REWNNVAANE KGNMLKHAVD
RFLRSPTQSR SSKERGDKEN SGNIDMTDV