Gene PHATRDRAFT_19552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_19552 
Symbol 
ID7199916 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp19871 
End bp21720 
Gene Length1850 bp 
Protein Length529 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179343 
Protein GI219117097 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAGGTCCAA CAGGCATAGG AGTACTCTGG GAAATCGGAT CACCGGAAGC GAAATAAACT 
AACCACGTTT GAGCGAACTT GGGTCTGGAC GTGCAGATTT CTACCATGCT CGTTTTGTTC
GAAACGCCGG CGGGTTATTC GTTGTTCAAG GTACGTCAAC TTGGGGTGGA TTTCGATAGT
GAACGATCTC TGTCATCCAC TTTGAACACC TCTGGATGCG GACGTATCGT CCCGCGGAAC
GATCATCACT CACACCGTGC CACAAATACA ACGATGTCTA AATCAATGAT GGACCACTCT
TCTAGGTCAC GGATGAGAAA AAGTTGAAAA AGACGGACGC CGATGATATT CATGACACGT
TTTTTTCCGA TTTTGGCAAG GCTAGCAAAT TCTTGGAAAT GGTGTCGTTC AAGCCTTTTG
CCGACACTGC CGATGCCGTA TCAGCCGCCT CCGCTATGGT TGAAGGCAAG GTGAGTAAGT
CGCTCACCAG TTTTCTAAAA AAGAAACTAA AAAAGTCCAA CGACTTGAGC GTAGCCGTCG
CCGACAAGGC AATCGCGGCC CCACTCAAAG AGTCAGTTCG TGATGACCTC AAAATTGTGC
ACGACAGCAA ATCGCAGGAA ATATTTCGTG GCATCCGTGC GCACATGGAT GAATTGCTTA
CCAACGACGA TTCAAACGTA ACCAAAGAAG ATTTGCGCGC GATGCAGCTT GGTCTCTCTC
ATTCACTGTC GCGGTACAAG CTCAAATTTT CCGCCGACAA GGTGGATACC ATGGTCATCC
AAGCCGTGGG CTTGCTGGAC GAACTTGACA AAGAAATTAA CACATATGCC ATGCGTGTCA
AGGAATGGTA TGGTTGGCAC TTTCCTGAGT TGCAAGGGCT CGTTGGCGAC AATGCCAAGT
ACTCAAAACT AGTTCTTAAG GCCGGTATGC GACCTACTTT CAAAAACTAC GATTTGAGTG
ATATTCTGGA AGAAGAAGAC GTTGAAGCTG CAGTAAAGGA GGCTGCTGAA ATTAGCATGG
GCACCGAAAT TGCTGACTTT GATATTCTCA ACATTCAGTC TCTGGCTGAT CAAGTACTGA
GCATGACGGA GTATCGGTCG CAACTATATG AGTATCTCAA GAATCGTATG AACGCTATTG
CACCCAATTT GACCATTCTT GTCGGTGAAT TGGTTGGTGC CCGCTTGATT TCGCATGCTG
GATCGTTGAT GAATCTTGCT AAACAACCGG CCAGCACAGT ACAAATCCTT GGTGCCGAAA
AGGCACTCTT TCGCGCTTTA AAAACGAAAC ACGATACTCC GAAATACGGC TTGATCTATC
ATGCCTCACT GATAGGACAG GCAGCACCAA AGAACAAGGG AAAAATCTCG CGCGTACTGG
CTGCTAAGGC GTCTTTGGCC ATTCGGGTTG ATGCGCTGTC AGATGAAACC GCTGATCAGC
TTGACACGAC GATTGGTTTC GAAGGCCGCG CCAAAGTAGA AGCCCGGCTT CGACAATTGG
AAGGGGGAGT CTTCGTGACC AACGGTAATG TATCAGCATC CAAGACAGCC AGGTACGATC
CTGTGGCGGC CAAGACGGCC GCTGCTGCTC CTGCCTACAA CGATTCTAGT GACATGGTAT
TGGACGTGGG GACAAATGGA AGTAAGACGG ACGAGAATGC GACAAAGAAA AAAAAGAAAG
ATAAGAAAAA ATCAGACAAT GGAGATGAAG AATCACCGAA GAAAAAGTCA AAAAAGGACA
AGAAGCGGAA GGCTGAAGCT GTCGACGACG AAGAACAGGA TGATGACGAA GCGAAAAAGT
CAGCGAAAAA GGCCAAGAAA GACAAGAAAA AGAGGAAGTC ACAAGAATGA
 
Protein sequence
MLVLFETPAG YSLFKVTDEK KLKKTDADDI HDTFFSDFGK ASKFLEMVSF KPFADTADAV 
SAASAMVEGK VSKSLTSFLK KKLKKSNDLS VAVADKAIAA PLKESVRDDL KIVHDSKSQE
IFRGIRAHMD ELLTNDDSNV TKEDLRAMQL GLSHSLSRYK LKFSADKVDT MVIQAVGLLD
ELDKEINTYA MRVKEWYGWH FPELQGLVGD NAKYSKLVLK AGMRPTFKNY DLSDILEEED
VEAAVKEAAE ISMGTEIADF DILNIQSLAD QVLSMTEYRS QLYEYLKNRM NAIAPNLTIL
VGELVGARLI SHAGSLMNLA KQPASTVQIL GAEKALFRAL KTKHDTPKYG LIYHASLIGQ
AAPKNKGKIS RVLAAKASLA IRVDALSDET ADQLDTTIGF EGRAKVEARL RQLEGGVFVT
NGNVSASKTA RYDPVAAKTA AAAPAYNDSS DMVLDVGTNG SKTDENATKK KKKDKKKSDN
GDEESPKKKS KKDKKRKAEA VDDEEQDDDE AKKSAKKAKK DKKKRKSQE