Gene PHATRDRAFT_50031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_50031 
Symbol 
ID7198728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp198698 
End bp200432 
Gene Length1735 bp 
Protein Length435 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184838 
Protein GI219129317 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000158311 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CATTCAACAT CAGAGGAACA AGTCCAACCA TTCAACATCA GATTCATTCT CTATCGAACC 
CAACTCACAA TCCAAAAGTA AAAGTCTTGT GTACACATTG CAATTGCGTC GACACGAATA
CTCGCCAAAC TATTCTCTCT CGTCGTCCGA ACAGCATCGA TCGATTTCTG AAATCTCCAT
TTCTCTCCAT CGCAAATTCT TTCCATTTGT CCTCTACTCC GCCATCCATT GAAATCATGA
TCGAGTTCCG TCGGAACGCA TACCATCGAA CCACCATACT CTGTCTGGTG TTGACCGTGA
GTGCTTCCAT ATCTGCGTGG ACTCTTCCGC GACTCCCGAT GCGGACCCAT CGCTCGTCCA
TCGGATCCTT ACCAGCGACC GTCGACGACA GTATCACTGC TTCTACGATC AGCGCCAGCG
GTACCACCAT CACCAGCAAC AACGTAGTAA CGACGAAGCT TCTCCCCGAA TTCCAAGCCG
TAACCGACGC AGCACAAGCC AAGCTTCTGG CTTCCATTCC GGAAGCTTAC CACGCAAAAA
TTGTCCCACT GCTGGCGCAC TTTGTCAACG AGTACATGAC GGCCTCGCAA AACGCCTACC
TCGCTACGGG CAATCCCTCC AGTGCTCCGG AGCAGGCCGC TTCGCGGATT CTCCAAGGCG
TCGGATACGG CGTACGCCTC GGTTTGCTGG AACCCTTTCA GTTCTCCACT TCCCACGTCG
CGCTCCGGGG GAAGAATCCG GAATTGGAAC AAGGCAACGA GATTGACTTT TACGAATTCG
GCTGCGAATT CTTCCGGACC GTCATGGACT TGGAACGCTC CGTCGTGCTC GGGCAGGATC
AAATTCCCAC GATACTGCAG CAGCTTGCCG ATGGTGAGAA CGTTGTCTTG TTGGCCAACC
ACCAGTCTGA AGCCGACCCG CAAGTTGTCA GCTGTTGTCT AGAAGCCATC GGATACGGAG
ACTTGGCCGC CGACGCCGTC TACGTTGCCG GACACAAGGT TACTACCGAT CCCCTCGCTA
TTCCGTTTTC CATGGGACGC AACCTGATCT GCATCCACTC CAAGAAACAC ATCAACGCCG
ACCCGGAAAC CAAGTCCGTC AAACAGCGTG AAAACCTCAA AGCCATGGGC GCCTTGCTCA
ACAAGTTCAA AGAAGGTGGT GCGCTTTTGT GGGTCGCCCC TTCGGGAGGG CGTGACCGAC
GGGATGTCAA CACCGGAAAA GTCCCACTTG CACCCTTTGA CTCTAAAACC ATCGACATGT
TCCGACTGAT GGGTAACAAG TCCAAAAAAA CGACACACTT TTATACCCTC GCCATGGTCA
GTTACGATCT CTGCCCCCCA CCGGACGTTA TTGAGCCCGG CACGGGTGAA CCCCGCAACG
TACGCTTTGG ACCCGTTGGT ATTGCCCTCG GCGCCGAGTG CATTTCTGTG GGGGGACTGG
AATCGCGGCA GGACTTTTGT CAACACGCCT TTGCCCAGTG CCAGGACGAT TACCTACGCT
TGCAGCAAGC GATTGCCAAT CCGACAACGA CGGATCAGGC CTAGTGACCA CCAGCGTTGC
GCATTTCACC TCCTTTACAC AATTGTGGTC GGCACAAGCA CAATACGTCA CTGCAGGAAA
TACACCACGT ATCCATCCTT GTGGCATGCG AACAATGCCG TGTTCGTTCC CATTTATTTG
GTCCGTATTC GTAGAACATA ATTATGTGTG ATCGAGGGAC ACTATTATAC AACCA
 
Protein sequence
MIEFRRNAYH RTTILCLVLT VSASISAWTL PRLPMRTHRS SIGSLPATVD DSITASTISA 
SGTTITSNNV VTTKLLPEFQ AVTDAAQAKL LASIPEAYHA KIVPLLAHFV NEYMTASQNA
YLATGNPSSA PEQAASRILQ GVGYGVRLGL LEPFQFSTSH VALRGKNPEL EQGNEIDFYE
FGCEFFRTVM DLERSVVLGQ DQIPTILQQL ADGENVVLLA NHQSEADPQV VSCCLEAIGY
GDLAADAVYV AGHKVTTDPL AIPFSMGRNL ICIHSKKHIN ADPETKSVKQ RENLKAMGAL
LNKFKEGGAL LWVAPSGGRD RRDVNTGKVP LAPFDSKTID MFRLMGNKSK KTTHFYTLAM
VSYDLCPPPD VIEPGTGEPR NVRFGPVGIA LGAECISVGG LESRQDFCQH AFAQCQDDYL
RLQQAIANPT TTDQA