Gene PHATRDRAFT_20708 details

Gene Information

Locus tag: PHATRDRAFT_20708
Symbol:
ID: 7201382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 894693
End bp: 896203
Gene Length: 1511 bp
Protein Length: 475 aa
Translation table:
GC content: 56%
IMG OID:
Product: predicted protein
Protein accession: XP_002180535
Protein GI: 219119555
COG category:
COG ID:
TIGRFAM ID:
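
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself: the variable names are illustrative, and the 1-based, inclusive coordinate convention is an assumption rather than something the record states.

```python
# Values copied from the Gene Information fields above.
start_bp = 894693
end_bp = 896203
gene_length_bp = 1511
protein_length_aa = 475

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Bases needed to encode the protein, not counting a stop codon.
coding_bp = protein_length_aa * 3

# Bases in the gene span that the protein codons do not account for.
noncoding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, noncoding_bp)  # 1511 1425 86
```

Under that assumption the span matches the stated Gene Length, and 86 bases of the listed gene are not accounted for by the 475 codons.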


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.558415
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATTATTGTTC GCTCGGCACC AAGTCCATTC CCGTCGCTTC CGTCGTACGC AACCCCCCAT 
TTTGGCACAC ACAGTCCGAC ATCCTCATGA GGTCCGGTTT GGCTGTCGTC TCCATTTCTG
GGCTGGTGAT GCTCACCTCC ACTGCGGTTG GGGGGTTAGC GACGAGTCGA CCCAACTTGC
CGCCGCTTTC GGGTACCACT TCCATTGCAC CGCACCTGCC TTTGGCACGT AGTGCCATGG
AGTACTTGGA CGCCAGTCCC GATCCTTTCC ACGCCGTACA AACCTCCATC GAACGATTGG
AAGCGGCGGG CTTTACCTCA CTGTCCGAAA CCTCCACCGT CGACACCGGA AAAATCGTTC
CGGGCGGCAA ATACTACTTT ACCCGCAACA AATCCACACT CGTCGCGTTT GCTGTCGGAG
ACCGGTACCA ACCCGGCAAC GGCTTTAAAA TTATTGGCGG ACACACGGAC TCGCCCAATC
TGCGCGTCAA ACCGCGCTCG CTACGGACCG CCGCCGGCTG CGTGCAGGTC GGGGTGGAAT
GCTACGGGGG CGGACTCTGG CATACCTGGT TCGATCGGGA TTTGGGTGTG TCCGGGCGTG
TGTTGGTCCG GAGCCGCGAT GATGCGCGCA AGGTCACGCA GAGACTCGTT CGTATGGATC
GAGCCCTCCT GCGGGTATCC AACCTGGCGA TTCATCTGCA GTCTGCCAAA GAGAGGGAAG
CCTTTAAAGT GAACAAGGAA GACGATCTTT CACCAATTCT CGCGATGGAA GCGGAAAAGT
CCCTCAACGG CGGGGAAAAC AAGACCAAGG ATGGGTGGAC CGAGTACCAA GAACCCGCCC
TGCTCGAAGT ACTCGCACAC GAACTCAACG TCCGAGTCGA AGATATCGCC GACTTTGAGC
TCAGTCTGTT TGATGTCCAA AAAGCAAGTT TGGGCGGAGT CTTTTCGGAG TTTATCCACT
CGTCGCGTTT GGACAATCTC GCCAGTTGCT TCCTCGCGGT ACAAGCTTTG GTGGATCACG
TGGAGGCCGG CAGCACTGCT AAGGACTCGG ACATTTCCAT GATTGTCTTG TACGATCACG
AAGAGGTCGG TAGCAACTCC GCCGTGGGAG CCGCATCGCC AATAATGGCG GAAGCCGTCC
AACGCATTGC GGCAGCCTTG GGCAACCAGG AAAGTACGGA AACTTACGCA GCCTGTATTC
GCAACAGCTT TTGTTGCAGT GTCGATCAGG CCCACGCTTT GCATCCGAAC TATGCAAGCA
AGCACGAAAA GAATCACCAG CCAAAGATGA ACCAGGGCAT GGTGATTAAG CGCAACGCCA
ATCAAAGGTA CGCCACCAAC GCCGTGACGG GCTTCTTGAT GCGCGAAATT TCCCGCCGCG
CCGGGCTGCC ACCCATTCAG GAGTTTATTG TACGACAAGA TTGTGGCTGT GGTTCGACAA
TCGGACCGCT GATCAGTACA GCTACGGGTA TTCGGACAAT TGATATGGGC TGCCCCCAAC
TTTCCATGCA T
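
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be joined before it can be measured. A minimal sketch of that cleanup, with a length and GC-percentage helper, is shown below; the function names `clean_sequence` and `gc_percent` are hypothetical, and the sample is only the first two groups of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence shown above.
sample = clean_sequence("ATTATTGTTC GCTCGGCACC")
print(len(sample), gc_percent(sample))  # 20 50.0
```

Run on the full sequence, the same helpers give values that can be compared against the Gene Length and GC content fields of the record.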
 
Protein sequence
MRSGLAVVSI SGLVMLTSTA VGGLATSRPN LPPLSGTTSI APHLPLARSA MEYLDASPDP 
FHAVQTSIER LEAAGFTSLS ETSTVDTGKI VPGGKYYFTR NKSTLVAFAV GDRYQPGNGF
KIIGGHTDSP NLRVKPRSLR TAAGCVQVGV ECYGGGLWHT WFDRDLGVSG RVLVRSRDDA
RKVTQRLVRM DRALLRVSNL AIHLQSAKER EAFKVNKEDD LSPILAMEAE KSLNGGENKT
KDGWTEYQEP ALLEVLAHEL NVRVEDIADF ELSLFDVQKA SLGGVFSEFI HSSRLDNLAS
CFLAVQALVD HVEAGSTAKD SDISMIVLYD HEEVGSNSAV GAASPIMAEA VQRIAAALGN
QESTETYAAC IRNSFCCSVD QAHALHPNYA SKHEKNHQPK MNQGMVIKRN ANQRYATNAV
TGFLMREISR RAGLPPIQEF IVRQDCGCGS TIGPLISTAT GIRTIDMGCP QLSMH
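
The start of the protein sequence can be related back to the gene sequence by translating from the first ATG. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field is blank in this record; the `translate` helper and the table construction are illustrative, and only the first 120 bases of the gene are used.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace and trailing partial codons."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First two lines (120 bases) of the gene sequence shown above.
head = "".join("""
ATTATTGTTC GCTCGGCACC AAGTCCATTC CCGTCGCTTC CGTCGTACGC AACCCCCCAT
TTTGGCACAC ACAGTCCGAC ATCCTCATGA GGTCCGGTTT GGCTGTCGTC TCCATTTCTG
""".split())

# 0-based offset of the first ATG, then the first ten codons from there.
start = head.find("ATG")
print(start, translate(head[start:start + 30]))  # 86 MRSGLAVVSI
```

The ten residues produced match the first group of the protein sequence, which suggests that the listed gene sequence begins 86 bases upstream of the start codon.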