Gene PHATRDRAFT_14937 details

Gene Information

Locus tag: PHATRDRAFT_14937
Symbol:
ID: 7203726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011685
Strand:
Start bp: 46828
End bp: 48138
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182886
Protein GI: 219125225
COG category:
COG ID:
TIGRFAM ID:
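
The Start bp, End bp, Gene Length, and Protein Length fields above are arithmetically related. The short Python sketch below checks that relationship. It assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 46828
end_bp = 48138

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS, and the final codon is a stop codon
# that does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1311 bp) and Protein Length (436 aa) fields.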


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.723308
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGGTGC TAGAAGCTGG CTCCGATACG TTGTCGAAGG TGAAAATTTC AGGTGGAGGA 
CGATGCAACG TCCTGCACGA TACCGCTAAG GCTGTTCCAG AACTCCTTGC CGGCTATCCT
CGAGGGCGTC GAGAACTCAA CGGAATCCTA CACAAGCACT TCTCGCCCAA AATGGCGCAA
GAGTGGTTTA CCAGTCGTGG TGTAACACTC AAGACTGAGA ATGACGGTCG CATGTTTCCA
ACCACGGATA ATTCGCAAAC TATCATCAAG GCGCTACTGG AATCTGCCGA CGATGCCAGC
GTCTCGATCA AACATCGTGC CAAAGTTGAA GAAATAAAGA TAGATGGAAG CAAATTTGTT
GTTGATTATC TACAAAAGAA CCAAGGTTCT GAGAAAGAGA GTTTCTCTCG GGCGTTCGAC
GCTGTGATAC TCGCTACGGG ATCGGCACCC ATCGGCTACA AGCTAGCGTC GTCGCTTGGA
CTTGATATGG TTCCGACTGT ACCATCTCTG TTCACTCTCA ATGCCAAGCT CGACGTCAAA
GAAGGCGGTG TCTTGCACGG ACTCTCAGGC GTATCGGTGC CATTGGGGAA AATTTCGTAC
CAAGTGCTTG CTCAACAACC AACCTTGGAG GTCCCCGGGG ATATCACTAT AACGACGAAT
ACGAAAAAAT CTGTTTTGGA GCAACAAGGT CCTTTGCTGA TAACCCACCA CGGGTTGTCA
GGGCCAGCGG CCTTGCGCTT GTCTGCATTT GGAGCCCGAG AGCTCAATGG AGCGAATTAC
CGAGGCAAGT TGACTGTACA CTGGGCACCC TCGTTGGGAA ACGTTGACGA CGTTTTCGAA
GCGCTGTGGA TGATCACAGG GACAAATCCC AAAAAGACTG TTTCTAGTAT ATGCCCACTG
TTTTTGTCTG ACGGTAGTAC TGCCTTGCCG CGCCGGTTGT GGGCTTCCCT CGTCGGATGC
TCGGGCTTCG CACTTGACCA AACCTGGGGA CAGGCTTCTA AAAAGATAAC GCGCCAGCTC
GCTTTATTGG TAACAGCCTG TCCATTGCAG CTAACCGGAA AAGGAACATT CAAAGAAGAG
TTCGTGACGG CAGGGGGTGT TGATTTGAAG CAGATGGACA TGAAAACCAT GCAAGTCAAG
TCGTGCCCAG GTCTATTTGT ATGCGGTGAA CTTCTGAACG TAGATGGTGT GACCGGTGGA
TTTAATTTTA TGAACTGTTG GGGGACTGGG TATGTAGCGG GTAGCAGTGC TGCTACATTT
TCTGCTCAAT CTTTGCCTTC CAATCAAGAT TTTTCATTGG TTGAAGATTA G
 
Protein sequence
VTVLEAGSDT LSKVKISGGG RCNVLHDTAK AVPELLAGYP RGRRELNGIL HKHFSPKMAQ 
EWFTSRGVTL KTENDGRMFP TTDNSQTIIK ALLESADDAS VSIKHRAKVE EIKIDGSKFV
VDYLQKNQGS EKESFSRAFD AVILATGSAP IGYKLASSLG LDMVPTVPSL FTLNAKLDVK
EGGVLHGLSG VSVPLGKISY QVLAQQPTLE VPGDITITTN TKKSVLEQQG PLLITHHGLS
GPAALRLSAF GARELNGANY RGKLTVHWAP SLGNVDDVFE ALWMITGTNP KKTVSSICPL
FLSDGSTALP RRLWASLVGC SGFALDQTWG QASKKITRQL ALLVTACPLQ LTGKGTFKEE
FVTAGGVDLK QMDMKTMQVK SCPGLFVCGE LLNVDGVTGG FNFMNCWGTG YVAGSSAATF
SAQSLPSNQD FSLVED
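
The protein sequence corresponds codon by codon to the gene sequence. The following is a minimal sketch of that correspondence, not the database's own method: it assumes the standard genetic code (the Translation table field in the record is empty) and translates only the first three 10-base blocks of the gene sequence. The `translate` helper and `CODON_TABLE` name are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code, written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
first_blocks = "GTGACGGTGC TAGAAGCTGG CTCCGATACG"
print(translate(first_blocks))  # VTVLEAGSDT
```

The result matches the first block of the protein sequence. The leading GTG codon is read as valine here, which is how the record itself lists the first residue.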