Gene PHATRDRAFT_18877 details

Gene Information

Locus tag: PHATRDRAFT_18877
Symbol:
ID: 7198026
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 251901
End bp: 253359
Gene Length: 1459 bp
Protein Length: 455 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002178193
Protein GI: 219114795
COG category:
COG ID:
TIGRFAM ID:
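
The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(251901, 253359)   # Start bp and End bp from the record
cds_len = coding_length(455)             # Protein Length from the record
noncoding = gene_len - cds_len           # bases in the span that are not coding

print(gene_len, cds_len, noncoding)      # prints: 1459 1368 91
```

Under these assumptions the span matches the reported gene length of 1459 bp, and the 91 bp not accounted for by coding sequence is consistent with the gene being listed as spliced.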


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCCAGT CGAGAGAGAA TAAAGACATT TCCGATTTGA CCTACAAGCT AGAAACAATG 
GGTTGCCAAA TGAATATGGC TGATTCCGAA CGAATCGAGG GTCAATTACA AGGTCTCGGT
ATTCGACCTT TAGATCCAGA CGTAGACAAG AACAAACAGC CGGATGTCGT CATATTGAAT
ACCTGCTCCA TTCGAGATCA CGCCGAGCAA AAAGTTTATT CATACATTGG ACCCCACGCC
AAACGCAAAC GCGACGGGGA GGACGTGACG ATTATTGTAG CTGGCTGCGT CGCCCAGCAA
GAAGGGGAAG CGCTGTTACG ACGTGTACCG GAAGTTGATC TCGTCATGGG ACCGCAGTAC
GCTAATCGGA TTGGCGATTT ATTAGAAGAT GTCAGCAATG GCAACCAAGT TGTGGCCACC
GAAGCCAGCC ATATCATGGA GGATTCCACA AAACCACGAC GACAATCAAC GGTCGCCGCG
TGGGTGAACG TCATTTACGG CTGCAACGAG CGATGTACCT TTTGCATTGT ACCTACCACG
CGTGGAGTAG AACAATCTAG GCCTGCTGAG AGTATTGTAA GAGAGGTTAC TGAACTTGTG
GAACAAGGGT TCAAGGAAAT CACGCTATTG GGTCAGAATA TTGACGCCTA CGGCCGTGAC
ATGATCCCGA AGCGAAAATT TTCGGATTTG ATCCGGATTG TTGGTGAAAT ACCAGGATTG
GACCGCTTGC GATTTGTAAC GTCTCATCCT CGTTACATGT CGCTGGGCGT CGTGGACTCC
GTTGCCGAGA CACCGGCAGC TTGTGAATGC TTTCATATTC CTTTTCAAAG CGGATCTAAC
GAGATACTCG CCGCTATGGG TAGAGGACAT ACCCGAGAAA AGTACCTGCA CATTGTGGAT
CGTATCCGAT CGGTAAGTTA ATGTCGACGG TCTTTTCCAA TCGGTACTCT CTCGTACTGT
CGTTGCGTTG ACTCACACAT TTCTTCGTTT ACTAAATCCA CAGAGAATAC CGGATGCAGC
GATCACCGCC GACGTGATCG TGGGCTTCCC TGGGGAAACG GAGGAGCAGT TTGAAGACAC
CTTGTCCTTA ATGCGCGAGG TGGTTTTTGA TTCCGTCAAT ACAGCCGCGT ACTCTCCCCG
CCCCAATACG CCGGCAGCCG TTTGGGACGA CCAAGTTGAC GACGCCGTCA AACAGAATCG
TCTGCAACGG ATCAATGCAC TCAATCTAGA ACACGCCGCT CAACGTCGGG CCCGCATGAA
GGGGCGGACG GTCGAAATAT TGGTGGAGGA ACGCAACGTA CGCGTGCCCA CGCAAGTAAT
GGGTCGTACG CGGCACGGGT ATATTGTCTA TTGCGACGGT GAGATTGATG AGCTTCGTGG
AAAGCTAGTC AACGTCGAGA TTGACACCTG CGAGCAATAC TATCTTGCCG GAAAGCCAGT
TGCCCAAGAT GGACACTGA
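
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that step. The helper names are hypothetical, and the GC figure is computed here over A/C/G/T bases only, which may differ from how the database derived its own value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace and upper-case a sequence pasted in grouped blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of the sequence."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for base in counted if base in "GC")
    return 100.0 * gc / len(counted)


# First displayed line of the gene sequence; paste the full block to
# compare against the reported gene length and GC content.
first_line = "ATGGGCCAGT CGAGAGAGAA TAAAGACATT TCCGATTTGA CCTACAAGCT AGAAACAATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the whole sequence block should give a length equal to the Gene Length field, which makes it a quick check that nothing was lost when copying.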
 
Protein sequence
MGQSRENKDI SDLTYKLETM GCQMNMADSE RIEGQLQGLG IRPLDPDVDK NKQPDVVILN 
TCSIRDHAEQ KVYSYIGPHA KRKRDGEDVT IIVAGCVAQQ EGEALLRRVP EVDLVMGPQY
ANRIGDLLED VSNGNQVVAT EASHIMEDST KPRRQSTVAA WVNVIYGCNE RCTFCIVPTT
RGVEQSRPAE SIVREVTELV EQGFKEITLL GQNIDAYGRD MIPKRKFSDL IRIVGEIPGL
DRLRFVTSHP RYMSLGVVDS VAETPAACEC FHIPFQSGSN EILAAMGRGH TREKYLHIVD
RIRSRIPDAA ITADVIVGFP GETEEQFEDT LSLMREVVFD SVNTAAYSPR PNTPAAVWDD
QVDDAVKQNR LQRINALNLE HAAQRRARMK GRTVEILVEE RNVRVPTQVM GRTRHGYIVY
CDGEIDELRG KLVNVEIDTC EQYYLAGKPV AQDGH