Gene PHATRDRAFT_31877 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31877 
Symbol 
ID7196412 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1155188 
End bp1156510 
Gene Length1323 bp 
Protein Length407 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177229 
Protein GI219110955 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00010716 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAG AACAATCAGG ACCTGGTGTC ATTATGAGCG GCATTCGCCG CATTGGTCAT 
GGTCTCAAGA CTTCATTAGC CATAGTTGGT TTTGCAACAG CTTCCGGTCT TTACATGGAG
TACCGGAAGT ATTACCCTAT GGAAGAAGAC AACAATAAGA AGAAAGTTCT GGTGATTCCC
TTTCACCATT TACAGTTGAT TGAGAAAGAA AAAAAGAGCA TTAGATCCCA GCTATCGCGT
TTCGATGTGG ACACCAAAGA TTGCCCGGTG CAAATGGAAA TCAAGGATCT TGTGGATGTG
TTGCACCACG CGGCGTCAGA TCCCAGTATT GTCGCTTTGT ACGGCGTCTT TGGACACGGC
TCGACCTTGT CCCAAGCGGG TTGGGCGGAT TTGGAGGAAG TTCGGAACGC GTTACGAGTT
ATCCGCGAGT CGCATCGTTG GCATGCGGAG CCCAACCTTC AGCACAAGGC CCAGGTGATT
CCAGGAGTTG AGAATAAGCC CATGTACGCC TACGCGGATA CTTTTGCAAG TCTAGGGGAT
CCCGCTTACA AAGAGTATTA CTTGGCGTTG ATCTTTACAC ACATTCATAT GCAAAAGACC
GGGGAACTCA ACTTGTTTGG TGCCATGTTG CAGCAATTCT TTTTGCAGGG ACTACTGGAG
CAGTATGGTA TTGCATTACA CGTCTTCAAG CATGGATAGT ACAAGAATGC GACCAACATG
TTCACCAAAG CACGTTTGAA CAAGCCACAT TGTGAGAACG TCTCCAACAT TCTTGAACAG
ATCAACAACA ATGTATGCCA AGATATTACC AGATCTCATT CCAAGGCCTT GTTGACGTCT
TGGCTCAAGC AGGGTTGTCG GGATGACGTG GATTTGTGGA AGCGCATACA TCAATTGGGG
ACGTTTCCAG CTGTGACGGC ATACAAAGCA GGCCTCATAG ACTTCCTGCC CTGGCGCAAC
CAAAAGAACA CAAAAAGCAA GGTAAAGCCA CTTGATGGTA CAGGTAACAA AGAGTCTGCA
ATAGATGATA TTAAGAACAA ATGGGCATTG CAAGAAACTG ACTTTGAGCA ATTCAAAGCA
GACACGGCTG TCAGTCTCCA GGCCTACGCA AAACAAGTTG CAAAGAAGAA ACAAAATGAG
CAAGATATTT TCGACCAGTA TGGAACCCAA CATCCTGCCA TTCAAAGTAT TCTTGCCAAA
ATTGGCATGT CTTCTGTTGA TGATGGAGAA CCACATCCTC AGAAGGAAAC AATTGCATTG
CTAAGAGTCA ACAAAGGTAT TGGCAACTTG ACAGCCCGCA AGCTAGTCAA TTCGATTTGC
TGA
 
Protein sequence
MSKEQSGPGV IMSGIRRIGH GLKTSLAIVG FATASGLYME YRKYYPMEED NNKKKVLVIP 
FHHLQLIEKE KKSIRSQLSR FDVDTKDCPV QMEIKDLVDV LHHAASDPSI VALYGVFGHG
STLSQAGWAD LEEVRNALRV IRESHRWHAE PNLQHKAQVI PGVENKPMYA YADTFASLGD
PAYKEYYLAL IFTHIHMQKT GELNLFGAML QQFFLQGLLE QYGIALHINN NVCQDITRSH
SKALLTSWLK QGCRDDVDLW KRIHQLGTFP AVTAYKAGLI DFLPWRNQKN TKSKVKPLDG
TGNKESAIDD IKNKWALQET DFEQFKADTA VSLQAYAKQV AKKKQNEQDI FDQYGTQHPA
IQSILAKIGM SSVDDGEPHP QKETIALLRV NKGIGNLTAR KLVNSIC