Gene PHATRDRAFT_42927 details


Gene Information

Locus tag: PHATRDRAFT_42927
Symbol:
ID: 7196184
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1538671
End bp: 1539925
Gene Length: 1255 bp
Protein Length: 383 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002177316
Protein GI: 219111129
COG category:
COG ID:
TIGRFAM ID:
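
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that the start and end positions are 1-based and inclusive, and that the gene span runs from the start codon through a single stop codon; neither is stated in the record. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def non_coding_length(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not accounted for by codons.

    Assumes the span includes exactly one stop codon after the
    last amino-acid codon.
    """
    coding_len = (protein_len + 1) * 3  # amino-acid codons plus stop codon
    return gene_len - coding_len


gene_len = span_length(1538671, 1539925)
print(gene_len)                          # 1255
print(non_coding_length(gene_len, 383))  # 103
```

Under these assumptions the 1255 bp span matches the reported gene length, and the 103 bp remainder is the amount of sequence one would expect to be intronic, in line with the record flagging the gene as spliced.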


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00363535
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGATGA ACACGAAACT AGCACCAAAT GTTGTCCACT GCATGCTTTT TGGCATATCA 
AGCTTTTCCC TGATAACGAC TACGAAAGGA TGGCAAGGAG GATTTTCCAT TCAAAGGCCA
CGGAAATTCT TGATCGTCCC CAAGACGTTG CCGCAGAAGA CCAGTCCTCG GTTTCTCCAA
ACCGATGTAG TGGTGCAGCA AAGCGCTCGA GACATCGAAG ATCCCATATC AAATATACAT
AATGGAAAGC GCCTATTGCG TTCGGAAAAA ATTATGTCCA TGTTTACGAA GAATGGAAAA
TCCGAAAGAG AAGCAGTGGT TCTGCAACAA CTACGACAAG AGAATGACCT GCTGCGGGTT
GCTCTACAAC GAGCCGAAGC CGAGAACGAG CGGCTGCATA GACACTACGA TAATGGAAAT
CGTATTATTT TGGAAAGCTT TGAAGGAGAA GGAAGATTTC GAAGAGCCGA TGACGGAGTA
ATGTCGGACA TTCCTATGAC ACTCACAGGA GAAGAAATGC TCACGGAAGA AGCTTCACAG
TGGTGTGATG AATTGGAAGA TGATGCCTGT CCGCTGGAAC CGACCATTTC GTTTGGAGAG
GCACTACGAG ATCGAGCTTA CTGGTTGGTG GGGCTTTTGA TCATGCAATC ATGCAGCGGC
ATTATTCTGG CACGGAATGA GGTTCTACTG GCCAATCACC CTGTCAGTGA GTAATTTATG
TTTGCTGCTG ATTTGATCAA CAGACCCCTC ATTGTCGTGC GAGCTGTTCA TTTATTCTGT
CCTAACAGTA GAGTGTTTTC ATTTTTTAGT TATATACTTC TTAACCATGC TGGTGGGTGC
CGGCGGAAAC GCCGGCAACC AAGCCTCGGT CCGAGTGATA CGGGGGCTTG CTCTCGGTAC
ACTGAACGAA AAGACACAGG GCCAGTTTTT GTCACGAGAA CTCAAAATGG CGTGCGCACT
TAGTGCTATT CTCTCGGTAA CTGGGTTTGT CCGAGCCATT GCGTTTCGGA CGCCCTTCTC
CGAAGCGATC GCCGTTACAA GCGCGTTAGC ATTGATTGTT TTCTCCAGTG TGTGTCTAGG
GGCAATTCTT CCACTGGGAC TGAAAAGGTT AGGCGTCGAT CCCGCGCACA GCTCCACGAC
TATTCAAGTT ATCATGGACA TTCTCGGCGT CGTCATTGCT GTAGCTGTTT CCAGCATTTT
GCTCGACAGT CCGCTAGGGA TTCTTCTCAT TTCTAGACTT GGTGGGGGTT CCTGA
 
Protein sequence
MVMNTKLAPN VVHCMLFGIS SFSLITTTKG WQGGFSIQRP RKFLIVPKTL PQKTSPRFLQ 
TDVVVQQSAR DIEDPISNIH NGKRLLRSEK IMSMFTKNGK SEREAVVLQQ LRQENDLLRV
ALQRAEAENE RLHRHYDNGN RIILESFEGE GRFRRADDGV MSDIPMTLTG EEMLTEEASQ
WCDELEDDAC PLEPTISFGE ALRDRAYWLV GLLIMQSCSG IILARNEVLL ANHPVIIYFL
TMLVGAGGNA GNQASVRVIR GLALGTLNEK TQGQFLSREL KMACALSAIL SVTGFVRAIA
FRTPFSEAIA VTSALALIVF SSVCLGAILP LGLKRLGVDP AHSSTTIQVI MDILGVVIAV
AVSSILLDSP LGILLISRLG GGS
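
Both sequences are displayed in blocks of ten residues, sixty per line, so the spacing has to be stripped before any length or composition check. The following is a minimal sketch of that cleanup together with a GC percentage calculation. The function names are hypothetical, and the example uses only the first displayed line of the gene sequence, so its GC value is not the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, copied with its block spacing.
first_line = "ATGGTGATGA ACACGAAACT AGCACCAAAT GTTGTCCACT GCATGCTTTT TGGCATATCA"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this line only
```

Applying `clean_sequence` to the full gene and protein blocks and taking `len` of each result is a quick way to compare them against the Gene Length and Protein Length fields.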