Gene PHATRDRAFT_40593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40593 
Symbol 
ID7198468 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp381395 
End bp382573 
Gene Length1179 bp 
Protein Length392 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184609 
Protein GI219128835 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.164443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTTG GATTGCGCCC AGACAATCGC ATTGCACTGC TCTTGCTCAT CGCCAAATCG 
GCTACCTCGT TCTACATAGC GCCGACAAAA GTCCTCCGTG CCTCTCTTTT CAATTCGAGA
AGACCGCCAC TATCTAGGCG GACGGCTCTC ATAGATCCCG GCCAAGGTCT GGAATCCATA
CAGACTACAA TAAATGGCCT CCCTTTGCTT CTTCCCAGCT ACATTCATGA CATTTTTGCT
GCCGAGGATC ACCCGTTTCG ATATGCTTTC ATGTTTCCTA CGGTAATGGC TGTGTCTATT
ACGTGCCAGT CCGCGGGCAT CGGCGGAGCT GCTCTGCTGA GCCCTATTTT GCTTTTAATA
TTTCCGTTGC TAGGCCCCGA GTATCCTTTA CAAACCGCAG CCGCCGCGAT TGCCAGTGCC
CTATTGACTG AATGTTTCGG CTTTCTATCG GGCCTTTCCG GCTATTGGCG GCGAGGGCTG
GTCGACTGGG GAGTTGCCTT TAAGTTCTTG GGATTGGCCC TACCCACGTC CTTTGCTGGA
GCGCTGTTGG AGCCTTCGCT GGCGGGAGAA ACGACGTTTC TGCGAATCTT GTACTCTACG
CTCATGTTGA CACTCTGTGC ATTTCTACTG TTCAGCGAAA AACCAGTAGC ACTTAGCGAG
GAGTGCGATT TTTCAACACA AGAAGACCCA ATAATCCGCA CGAAAACGGC TGTCGACGGC
ACAGTTTTTA CCTATCTCGA GCCAGACAAG TTAACGTGGA AAACAATTGG AGCAACTACC
GCTGGAGCTT CGCTTACTGG TCTACTTGGT GTTGGAATCG GTGAAGTGAT TTTACCACAA
CTGGTACGAA TATCTTGCAT GCCGCTGCCG GTCGCTGCTG GTACATCAGT GGCTGTAGTA
GTCCTGACAG CGCTGACGGC CGCCACTGTG CAATTCTCAG TCCTGGCCAA TGAACTCATG
ATCCTGAGCC CAGATTTGAC GCTTCAGGCG GCTCTTGGGC AAGTGGTACC TTGGAATTTG
GTGCAGTACA CGGTGCCTGG TGCTGTAGTG GGCGGACAAG TCGCACCCTA TCTAGCTTCC
AAACGTGTCT TGGACGATGA GACTATCGAG TCCATCGTGG CGGCACTGTT TGGTATTATA
GGCGTAGCGT TTGCTGCGAA AGTAGTGCTT TCGGGGTAA
 
Protein sequence
MAFGLRPDNR IALLLLIAKS ATSFYIAPTK VLRASLFNSR RPPLSRRTAL IDPGQGLESI 
QTTINGLPLL LPSYIHDIFA AEDHPFRYAF MFPTVMAVSI TCQSAGIGGA ALLSPILLLI
FPLLGPEYPL QTAAAAIASA LLTECFGFLS GLSGYWRRGL VDWGVAFKFL GLALPTSFAG
ALLEPSLAGE TTFLRILYST LMLTLCAFLL FSEKPVALSE ECDFSTQEDP IIRTKTAVDG
TVFTYLEPDK LTWKTIGATT AGASLTGLLG VGIGEVILPQ LVRISCMPLP VAAGTSVAVV
VLTALTAATV QFSVLANELM ILSPDLTLQA ALGQVVPWNL VQYTVPGAVV GGQVAPYLAS
KRVLDDETIE SIVAALFGII GVAFAAKVVL SG