Gene PHATRDRAFT_39288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39288 
Symbol 
ID7195003 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp125307 
End bp126425 
Gene Length1119 bp 
Protein Length331 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183297 
Protein GI219126089 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.21088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCGC CTGCTAACAA AGCTGCCTAT AGCTGGCTGC TTCTACCTGC TGGCAACAAC 
GAAAAGGCGG CTACGGGAGG ATTCAATGCT TCCATGGTCT TAGACGGCAC GAGAAACCAA
GGCTCAACGC ATGTTCAAAT CCATCGAAAT ATTACGTTCG AAGGTGGCTA CAGGCTGCAA
CGCAACGATA ACGATCTGGA AGTAATGGCA AATCGAAGCA TCATGTTCGT GCACGTCGGA
AAATCGGGCG GAGAGACTAT CAAGCGTGTT CTAGAGATCG CATGTCGGAC TAAAGCAAAC
AAAGGTCAAC GGAAGAGATG CTATAGGAAT TTGACGCAAA GTCGTCTGTC TTCTTCAGTC
AAAAGCTATT TTCACTGCTT CAAAATTCGG CCGGCATCTG TGAAAGAAAC TGCTATACAT
GATGCAGATG CATTCTTGTT CAGCTTGCGT CACCCTTTGG ATCGTACACT CTCATGGTTT
CGAAACTTTG ATCCTCAAAA CTGCCCTGAA GGGAAACGAA CAAAGGCGAG CTGCTTGACG
GCACATTCCA TTAAAAATGA TCCTACCAGT TGGGCCGGTC ACTTTTTTGG AGTTTGCTTC
TCCAGCGCAG AAGCATGGGC ACAAGCTTTG AGTCATCCGA AAGACAAATG CAACGTTCTC
GCATGGGACA CCATCCTGGG ACGAATTGCA ATCCAAGATG AGTCTTTAGT GGCACACATG
ACGGCAAACA TTCGACGATA TGCGAAAGCT ACAGCAGCAA GGTATCCAGA GAAAGATGTG
CTTGTTGTGC GAATGGAATA TATGTGGAAG GATCTTAAAG CTTTGGATCT TAGTTTAGGA
GGAACTGGTC GATTTGGAGC TATGGCTGGT TCCAAAGTCG CCCACGGAAG CGAGGTGTAC
GAATTCCAAA ATGAGGATCT TTCGGCGGCA TCCGTCGTCA CAATGTGTTG TGCTCTACGA
GATGAAATGG AAGTTTTTTA TTCTCTTCTT CATAAAGCGC AAAATCTCGA TGCCGTTCAT
AAACGTCGAA CATGGGAAGA TGCTTTAAGC TATTGCGGAT CTACCTCATG GATAGATTTT
GAAGCACAAT GTCGTACCAT TCAGCAATCA TTCCATTGA
 
Protein sequence
MESPANKAAY SWLLLPAGNN EKAATGGFNA SMVLDGTRNQ GSTHVQIHRN ITFEGGYRLQ 
RNDNDLEVMA NRSIMFVHVG KSGGETIKRV LEIACRTKAN KDAFLFSLRH PLDRTLSWFR
NFDPQNCPEG KRTKASCLTA HSIKNDPTSW AGHFFGVCFS SAEAWAQALS HPKDKCNVLA
WDTILGRIAI QDESLVAHMT ANIRRYAKAT AARYPEKDVL VVRMEYMWKD LKALDLSLGG
TGRFGAMAGS KVAHGSEVYE FQNEDLSAAS VVTMCCALRD EMEVFYSLLH KAQNLDAVHK
RRTWEDALSY CGSTSWIDFE AQCRTIQQSF H