Gene PHATRDRAFT_38891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38891 
Symbol 
ID7203616 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp467557 
End bp468760 
Gene Length1204 bp 
Protein Length373 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182969 
Protein GI219125397 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.319117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCACCA CCGACAACTC CAAACAAGCC GCTAGTGGCG CTGCCAGTCG TCTGGCTCGC 
AAAAATATCC TGGAACTCGC GCCCTACCGA TGCGCCCGGG ACGACTATAG TGAAGGCGTC
CTGTTGGACG CCAACGAAAA CGCGTTTGGT CCTCCTACCA GACCGGAACA CATCAAGGAT
CCGTTGGAAC GGTACCCGGA CCCGTACCAA GTGCCGTTGA AGCAAAAGCT CGCCGCGCAC
CGAGGCAACG AGCTGGAATC GTCGAATATT TTCGTGGGAG TCGGATCGGA CGAGGCAATT
GATCTACTCA TGCGCATCTT TTGTGTTCCG GGAAAGGATA AAATCATGCA AACCCCACCA
ACCTACGGAA TGTACAAAGT TTGCGCCAAA ATTAACGATG TGGAAGTGGT GAATGTCCCA
TTGACTGCCG ATTTTGACTT GATTATTCCC AATGTACGTC TGACGAACCG ATACCGTGTT
CATTGCTATC CTCGGCCTTT CTTAAACGAT TTCAACGCTT TGCTCTTGCA TGTAGATTTT
GGAAGCCATA ACGCCGGAGG CCAAACTGCT CTTTCTCTGT TCTCCCGGAA ATCCTACGGC
CAAGGCGTTG CCGTTGGCCG ACATTGAAGC CGTCTTACAG AGTCCCCAGA CGTGTGACAC
AATCGTGGTC GTAGACGAAG CGTACGTGGA CTTTTCGACA CAGGGATCGG CCGTGGGTTT
GGTGCACCGG TACCCCAACG TGGTGGTGTT GCAAACGCTG TCCAAAGCCT TTGGATTGGC
GGCGATTCGG TGCGGATTCT GCATCGGACC ACCGGATATC ATCCAACTCA TGAACAATTG
CAAAGCACCG TACAACGTCA ACGCGTTGAC TTCGGAATTG GCAATACAAG CGTTCGATCA
CGTGGATGTA CTGGACACGA ATATTGCGAG TTTGCTGTCG GAACGTGCCC GGGTGGCGGC
CTCGTTGGCA GAGTTGGACT TTGTGGAAAA GGTGTATCCG TCGGACGCCA ACTTTTTGCT
CTTTCGGGTG GCGTCGCACG CACAAGCCGT GTACAAGGAC ATGGCGGATC AGGGTGGTGT
GGTGACTCGC TTCCGGGGCA CCGAAATGCA TTGCGACGAA TGCATTCGGG TCACGGTCGG
CACTCCGGAC GAAAACGAAG CCTTTTTGAA GGCTTTGCAA ACGTCGTACC GGGCGTTGGC
GTAA
 
Protein sequence
MCTTDNSKQA ASGAASRLAR KNILELAPYR CARDDYSEGV LLDANENAFG PPTRPEHIKD 
PLERYPDPYQ VPLKQKLAAH RGNELESSNI FVGVGSDEAI DLLMRIFCVP GKDKIMQTPP
TYGMYKVCAK INDVEVVNVP LTADFDLIIP NILEAITPEA KLLFLCSPGN PTAKALPLAD
IEAVLQSPQT CDTIVVVDEA YVDFSTQGSA VGLVHRYPNV VVLQTLSKAF GLAAIRCGFC
IGPPDIIQLM NNCKAPYNVN ALTSELAIQA FDHVDVLDTN IASLLSERAR VAASLAELDF
VEKVYPSDAN FLLFRVASHA QAVYKDMADQ GGVVTRFRGT EMHCDECIRV TVGTPDENEA
FLKALQTSYR ALA