Gene PHATRDRAFT_47140 details

Gene Information

Locus tag: PHATRDRAFT_47140
Symbol:
ID: 7201933
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 588528
End bp: 589962
Gene Length: 1435 bp
Protein Length: 375 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002181405
Protein GI: 219122129
COG category:
COG ID:
TIGRFAM ID:
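
The listed gene length is consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions on the replicon. The record doesn't state that convention, so it is an assumption here. A minimal Python sketch, with a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(span_length(588528, 589962))  # 1435, matching the listed Gene Length
```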


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.136457
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTTGA GAAACAGCCC ATCCAGGGGG AGGATCCATA GTCTAGCCGT GATAGCGACG 
GCCTTCATAT TCGTTTTGCA CATGTACGTT GGTGTATACG AAAGTGCGTC TTTTTACCAG
CAAATACCCA TGATTGGCAC GTGGTCTACT GAGTCAAAGT CCCGTGTGGA GACAGACGAC
ACCAAGCGCG CTACCCGCAA GCATTCCATG AGAGAGATTG TGCCGGAACC AGTTCCTCGA
CCTTTGATTG AAACTCTCGT TACTGGACAA AACGTTACAG GGGACGTTGC GTGGCTCCTG
AATATGGCGG TGCTGGGATT CTCAAAGTGC GGTACATCCT TCATGATGCG CTATCTCGGT
CGACACGAAG AAATAGCCAT GCTGACTGAC GGTGAACACT GTGAGCTGAC AAGGCGCAAT
GAAGATTCGG CCCTTATCAA GTCCTTGATG GATGGGCTTC CCAGCGGAAA GATAGCGCGC
GGCTTGAAAT GTCCTATCCA TTTGGAAAGC CCCAGAGCCA TGCAGAGCTT CTCCCGATAC
TTCCCGAACA CAAAGATAAT TGTTGGAGTC CGACATCCCG TCCTTTGGTA AGTAGTATGC
TTGAACACAC ATGAGCAACC AGAAAGGGCT CTATCCTGAT TTCTTCTTCT AATAACATTG
GAACAGGTTT GAATCCTTCT ACAACTGTGA GTGGATATAT GCTGAATTCT TCTTTCGGGA
CTGTCTTTGT ACCATAACTC TCATCCAACA TATTTGTATT CAGATCGGCA TCGAGACGGT
AAGACCCAGC TGCTGCCAGC CCAGGAACTG ATTGGAAAAT GTGCGGATTT GGGGCCCTTT
GAAAAGGTGG CTTCGGTCTG TACCGAAGGA GCCAAGTTCC ACGAGCCTTT GGCTCGCTTG
GGAAAGACGA ACATGCAGAG CACAGATGAG CGACAATACT TTTCGGCCGA CGCGCAGAAT
GTTTCAGACA CCGATGCTTT CTCCGGTATG AAAGATCTTC GTGTACGACG TAGCCCAGCT
CCAGGACAAG GACCACGACC GTTCTCAAAT CCTACTACAA GACTTGCAGA ACTTCCTGCA
AGTCACAAAG CCGTTCCAGC CGATGGTGGT AGAGCCCAAA AGACTGCATG ACGGAACCCG
TATTGATATC TGTGACCCCG AGTACAATCA TTTACGCGAG GTACTTGTGG ATACTGGAGT
GAAGGCGTCG AGATGGATTC GGCGATTTTT TGTCCATGCC GAAGGCGTGA CGGTGTCGTC
TCCCAAATTT TTGGACCAGG TGTTGGCCAA GTGGGAAGAA GATCCGTGCG AAGAACGCCG
GGCCGAGAAG AGCTCCGCCC CATCCCCTTG AATCAGATTG CCATGTACAA GTTGTAGGAG
TCGTCCAGTT GATAAATTAT GCAGGTATTG TTCGACCGGT ACGCATCATG TTACT
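
The gene sequence above is printed in groups of 10 bases, so the spaces and line breaks need to be removed before computing anything from it. The following is a minimal Python sketch for recomputing the length and GC content. The helper names are hypothetical, and it assumes GC content means the share of G and C among all bases, which the record doesn't define.

```python
def clean_sequence(text: str) -> str:
    """Remove the 10-base grouping (spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input: the first two blocks of the gene sequence, in the same layout.
# Paste the full gene sequence here to check the whole record.
example = "ATGAGCTTGA GAAACAGCCC"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Applied to the full sequence, the cleaned length should come out at the listed Gene Length of 1435 bp.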
 
Protein sequence
MSLRNSPSRG RIHSLAVIAT AFIFVLHMYV GVYESASFYQ QIPMIGTWST ESKSRVETDD 
TKRATRKHSM REIVPEPVPR PLIETLVTGQ NVTGDVAWLL NMAVLGFSKC GTSFMMRYLG
RHEEIAMLTD GEHCELTRRN EDSALIKSLM DGLPSGKIAR GLKCPIHLES PRAMQSFSRY
FPNTKIIVGV RHPVLWFESF YNYRHRDGKT QLLPAQELIG KCADLGPFEK VASVCTEGAK
FHEPLARLGK TNMQSTDERQ YFSADAQNVS DTDAFSGMKD LRNFLQVTKP FQPMVVEPKR
LHDGTRIDIC DPEYNHLREV LVDTGVKASR WIRRFFVHAE GVTVSSPKFL DQVLAKWEED
PCEERRAEKS SAPSP
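
Because the gene is marked as spliced, the 375-residue protein accounts for only part of the 1435 bp gene span. A rough Python sketch of the arithmetic, assuming three nucleotides per residue plus one stop codon (the helper name is hypothetical):

```python
def coding_nucleotides(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_length_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length_bp = 1435      # Gene Length from the record
protein_length_aa = 375    # Protein Length from the record

coding = coding_nucleotides(protein_length_aa)
print(coding, gene_length_bp - coding)  # 1128 307
```

Under these assumptions, the remaining 307 positions would be introns or other non-coding sequence within the gene span. The record doesn't list exon boundaries, so the split can't be confirmed from this page alone.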