Gene PHATRDRAFT_36398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36398 
Symbol 
ID7201797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp353679 
End bp354839 
Gene Length1161 bp 
Protein Length386 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180809 
Protein GI219120127 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCTT CACAGTCTCT TTCGCGAATT CGAATGCCTG CGGAGTGGGA ACGGCACGCC 
GCGTGCTTAA TTTTATTTCC TCACAATGCT GCAACCTTTC GACTCTCGTT GGCCCAGCCT
CAAGTCTTAA GAGTAGCGCG AACGATTGCC ACCGTCGGCC AAGAGCCTGT GATATTGTTC
GCCAATGATG AAATGGAAAC ATTCCGGTTA CGTGAATTGC TGAAGCTGGA CGAAAATATC
CGGGTCTTGA CTTGTCCCAG CAACGATACT TGGGCTCGTG ATACGGCTCC GACTTTCGTC
ACTCTAAACG ATGGCGACGG GCAAAACAAT GAGTTATTGC TCAGAGGTTT GGACTGGGAT
TTCAATGCCT ACGGAGGTGC CGAGGAAGGA TGTTACTGGC CCTGCTGTCT TGATCAGAAA
GTTGCGGCAA CAATGTGCCG ACAAATAAGT GACGTAGGAA TTTTGGCGGA GCCGATTGAG
TCGCTCCCGA TTTCCTTGGT GCTAGAAGGA GGATCCATCC ATACCGATGG TGAAGGAACT
ATTTTGACAA CCAGAGAATG CCTTTTGAAT AACAACCGGA ACCCCAGCAT GTCGCGGCAA
GAAATCGAGG AAATCATTTT ATGTAACACG GGCTGTACAA AGATGATTTG GCTAAGCGAT
GGGCTGGCCA ACGACGATGA TACGAACGGC CACGTCGACA ACTTTGCCTG CTTTATCAGA
CCAGGACACG TTTTGTTGGC TTGGACGGAT GATGAAGTTT ATGACACCGA AAATTACGTC
CGATGCCGCG CCGCTCTGCA AATATTACAG AAGGAGCGAG ACGCCCGTGA ACGCAACTTG
ACGGTGGACA AATTATACCT ACCGACGCCA ATGACGTACT CCCAAGAAGT AGTTGATTCT
CTCAATTCTT GTATATCTGG TCCAAATATC GCTGCTAGAC ATGCTGGTGA GAGACTTGCT
GCTTCTTACA TCAACTTTTA TATTGCGAAC GGTGCCGTAA TTGTTCCTCA ATTTGATGAC
GATGTTTATG ATTCCAAGGC TATCGAGACT CTTGAGGAAC TCTTCCCTGC GCATAAAGTA
GTCGGTGTTT CCAGTAAAGA AATTCTTATT GGCGGTGGGA ATATTCACTG CATCACACAA
CAAGTTCCTT CACTACTTTA G
 
Protein sequence
MKASQSLSRI RMPAEWERHA ACLILFPHNA ATFRLSLAQP QVLRVARTIA TVGQEPVILF 
ANDEMETFRL RELLKLDENI RVLTCPSNDT WARDTAPTFV TLNDGDGQNN ELLLRGLDWD
FNAYGGAEEG CYWPCCLDQK VAATMCRQIS DVGILAEPIE SLPISLVLEG GSIHTDGEGT
ILTTRECLLN NNRNPSMSRQ EIEEIILCNT GCTKMIWLSD GLANDDDTNG HVDNFACFIR
PGHVLLAWTD DEVYDTENYV RCRAALQILQ KERDARERNL TVDKLYLPTP MTYSQEVVDS
LNSCISGPNI AARHAGERLA ASYINFYIAN GAVIVPQFDD DVYDSKAIET LEELFPAHKV
VGVSSKEILI GGGNIHCITQ QVPSLL