Gene PHATRDRAFT_46289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46289 
Symbol 
ID7201221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp826496 
End bp827545 
Gene Length1050 bp 
Protein Length338 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180522 
Protein GI219119527 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAATC AACCGTCCGT AAGCGGCGGT GATCCCAGTC TCGGAGGTGG AGGAGGACCG 
CATCACCCAA GCGGTAGCAG CCACGATCCT CACACGCAAC GCAATATTCC GGGATCGTAC
CCGGGTCAGT ATCACCCACA AGGTGGCTTT ATGCCACCGA ATACTACTCC CACGCCCGAT
CCTACGGTGC CGCTGTCGGC TGATCAACAG CAACAAAATG ACGCGATGCA ACGATCCGGC
AGTGACGCTT CCATGACAAT TTTGGATGCC GACACCCCGC CGAGCGTTCC GTCGCAATCG
CAGCACGGGT CCGAGATGGA CAGTTGGTTG GACGAGGACG CGGTACCCAC CGTCTTTCGG
TGGGAACACG GAGGTCGTCA GGTCTACATT ACCGGCACGT TCAACGGATG GAGCCGACAA
ATTCCGATGC ATCGATCGGG TAACGACTTT ACATACATTC ACAATCTCAA GCGCGGGAAA
CACGCGTTCA AATTCATTGT AGATAACGAG TGGAGGTTTG CTCCGGATCA ACCGACCGTC
GCCGATATTG AAGGGCGTGT GAACAATTTC GTAGACGTGA CAGATTTCAA ACCCTATACG
GGCGATCGCG AATTCGAGAG AGAAAAGGCG GCCGCCGAAT ACGGCGCTCC CTTGGAAGCC
GAAGACCAAC AGGACGAGGA CAATGTCAAC GTCGTCAGCA CCAGCATACC AAACGTCGAC
GGGCAGGCGT CTGGTAGTAA AGCCGATCAA GATGGCGAGG TCTTTTCCAA TACCATGCCT
GACGTGGACG ATTACACCAA GGAACCGCCG CCGCTCCCAC CCCACCTGCG ACACATAATC
CTAAACAAAC CACCCCAACT GCAGGATACG GCTGCTTTGC CCGTTCCGCA GCACGTGGCT
TTGAATCATT TGTACTGCAC CGCCATCAAA GACAACATGA TGGTGTTGGG GATTACGCAG
CGATACAAGA CCAAGTTTGT TACCACGGTT TACTACTCAC CGTGTTCGAG TAGCTAGGGT
TGATTTAACC TATAGGAACG ACCAAACTAC
 
Protein sequence
MGNQPSVSGG DPSLGGGGGP HHPSGSSHDP HTQRNIPGSY PGQYHPQGGF MPPNTTPTPD 
PTVPLSADQQ QQNDAMQRSG SDASMTILDA DTPPSVPSQS QHGSEMDSWL DEDAVPTVFR
WEHGGRQVYI TGTFNGWSRQ IPMHRSGNDF TYIHNLKRGK HAFKFIVDNE WRFAPDQPTV
ADIEGRVNNF VDVTDFKPYT GDREFEREKA AAEYGAPLEA EDQQDEDNVN VVSTSIPNVD
GQASGSKADQ DGEVFSNTMP DVDDYTKEPP PLPPHLRHII LNKPPQLQDT AALPVPQHVA
LNHLYCTAIK DNMMVLGITQ RYKTKFVTTV YYSPCSSS