Gene PHATR_44089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44089 
SymbolARP1 
ID7203860 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp933100 
End bp934548 
Gene Length1449 bp 
Protein Length391 aa 
Translation table 
GC content51% 
IMG OID 
Productactin related protein 
Protein accessionXP_002186156 
Protein GI219113145 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.549451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGCCACGCGT GAGGACAAAG ATTGACCATG AGCGCCGAAG GGTCTGGTCC GGAAGGCGGG 
TCACTCCTAC TCAACCAACC TGTTGTCATC GATAATGGCA CAGCTAGCAT CAAAGCTGGG
TTTGCGGGAT CGTCCAAACC AAAGGTACGG ACCGAACGCA CCGAAAGTAC CGCAACGAAG
CGTCCGAGGG ACGTCGCTAT GCCCAAAGAT CAGGATCCAG TGTCGGTACA ATTGCAGACT
CCATTGTTTC TACTATACTA ACCAGTCTTC AATTGTTGCA GGTTGTAGTT GGTACCAAAG
TTGGACGGCC TAAGCATATG CGAATCATGC CGGGAGGTGC TCTCGAGCTT GATCAAGGCA
GTAGCATTTT TGTTGGGCCG AAACTGGACG AACACCGTGG CGCCTTTGTT CTGGAACACC
CTATGGACAA AGGTATGGTG ACGGACGGAG GATGGGACGC AATGGAACAC CTTTGGGAGG
TACGTCGAGT GTCAAAGCGA CTTAAATTGT TCGTCATTGG CTACAGTCAC CTAGAAATTG
GTAATGTTCA CCGTAACTCA ACCGTGTGTC TTTTTTTACA CTGCCTAGCA CGTGTATTCG
AAACAGTGCC TGAACGCAAA GCAAGAGGAA CATCCTGTTC TCTTGACGGA AGCGCCGCTT
AACCAACGCA AAAATCGTGA CCAAATTGCG GAAATCTTTT TCGAATCCAT TCGGGCACCC
GCACTGTTTT TTACACCGCC CTCAGTTCTC AGCTTGTACG CTTCTGGTCG CACTACTGGT
GTTGTCTTGG ACGTTGGTGA AGGTGTCACT CACGCTGTAC CAGTCTATGA AGGATTTGCT
TTGCAGCATT CGGTAGTGAG GAGCGATGTT GCGGGTCGGG ACGTCACCAA ACAACTGCAG
ATGCAACTGA GACGAGCTGG CCTGTCGTTC ACCACTACGG CAGAGTCGGA ACTGGTCAAG
AAAATTAAGG AAGAGTCGTG CTATTTGACG CCATCTCGGC TGGCAAATGA AAATACCATC
AAAGAATCCA ATACGCCGTA CCAACTACCC GATGGACAAA CTGTGAACCT GTCCACGGAA
CGGTACCAGG CGCCCAACGT TCTTTTTGAC CCGTCCCTAA TTGGATCCGA GGAACCTGGA
GCCGCCGAAG TTTTGGTCAA CTCGATTATG AAAAGTGACA TTGACTTGCG ATCAAAGTTG
TTCTCGCAAA TCGTGTTGGC CGGCGGTTCT TCGCTCACAC CAGGCTTTGG CGACAGAATG
TTGTACGAAG TTCGTTCTCG AGCTCCCTCA CATTTACGCA TACGTATCTC GGCACCGCCA
GAAAGATTGC ACTCAGCCTA CGTAGGAGGA TCAATCTTGG CAAGTCTGTC GACGTTCAAG
AGCATGTGGG TTTCCCGGTC CGAGTATGAG GAACACGGAA GCAACATTCT ACACCGCCGA
GAGTTGTAG
 
Protein sequence
MSAEGSGPEG GSLLLNQPVV IDNGTASIKA GFAGSSKPKV VVGTKVGRPK HMRIMPGGAL 
ELDQGSSIFV GPKLDEHRGA FVLEHPMDKG MVTDGGWDAM EHLWEHVYSK QCLNAKQEEH
PVLLTEAPLN QRKNRDQIAE IFFESIRAPA LFFTPPSVLS LYASGRTTGV VLDVGEGVTH
AVPVYEGFAL QHSVVRSDVA GRDVTKQLQM QLRRAGLSFT TTAESELVKK IKEESCYLTP
SRLANENTIK ESNTPYQLPD GQTVNLSTER YQAPNVLFDP SLIGSEEPGA AEVLVNSIMK
SDIDLRSKLF SQIVLAGGSS LTPGFGDRML YEVRSRAPSH LRIRISAPPE RLHSAYVGGS
ILASLSTFKS MWVSRSEYEE HGSNILHRRE L