Gene PHATRDRAFT_17779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_17779 
SymbolNAP 
ID7196847 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1901024 
End bp1902364 
Gene Length1341 bp 
Protein Length318 aa 
Translation table 
GC content45% 
IMG OID 
Productnucleosome assembly protein 
Protein accessionXP_002176865 
Protein GI219110227 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.184472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGGAACGTG CCCCCCTGTA CTACATCGCG ACTAATTCGT GGCTAACACG ATGGTAACGA 
ACGCGTTCAA AGGAATTGGT AAATCCCTCT TTGTCCTTTA CAGTTAGTCA CCACACAGAG
ATTTATATGA GCTTTTAAAA TGACGGACAT TAACAACAAC GAGCAAGATC CTCTGCAAAA
TATCGGTGTT GGGGACGAAG ACTCAGACGC AGAGGACGAT GATACCGCTG AAGACAACCC
GATGGCCGAT CTCCCCGATT ACGTTGCACA TCGCGTGGAA AAACTGCGTG GTCTGAATGA
GAAGCGAGAA GAGATCATGA AAGATTACCT CACGGAGCGA GCCGACTTGG AACGCAAATA
CGCGGTTATT TTAAACCCGC TCTATGAAGA ACGTGCGACG ATTGTGAACG GTGAGAAGGA
TGATGAAATA AGCGCCGAAG TCACTCGCCG CGGAGATAGC TCTAGCGCAC ACCATAATGA
TGCGGAACCG TACGTAAAAG GCATTCCACA ATTTTGGCTG AGTACTATGA GCCAAGAAGA
GACGATCAGT GAGAGTCTGA CAGAAGAAGA CGTTGACTGT TTGGAACACC TCGAAAACAT
CACATGTGAA GACTTTGCTG ATGGAAAAGG ATTCGTTCTT CGTTTTCATT TTGCTCCTAA
CGACTATTTT CATGATGCTG TATTGGTGAA GACATACGAT GTTCCGAATC TTTTGCTTTC
CGATGAACCC ATCTTGAAGA ACGTCCACGG ATGCAAGATT CAGTGGAAAG AAGGGAAATC
TCTGACACAT CGCCAGATCA AGAAGAAACA GCGCGGAAAG GGCAAGAATG CTGGTCAGGT
ACGCACCATC TCCAAAATGG AGAAGAAGGA ATCGTTCTTC CATTGGTTCG AGCCGCCAGC
AATGCCAAAG ATGGATGAAG TTGACGAGGA ACAGGCGGAC GAGCTAGAAG AATTTTTCGA
TTCAGATTAC GAAATAGCGC AGGCGTTTCG GTCACATGTT ATTCCTTCAG CCGTTCTTTG
GTTCACCGGA GAAGTAAGTT CTACGAACAC GAACTTTTCG TCCAGATTGC TGCACCTCTA
ATATTTTTTT TCCCTCTAGA TTATGGCTCA GGAAATGATA CACGCAATCG AAGATCTTAG
AGAATCAGAG GAAACAGATT GATGAGGATT GAATGTGTAA TTGGAATTGA CGGATCGGCA
AGATTACAAT GATTAATCGA CAAAGACTGC ATAGTTTTTG ATCGAAGGAA GAAAAATATA
TATGCAACGC TTGCTACTCA TATGTTAGAT ACTGGTATCT CACTGTGAAT TCGATAGATT
CCTGTCTTGG TGGGCCCAGT C
 
Protein sequence
MTDINNNEQD PLQNIGVGDE DSDAEDDDTA EDNPMADLPD YVAHRVEKLR GLNEKREEIM 
KDYLTERADL ERKYAVILNP LYEERATIVN GEKDDEISAE VTRRGDSSSA HHNDAEPYVK
GIPQFWLSTM SQEETISESL TEEDVDCLEH LENITCEDFA DGKGFVLRFH FAPNDYFHDA
VLVKTYDVPN LLLSDEPILK NVHGCKIQWK EGKSLTHRQI KKKQRGKGKN AGQVRTISKM
EKKESFFHWF EPPAMPKMDE VDEEQADELE EFFDSDYEIA QAFRSHVIPS AVLWFTGEIM
AQEMIHAIED LRESEETD