Gene PHATRDRAFT_54381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_54381 
SymbolH1 
ID7200154 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011674 
Strand
Start bp762091 
End bp763262 
Gene Length1172 bp 
Protein Length229 aa 
Translation table 
GC content51% 
IMG OID 
Producthistone linker H1 
Protein accessionXP_002179285 
Protein GI219116981 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTCG GGAAGCTGTT GTTACGTTGT CTGACTGTGA GTATGACAAG CCCGGTCTGC 
GCAGTTCCAG AAACAGTGTC AATTATGCTG TCGTCTGATT TCTGTCCTAC GACGACGCCG
CCCCGGCGTG CGTAGTGCTC ACATTCAAAA GGAATCCATT TTGTGGAACC AGTACTGTAC
TTTTCTGTAT GTAGCATTTA CGAAGCGTTT TGTACTAGAC TGAAATCCGG CGCACCCTGT
GTTTCGAACT GGGCCCGGAC GCTGGACCCC GACAGAGCGC ACACATCTCT CTCTCAACAG
GTACGAGACG CTTTCCTCAC CATGACGGCG ATTCCGCACC GATTCATCGG CGTTCCCTTT
CTCTCTGTTA GTGATCTAGG TTATCCTTCC TCTCCGCATG TCTTTCTAAC GATCGTCTTT
CTATTATCTG CCAATTCATA GCCGAGAATC GTATAATCGT AGTTTCCATC ATGTCGTACA
AAGCCGGTAT CGAAGAAGCC ATTACGGACC TTAAGGACCG CACGGGTTCG AGTATGATTG
CGATTCGCAA GTACATGCAA TCCAAACTTC CCGCCGATAA GAAGTGGCAG AACGCTGTCT
TTCTGTCCAG TCTCAAGAGC GGAGTCGCCG CTGGTGACTT TGTTCAGGTC AAAAACTCGT
ACAAGATCTC GGCCGACTAC AAGAAAAAGA AGGCTGCCGC GGTCAAGAAA GCTGCTGCTC
CCAAGAAGGT CGCCCCGAAG AAGAAGGCGC CTACCGCGAA AAAGAGTACG GCCGCGAAGA
AGAAGACCAC GGCACCCAAA AAGACCACCG CGCCGAAGAA AAAGGCTCCG ACCGCCAAGA
AGGCCACTAC GGCACCCAAG AAGACTGCCA CAAAGAAGGC CACCGCGCCG AAGAAAAAGG
CAGCCACCGC CAAGAAGCCG GCGGCGCCCA AGGCGACAAA GCCGAAGGCT GCTCCCAAAA
AGAAGGCAGC CTCTAAGAAG GACGCTGCTC CCAAGCCAGC TGAAACCAAA TAAATATTTG
GCTTGACTGA GAGCTTGCAC TCTAGCTGTG CCTGACGCAT CCAGTGTCAG TATAGCAACC
AAAACCGGAT TTTCTAAAAT CCACCCAACC CACAAAGGCA GTCAATTGTA TCCTCTTATT
GTACAATCAA GCAGTGAACT TCTTTTTGCG TA
 
Protein sequence
MTVGKLLLRC LTTEIRRTLC FELGPDAGPR QSAHISLSTA ENRIIVVSIM SYKAGIEEAI 
TDLKDRTGSS MIAIRKYMQS KLPADKKWQN AVFLSSLKSG VAAGDFVQVK NSYKISADYK
KKKAAAVKKA AAPKKVAPKK KAPTAKKSTA AKKKTTAPKK TTAPKKKAPT AKKATTAPKK
TATKKATAPK KKAATAKKPA APKATKPKAA PKKKAASKKD AAPKPAETK