Gene PHATRDRAFT_49801 details


Gene Information

Locus tag: PHATRDRAFT_49801
Symbol: hHrd1
ID: 7198372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011692
Strand:
Start bp: 363488
End bp: 365386
Gene Length: 1899 bp
Protein Length: 632 aa
Translation table:
GC content: 58%
IMG OID:
Product: predicted protein
Protein accession: XP_002184604
Protein GI: 219128825
COG category:
COG ID:
TIGRFAM ID:
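
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and is not part of the source record. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that encodes no residue. The function name `check_lengths` is hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 363488
END_BP = 365386
GENE_LENGTH_BP = 1899
PROTEIN_LENGTH_AA = 632


def check_lengths(start_bp, end_bp):
    """Return (span in bp, expected protein length in aa).

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon.
    """
    span = end_bp - start_bp + 1      # inclusive coordinate span
    codons = span // 3                # the gene is unspliced, so the span is all CDS
    return span, codons - 1           # drop the terminal stop codon


span, protein_len = check_lengths(START_BP, END_BP)
print(span == GENE_LENGTH_BP, protein_len == PROTEIN_LENGTH_AA)
```

Under these assumptions the span agrees with the listed gene length, and the codon count minus the stop codon agrees with the listed protein length.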


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGATCA TACCGATGGA CGACCATGGC AACGACGACG ACCACGACGA GCAACGACTC 
CAGCGTGAGC GCGAACGCGC CGTGGAGCAA ATGCTCTGGG CACAGGAGCA AGCGGAACAA
CAAGACCAGA GACAATCGGC ACACGATCCG AATCCAACAC GTTTACCACC ACCCGACGAT
CCTGCCGTAC CCTTCCCTCC GCTCCCCGAC GAACAAGAAC CGCCCCTGCC TCCGCCATTG
TCCCACCAAA ATAAGTCCTG GTCCTACACG CAGTGGAGTT TCGCCGCCGC CGGAGCGACA
CTCTGGTACG CACTACGCAC CCGGGACGAA CAGTGGTATC TGGCCGTCGT CTATCTCCAT
TCCTCGCGGT GGGCGTGTGC CGTCCTCGGC AACGCGCTAC TCGCCGCGGC CGTCGCCACT
TTCCAACTCA CCGTCCGTCT GTTCCTACCC AACGGAGGCT TGCGCGTACA CGAAGCGGAA
GGTTTGCAAG ATTTCTTTCG TTGGAACGTG ACGGAAACGT GTCTCGCCTT GACCATGTTC
CGCTCCGAAC TGACCGTGCA GACGGCGGTG GAATTTGTGG TCCTCATTCT CTGCAAGTGT
CTACACCACG TGGCGAATAT GCGGGAACAG CACGTCCGTA TGACGCAGGA TGCCGTGGTC
CGGTGGCGTC CGGAACGGAT CGCACCACAA GCCTCCTGGC CACCACTCCC CGCCGTGCCG
ACAGCGCACT GGAGGATCCT GGTCTTTTTG GGAATCCTCC AACTTGGTGA TCTCTACGCA
CTCCAGTACT TTGGTCGGGA CATTGCCGAG AGAGGACCCT CCGTCAATAT ACTCTTCGCC
TTTGAAGCCG CCATTCTCCT GGTCTCGGCA TGGAGTCACC TGTTGCTCTG GCATATATAC
GTAGGGGACG GATTGCTCCA TTTTGGACAC GACCACTATC CGCGCAGTTT CGTGGCGCGA
TGGCTCCATA CCTGGAAAGA ATACAAGGCC ACCTTGACGT TTGCGGTCGA GTTGCAGGCA
CAAACCGTAC AGTTCCTCTT CTATTTGACC TTTTTCGCCA TTGTCATGAC GTACTACGGC
GTACCAATTA ATCTGTTCCG GGAAGTATAC GTTAGTTTTG CCGCACTCAA GGACCGGCTC
TGGGCGTTTC TGCGCTACCG CCAGCTCATG GCCAGCATGG ACCGCTTCGA CAGCGTCACG
GACGAGGAAC TCGAACAAGC CGGTCGGGAT TGCATTATTT GTCGAGACGA AATGAAAACG
CACGACTGCA AAGCCCTGCC CGTATGCCGC CACCTATTCC ACAAATCCTG TCTCCGCGAA
TGGCTCGTCC AACAACAAAC CTGTCCCACC TGTCGGAGTG ATATTGGTGC CAACGAGGTG
ACGCAAGAAC GACGCCGTGC GGCACAAGCC GCAGCGCAAG AACGACAGTC CGCCGACGAA
TCAACACCGT CGCCCGCCAC CACGTCACCA GATATTCTGT CACCCGCGGA TGCGAGCGGT
TCCGTGGAGC CAACGTCCGG GGCTGAATCC CCACCCACTC TCACCGAAGA AGACGGGCAC
GATTTCGAAA CCATGCTCCG ACACTATCAA ACCACGCTAC AAGCTCGGAT TCGACAACGA
TCGCGGCCGG CTCTTGTCCT GCCCGGACTG TACCAGGTCA CGCGATCAAG TGGTGCGTCC
GTTTACACCG GTGCACACGA CGATGCCACC CACCAAACGC CGACCGTGGT GAGAACCGTT
CCCCGTGGCG TCGTCGTGCT GGCTCTCGAG GGAGCAACGC TACGGTTCGT TGGTCCCGAA
CCCGTCGAGG CCGTACGTAT TCCAGATGGT TGGATGGCCC TGGCGGATGT AGAATTTCGG
CTCGCCATTG GTAAAGAAGC ACCACGCTCA GCAATTTAA
 
Protein sequence
MTIIPMDDHG NDDDHDEQRL QRERERAVEQ MLWAQEQAEQ QDQRQSAHDP NPTRLPPPDD 
PAVPFPPLPD EQEPPLPPPL SHQNKSWSYT QWSFAAAGAT LWYALRTRDE QWYLAVVYLH
SSRWACAVLG NALLAAAVAT FQLTVRLFLP NGGLRVHEAE GLQDFFRWNV TETCLALTMF
RSELTVQTAV EFVVLILCKC LHHVANMREQ HVRMTQDAVV RWRPERIAPQ ASWPPLPAVP
TAHWRILVFL GILQLGDLYA LQYFGRDIAE RGPSVNILFA FEAAILLVSA WSHLLLWHIY
VGDGLLHFGH DHYPRSFVAR WLHTWKEYKA TLTFAVELQA QTVQFLFYLT FFAIVMTYYG
VPINLFREVY VSFAALKDRL WAFLRYRQLM ASMDRFDSVT DEELEQAGRD CIICRDEMKT
HDCKALPVCR HLFHKSCLRE WLVQQQTCPT CRSDIGANEV TQERRRAAQA AAQERQSADE
STPSPATTSP DILSPADASG SVEPTSGAES PPTLTEEDGH DFETMLRHYQ TTLQARIRQR
SRPALVLPGL YQVTRSSGAS VYTGAHDDAT HQTPTVVRTV PRGVVVLALE GATLRFVGPE
PVEAVRIPDG WMALADVEFR LAIGKEAPRS AI
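
The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The following is a minimal sketch and not the database's own method. Because the Translation table field in this record is blank, it assumes the standard genetic code (NCBI translation table 1). The helper name `translate` is hypothetical, and only the first 30 bases and the final line of the gene sequence are used.

```python
# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are indexed in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna):
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only, ignoring any trailing 1-2 bases.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence above.
first_blocks = "ATGACGATCA TACCGATGGA CGACCATGGC"
last_line = "CTCGCCATTG GTAAAGAAGC ACCACGCTCA GCAATTTAA"

print(translate(first_blocks))
print(translate(last_line))
```

Each full line of the gene sequence holds 60 bases, so every line starts in frame and the final line can be translated on its own. Its translation should end in a stop codon, which is why the listed protein is one residue shorter than the number of codons.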