Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49801 |
Symbol | hHrd1 |
ID | 7198372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 363488 |
End bp | 365386 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184604 |
Protein GI | 219128825 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATCA TACCGATGGA CGACCATGGC AACGACGACG ACCACGACGA GCAACGACTC CAGCGTGAGC GCGAACGCGC CGTGGAGCAA ATGCTCTGGG CACAGGAGCA AGCGGAACAA CAAGACCAGA GACAATCGGC ACACGATCCG AATCCAACAC GTTTACCACC ACCCGACGAT CCTGCCGTAC CCTTCCCTCC GCTCCCCGAC GAACAAGAAC CGCCCCTGCC TCCGCCATTG TCCCACCAAA ATAAGTCCTG GTCCTACACG CAGTGGAGTT TCGCCGCCGC CGGAGCGACA CTCTGGTACG CACTACGCAC CCGGGACGAA CAGTGGTATC TGGCCGTCGT CTATCTCCAT TCCTCGCGGT GGGCGTGTGC CGTCCTCGGC AACGCGCTAC TCGCCGCGGC CGTCGCCACT TTCCAACTCA CCGTCCGTCT GTTCCTACCC AACGGAGGCT TGCGCGTACA CGAAGCGGAA GGTTTGCAAG ATTTCTTTCG TTGGAACGTG ACGGAAACGT GTCTCGCCTT GACCATGTTC CGCTCCGAAC TGACCGTGCA GACGGCGGTG GAATTTGTGG TCCTCATTCT CTGCAAGTGT CTACACCACG TGGCGAATAT GCGGGAACAG CACGTCCGTA TGACGCAGGA TGCCGTGGTC CGGTGGCGTC CGGAACGGAT CGCACCACAA GCCTCCTGGC CACCACTCCC CGCCGTGCCG ACAGCGCACT GGAGGATCCT GGTCTTTTTG GGAATCCTCC AACTTGGTGA TCTCTACGCA CTCCAGTACT TTGGTCGGGA CATTGCCGAG AGAGGACCCT CCGTCAATAT ACTCTTCGCC TTTGAAGCCG CCATTCTCCT GGTCTCGGCA TGGAGTCACC TGTTGCTCTG GCATATATAC GTAGGGGACG GATTGCTCCA TTTTGGACAC GACCACTATC CGCGCAGTTT CGTGGCGCGA TGGCTCCATA CCTGGAAAGA ATACAAGGCC ACCTTGACGT TTGCGGTCGA GTTGCAGGCA CAAACCGTAC AGTTCCTCTT CTATTTGACC TTTTTCGCCA TTGTCATGAC GTACTACGGC GTACCAATTA ATCTGTTCCG GGAAGTATAC GTTAGTTTTG CCGCACTCAA GGACCGGCTC TGGGCGTTTC TGCGCTACCG CCAGCTCATG GCCAGCATGG ACCGCTTCGA CAGCGTCACG GACGAGGAAC TCGAACAAGC CGGTCGGGAT TGCATTATTT GTCGAGACGA AATGAAAACG CACGACTGCA AAGCCCTGCC CGTATGCCGC CACCTATTCC ACAAATCCTG TCTCCGCGAA TGGCTCGTCC AACAACAAAC CTGTCCCACC TGTCGGAGTG ATATTGGTGC CAACGAGGTG ACGCAAGAAC GACGCCGTGC GGCACAAGCC GCAGCGCAAG AACGACAGTC CGCCGACGAA TCAACACCGT CGCCCGCCAC CACGTCACCA GATATTCTGT CACCCGCGGA TGCGAGCGGT TCCGTGGAGC CAACGTCCGG GGCTGAATCC CCACCCACTC TCACCGAAGA AGACGGGCAC GATTTCGAAA CCATGCTCCG ACACTATCAA ACCACGCTAC AAGCTCGGAT TCGACAACGA TCGCGGCCGG CTCTTGTCCT GCCCGGACTG TACCAGGTCA CGCGATCAAG TGGTGCGTCC GTTTACACCG GTGCACACGA CGATGCCACC CACCAAACGC CGACCGTGGT GAGAACCGTT CCCCGTGGCG TCGTCGTGCT GGCTCTCGAG GGAGCAACGC TACGGTTCGT TGGTCCCGAA CCCGTCGAGG CCGTACGTAT TCCAGATGGT TGGATGGCCC TGGCGGATGT AGAATTTCGG CTCGCCATTG GTAAAGAAGC ACCACGCTCA GCAATTTAA
|
Protein sequence | MTIIPMDDHG NDDDHDEQRL QRERERAVEQ MLWAQEQAEQ QDQRQSAHDP NPTRLPPPDD PAVPFPPLPD EQEPPLPPPL SHQNKSWSYT QWSFAAAGAT LWYALRTRDE QWYLAVVYLH SSRWACAVLG NALLAAAVAT FQLTVRLFLP NGGLRVHEAE GLQDFFRWNV TETCLALTMF RSELTVQTAV EFVVLILCKC LHHVANMREQ HVRMTQDAVV RWRPERIAPQ ASWPPLPAVP TAHWRILVFL GILQLGDLYA LQYFGRDIAE RGPSVNILFA FEAAILLVSA WSHLLLWHIY VGDGLLHFGH DHYPRSFVAR WLHTWKEYKA TLTFAVELQA QTVQFLFYLT FFAIVMTYYG VPINLFREVY VSFAALKDRL WAFLRYRQLM ASMDRFDSVT DEELEQAGRD CIICRDEMKT HDCKALPVCR HLFHKSCLRE WLVQQQTCPT CRSDIGANEV TQERRRAAQA AAQERQSADE STPSPATTSP DILSPADASG SVEPTSGAES PPTLTEEDGH DFETMLRHYQ TTLQARIRQR SRPALVLPGL YQVTRSSGAS VYTGAHDDAT HQTPTVVRTV PRGVVVLALE GATLRFVGPE PVEAVRIPDG WMALADVEFR LAIGKEAPRS AI
|
| |