Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20809 |
Symbol | AroB |
ID | 7201792 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 335277 |
End bp | 336725 |
Gene Length | 1449 bp |
Protein Length | 431 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | 3-dehydroquinate synthase |
Protein accession | XP_002180805 |
Protein GI | 219120119 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAGTTTCTT CTAGCCGTCG GGTCTGATCT TGTAAATATT TGGAAAACAA TCATGACGAG AAGAGACGTG TTGTTCCAAA TTCTCTGGTG TTTAGGATCT GCTACCGTTT CCGACGCATT CATCCCGGGA ACGGTAGGAC GATTTCCGAT TGTTCCCCCG GTGGCGAGCA CCGCCATCCG TGGCAACGAT CTGGGCAGCA ACACCGTCCA CAAGGAGAAT TTTGACATTG TTAGAGTGGA TTTGGACGAT GGACGAGACT ATCCCATTTA CATTGGCACG GGGTATTCGG AAGAACAAGG TGCGTCGGAG TGTCGAACGT GTTGGGGGAG GGACATCATC CTCCTGTTTT CTCACTGTCG TATTCCCCTT TTGGTGCGTT CCAATACTGT TATGTTGCAG CAACGGAAAT TCTGCAATCT CATATCAAGG GAAACAAAGT TCTGATTGTG ACGAATGATC GTATCGCTCC CATGTATTTG GAAAAGTACG CAAATCTACT CAAGGCCGGC GACAAGCTGT CGGTAGAGAC GTTGGTGGTG CCCGACGGAG AAGAGCACAA AAATATGGAA ATCATGCAGA CCATTCTCGA CAAATGTTTG GAGACCAGTT TGGATCGGAA AGCGACACTA GTGGCCCTGG GTGGCGGGGT AATTGGAGAC ATGGTTGGCT TTGCGGCCGC CATTTACCAG CGGGGAGTTA ACTTTGTCCA GGTACCGACC ACCGTCATGG CCATGGTGGA TTCGTCCGTC GGGGGAAAAA CGGGAGTAAA TCATCCGGGT GGTAAAAATA TGATTGGAGC CTTTCATCAG CCACAGTGCG TCTTTGTGGA TACGGAGACG TTATCGACCT TGCCCGACCG GGAATTGCAG AGTGGGATTT CCGAGATTGT CAAATACGGA TTGATTCGGG ATAAGGAATT CTTTGAGTGG CAGGAGGATC ACATGGAACG AATGATGGCC CGAGACCCAG AAGCTCTGCG TTTCGCCATT ACGCGATCTT GCCAAAATAA GGCCGCGGTG GTCAAGGCGG ACGAGAAGGA AGCCGGCCTT CGCGCCACTC TCAACCTGGG GCATACGTTT GGACACGCTA TTGAAAATCA CTCCGGGTAC GGTACGTGGT TGCATGGTGA AGCCGTTGCA ATTGGTACGG CCATGGCAGC GACAATGAGT GCACGAATGG GATGGATCGA GCCCCAGCTA GTACAGCGAA TATACAAACT TTTGGAACGT GCCAAACTGC CGGTCGAGCT TCCACCGGAT TCTCCAATGG ACCGCGATTC GTTCTTGAAA CTGATGAGTG TGGATAAAAA GGTCGAAAAC GGTAACTTGA GATTAATCCT CCTGAAAGGA GCGCTGGGAA ATTGTGTGTT TACCGGAGAT TTCGACGAGC AAGCCCTTTT GGACACAATT GACGAATTTG TAGCGGAATG CTCCGGAGTG AAAAAATAA
|
Protein sequence | MTRRDVLFQI LWCLGSATVS DAFIPGTVGR FPIVPPVAST AIRGNDLGSN TVHKENFDIV RVDLDDGRDY PIYIGTGYSE EQATEILQSH IKGNKVLIVT NDRIAPMYLE KYANLLKAGD KLSVETLVVP DGEEHKNMEI MQTILDKCLE TSLDRKATLV ALGGGVIGDM VGFAAAIYQR GVNFVQVPTT VMAMVDSSVG GKTGVNHPGG KNMIGAFHQP QCVFVDTETL STLPDRELQS GISEIVKYGL IRDKEFFEWQ EDHMERMMAR DPEALRFAIT RSCQNKAAVV KADEKEAGLR ATLNLGHTFG HAIENHSGYG TWLHGEAVAI GTAMAATMSA RMGWIEPQLV QRIYKLLERA KLPVELPPDS PMDRDSFLKL MSVDKKVENG NLRLILLKGA LGNCVFTGDF DEQALLDTID EFVAECSGVK K
|
| |