Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_09881 |
Symbol | ilvD |
ID | 5730491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 877268 |
End bp | 878938 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641285355 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001550873 |
Protein GI | 159903529 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.457791 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.235349 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCGAT CCAGTGCAAT CACCCAAGGA ATTCAGAGAT CACCAAATAG GGCTATGCTC CGAGCGGTTG GATTTAATGA TAATGATTTT AACAAGCCAA TAATTGGTAT TGCTAATGGG TACAGCACAA TTACACCATG CAATGTTGGA TTAAATAACC TAGCCCTACA CGCAGAAGCC TCAACAAAAG TATCAGGCGC TATGCCACAA ATGTTTGGCA CAATAACTGT TAGCGATGGA ATATCTATGG GGACCGAAGG GATGAAATAT TCACTAGTTT CAAGAGAGGT TATAGCCGAT TCAATAGAGA CAGCATGTAA TGCTCAAAGC ATGGACGGTG TTCTTGCAAT AGGAGGATGT GATAAAAATA TGCCTGGAGC CATGATTGCC ATGGCAAGAA TGAATATACC CGCAATTTTC GTGTATGGAG GGACAATTAA ACCAGGGAAA TTAGACGGTT GTGATTTAAC AGTAGTAAGT GCATTTGAAG CCGTTGGGCA ATTAACAAGT GGAAAGATTA CTGAAGATAA ACTTATTGCT GTAGAAAAGA ATTGTATTCC TGGAGCAGGA AGTTGTGGAG GAATGTTCAC AGCTAATACC ATGTCCGCAG CAATAGAAAC GCTCGGATTA AGCTTGCCTT ATAGCTCAAC AATGGCTGCT GAAGATAAGG AAAAAATAGA GAGTGCTGAG CGTAGTGCCA AAGTCCTGGT AGATGCAATT GAAAAAAATA TACGTCCTTT AGACCTCCTA ACCAAGAAAT CATTTGAAAA TGCTATTGCC GTAGTCATGG CTGTTGGCGG GTCGACAAAT GCCGTATTAC ATTTATTAGC TATTGCTAGG TCATCAGGAG TGGACTTATG CATAGATGAT TTTGAAAGAA TTCGACAGAA GGTACCTGTG ATTTGTGATT TAAAGCCAAG TGGCAAATAT GTCACTGTAG ATCTACATAA AGCTGGTGGA ATACCTCAGG TAATGAAGCT GCTTTTAGAT GCGGGCTTAT TACATGGTGA TTGTTTAACT ATTGAAGGTA TAACTATCGA TGAGTCCTTA AAAAATATCC CTTCAGAGCC ACCAGCGAAC CAAAATGTGA TATCTCCAAT CACAAAGCCA ATATATAAAA AAGGACATCT AGCAATCTTA AAAGGAAATC TCGCAACCGA AGGATGCGTA GCCAAAATAA GTGGGATTAG GACCCCAGTG TTGAAAGGTC CAGCAAAAGT ATTTGAAAGT GAAGAAGACT GTCTCGACGC AATCTTAAAA GAACAGATAC AAGAAGGAGA TGTAATAGTT ATTAGAAACG AAGGCCCTGT AGGAGGACCC GGAATGAGAG AAATGTTAGC ACCAACCTCT GCCATAGTTG GACAAGGCCT AGGGGATAAG GTTGCTCTAA TAACAGATGG ACGTTTTAGC GGTGGTACTT ATGGTTTGGT AGTGGGACAT ATAGCACCAG AAGCATCAGT GGGAGGGAAT ATAGCCCTAG TCAAAGAAGG TGACATGATT ACTGTTGACG CGCACAAAAA GTTGATTCAA TTGGAAGTAG ATGAAAAGGA GCTCTCAAAA AGGAGGACTT TATGGGAAAA ACCAGAAGTT AAATATAAGA CTGGAATACT AGGTAAATAT GCGCGTTTAG TTAGCAGTTC AAGCAAGGGT GCTGTAACTG ATCAACCATG A
|
Protein sequence | MLRSSAITQG IQRSPNRAML RAVGFNDNDF NKPIIGIANG YSTITPCNVG LNNLALHAEA STKVSGAMPQ MFGTITVSDG ISMGTEGMKY SLVSREVIAD SIETACNAQS MDGVLAIGGC DKNMPGAMIA MARMNIPAIF VYGGTIKPGK LDGCDLTVVS AFEAVGQLTS GKITEDKLIA VEKNCIPGAG SCGGMFTANT MSAAIETLGL SLPYSSTMAA EDKEKIESAE RSAKVLVDAI EKNIRPLDLL TKKSFENAIA VVMAVGGSTN AVLHLLAIAR SSGVDLCIDD FERIRQKVPV ICDLKPSGKY VTVDLHKAGG IPQVMKLLLD AGLLHGDCLT IEGITIDESL KNIPSEPPAN QNVISPITKP IYKKGHLAIL KGNLATEGCV AKISGIRTPV LKGPAKVFES EEDCLDAILK EQIQEGDVIV IRNEGPVGGP GMREMLAPTS AIVGQGLGDK VALITDGRFS GGTYGLVVGH IAPEASVGGN IALVKEGDMI TVDAHKKLIQ LEVDEKELSK RRTLWEKPEV KYKTGILGKY ARLVSSSSKG AVTDQP
|
| |