Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50100 |
Symbol | iolD |
ID | 7763861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5076837 |
End bp | 5078774 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643807841 |
Product | myo-inositol catabolism protein IolD |
Protein accession | YP_002802075 |
Protein GI | 226947002 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3962] Acetolactate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.443282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACGA TACGACTGAC CATGGCCCAG GCCCTGGTGA AATTCCTCGA CAACCAATAC GTCAGCGTCG ACGGTGTCGA GAGCAAGTTC GTCAAAGGCA TCTTCACCAT TTTCGGCCAC GGCAACGTGC TGGGCCTGGG GCAGGCGCTG GAGCAGGATC CGGGCGAGCT GATCGTCCAC CAGGGCCGCA ACGAGCAGGG CATGGTCCAC GCCGCCATCG GCTTCGCCAA GCAGAAGATG CGCCGGCAGA TCTACGCCTG CACCTCCTCG GTCGGCCCCG GCGCGGCGAA CATGATCACC GCCGCCGCCA CCGCCACCGC CAACCGCATT CCCGTGCTGC TGCTGCCGGG CGACGTCTAC GCCACCCGCC AGCCCGACCC GGTGCTGCAG CAGATCGAGC AGAGCCACGA CCTGTCGATC AGCACCAACG ACGCCTTCCG GGCGGTCAGC AAGTACTGGG ACCGCGTCAG CCGCCCCGAA CAACTGATGA GCGCGGCGAT CAACGCCATG CGCGTGCTCA CCGATCCGGC CGAGACCGGC GCGGTGACCC TGGCGCTGCC GCAGGACGTG CAAGGCGAGG CCTACGACTA CCCGGACTAC TTCTTCGCCA AACGGGTGCA CCGCATCGAC CGCCGCCCGG CCACCGCGGC CATGCTGGCC GACGCCGTGG CGCTGCTCAA GGGCAAGCGC AAGCCGCTGC TGATCTGCGG CGGCGGGGTG AAATACTCCG GCGCCGCCGA GGCGTTGCAG CGTTTCGCCG AGCGTTTCGA GATTCCCTTC GCCGAGACCC AGGCCGGCAA GAGCGCCATC GTCTCCGCCC ACCCGCTGAA CGTCGGCGGC ATCGGCGAGA CCGGCTGCCT GGCGGCTAAC CTGCTGGCCA GGGAGGCCGA CCTGGTGATC GGCGTCGGTA CCCGCTACAC CGACTTCACC ACCGCCTCCA AGTGGATCTT CCAGAACCCC GAAGTGGCCT TCCTCAACCT CAACGTCAGC GCCTTCGACG CCTACAAGCT CGACGCCGTG CAGGTGGTGG CCGACGCCCG GGCCGGCCTG GAGGCGCTCG GCGAAGCCCT CGGCCACGGC GGCTACCGTG CCCAGTGGGG CGAGGCGACG GCGCAGGCCA AGGCCAGGCT GAAGGCGGAA GTCGACCGCG TCTACGCCGT GGAATACAGC GGCGAAGGCT TCGTCCCGGA GATCGACGAC CACCTGCCGC GCAGCGTGCT GGAAGAGTTC ATCGAACTGA CCGGCTCCAG CCTGACCCAG AGCCAGGTGC TCGGCGTGCT CAACCGGACC CTGGCCGACG ACGCCATCAT CGTCGGCGCC TCCGGCAGCC TGCCGGGCGA CCTGCAGCGG ATCTGGCGTT GCAAGGGGAC CGACACCTAC CACATGGAGT ACGGCTACTC CTGCATGGGC TACGAGGTGA ACGCCGCCCT CGGGGTGAAA ATGGCCGAGC CCGAGCGCGA GGTCTACACC CTGGTCGGCG ACGGCTCCTA CATGATGCTG CACTCGGAGC TGCCCACCTC CATCCAGGAG CGCCGCAAGA TCAACATCGT CCTCTTGGAC AACATGACCT TCGGCTGCAT CAACAACCTG CAGATGGAAC ACGGCATGGA CAGCTTCGGC ACCGAGTTCC GCTACCGCAA CCCGGAGACC GGCAAGCTCG ACGGCGGCTT CGTGCCGGTC GACTTCGCCA TGAGCGCCGC GGCCTACGGC TGCAAGACCT ACCGGGTGAA GACCCTCGAC GAGCTGCATG CGGCGCTGGA AGACGCCCGC CGGCAGAGCG TTTCCACCCT CATCGACATC AAGGTGCTGC CCAAGACCAT GATCCACAAG TACCTGTCCT GGTGGCGGGT CGGTGGCGCC CGGGTCTCGA AGAGCGAGCG CATCGCGGCG GTCGCGCGGA TGCTCGAGGA CAACATCGCC AAGGCCCGGC AGTACTGA
|
Protein sequence | MSTIRLTMAQ ALVKFLDNQY VSVDGVESKF VKGIFTIFGH GNVLGLGQAL EQDPGELIVH QGRNEQGMVH AAIGFAKQKM RRQIYACTSS VGPGAANMIT AAATATANRI PVLLLPGDVY ATRQPDPVLQ QIEQSHDLSI STNDAFRAVS KYWDRVSRPE QLMSAAINAM RVLTDPAETG AVTLALPQDV QGEAYDYPDY FFAKRVHRID RRPATAAMLA DAVALLKGKR KPLLICGGGV KYSGAAEALQ RFAERFEIPF AETQAGKSAI VSAHPLNVGG IGETGCLAAN LLAREADLVI GVGTRYTDFT TASKWIFQNP EVAFLNLNVS AFDAYKLDAV QVVADARAGL EALGEALGHG GYRAQWGEAT AQAKARLKAE VDRVYAVEYS GEGFVPEIDD HLPRSVLEEF IELTGSSLTQ SQVLGVLNRT LADDAIIVGA SGSLPGDLQR IWRCKGTDTY HMEYGYSCMG YEVNAALGVK MAEPEREVYT LVGDGSYMML HSELPTSIQE RRKINIVLLD NMTFGCINNL QMEHGMDSFG TEFRYRNPET GKLDGGFVPV DFAMSAAAYG CKTYRVKTLD ELHAALEDAR RQSVSTLIDI KVLPKTMIHK YLSWWRVGGA RVSKSERIAA VARMLEDNIA KARQY
|
| |