Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50040 |
Symbol | iolC |
ID | 7763855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5068941 |
End bp | 5070884 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643807835 |
Product | myo-inositol catabolism protein IolC |
Protein accession | YP_002802069 |
Protein GI | 226946996 |
COG category | [S] Function unknown |
COG ID | [COG3892] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAAA TCACCTTCGC AAGCGGACGT CAGTTGGACG TCATCTGTCT GGGGCGCCTC GGCGTCGACC TGTACGCCCA GCAGATCGGC GCACGGCTCG AGGACGTCGG CAGCTTCGCC AAATACCTGG GCGGCTCGTC CGCCAACATC GCATTCGGTA CCGCCCGCCT GGGTCTCAAG TCAGCCATGC TGACCCGCGT GGGCGACGAC CACATGGGCC GCTTCCTGAT CGAGGCGCTG GAGCGCGAGG GCTGCGACAC CCGGGCGATC AAGGTCGACC CGGAACGCCT GACCGCGATG GTCCTGCTGG GCATCAAGGA CCGCGATACC TTCCCGCTGA TCTTCTACCG CGAGAACTGC GCCGACATGG CGCTGCGCGA GGAGGACATC GACGAAGCCT TCATCGCCTC CAGCAAGGCG CTGTTGATCA CCGGCACCCA CTTCTCCACC GAAAGGGTCT ACAAGGCCAG CAGCAAGGCG CTGGACTACG CCGAAAAGCA CAACGTCAAG CGCGTGCTGG ACATCGATTA CCGGCCGGTG CTCTGGGGCC TGACCGGCAA GGCCGACGGC GAGACCCGCT TCATCGCCAG CGCCGAGGTC AGCGCGCACG TGCAGCGCAT CCTGCCGCGC TTCGACCTGG TGGTCGGCAC CGAGGAGGAA TTCCTCATCG CCGGCGGTTC CGAGGATCTG CTCAGCGCCC TGCGCAAGGT GCGCGAGGTG ACCGCCGCGA CCCTGGTGGT CAAGCTCGGT CCGCTGGGCT GCACGGTCAT CCACGGCGCC ATTCCGGCGC GCCTGGAGGA CGGCAACATC TATAAAGGCA TCCGTGTCGA GGTGATGAAC GTGCTGGGCG CCGGCGACGC CTTCATGTCC GGCTTCCTGC GCGGCTGGCT GACCGGCGGT GACGACGAGC GCTGCAGCCG TCTGGCCAAC GCCTGCGGCG GCCTGGTGGT ATCGCGCCAC GCCTGCGCCC CGGCGATGCC GACCCCGGCC GAACTCGACT ACATCCTCAA CAGCCCGGTA CCCATCACCC GCCCGGACCT CGACCCGCAC CTGAACCGCC TGCACCGGGT CAGCGTACCG CGCAAGAACT GGAAGCCGCT GTTCATCTTC GCCTTCGACC ATCGCGGTCA ACTGGTGGAA CTGGCCCAGC AGGCCGGACG CGACCTCGCG GCGATTCCCG AACTCAAGCA ACTGTTCATC ACTGCCATCG AACGGGTCGA GGCCGATCTC CAGCGCCAGG GCATCGAAGG CGACGTGGGT CTGCTGGCCG ACCAGCGCTT CGGCCAGGAC GCCCTCAACA GCGCCACCGG CCGCGGCTGG TGGATCGGCC GCCCGGTCGA GCTGCAAGGC TCGCGGCCGC TGGCCTTCGA GCATGGCCGC TCGATCGGCA GCAATCTGGT GCAGTGGCCG CGCGAGCACA TCATCAAGTG CCTGGTGCAG TTCCACCCGG ACGACGAGCC CCTGCTGCGC CTGGAACAGG AGGGCCAGCT CAAGGGCCTC TACGAGGCGG CCCAGGCCAG CGGCCACGAA CTGCTGCTCG AGGTGATCCC ACCGAAGAAC CATCCCTCCA CGCATCCGGA CGTGCTCTAC CGGGCGATCA AGCGGCTCTA CAACATCGGC ATCCATCCGG ACTGGTGGAA GATCGAGCCG CAGCCGGCCG AGGTGTACTC GAAACTCGAT GCGCTGATCA CCGAACGCGA TCCCTACTGC CACGGCGTGG TCCTGCTCGG CCTCAATGCG CCGGCCGAGG AACTCGCCGA AGGCTTCCGC CAGGCCGCCG GCAGCCAGGT CTGCCGCGGC TTCGCGGTCG GCCGGACGAT CTTCCAGGAA CCCAGCCGCG CCTGGCTGGC CGGCGAGATC GACGACGAGA CCCTGATCGC GCGGGTGCGG GCCACCTTCG AGTTCCTGAT CAAGTCCTGG CGCGAGGCGC GCGGCCAGGT CTGA
|
Protein sequence | MGEITFASGR QLDVICLGRL GVDLYAQQIG ARLEDVGSFA KYLGGSSANI AFGTARLGLK SAMLTRVGDD HMGRFLIEAL EREGCDTRAI KVDPERLTAM VLLGIKDRDT FPLIFYRENC ADMALREEDI DEAFIASSKA LLITGTHFST ERVYKASSKA LDYAEKHNVK RVLDIDYRPV LWGLTGKADG ETRFIASAEV SAHVQRILPR FDLVVGTEEE FLIAGGSEDL LSALRKVREV TAATLVVKLG PLGCTVIHGA IPARLEDGNI YKGIRVEVMN VLGAGDAFMS GFLRGWLTGG DDERCSRLAN ACGGLVVSRH ACAPAMPTPA ELDYILNSPV PITRPDLDPH LNRLHRVSVP RKNWKPLFIF AFDHRGQLVE LAQQAGRDLA AIPELKQLFI TAIERVEADL QRQGIEGDVG LLADQRFGQD ALNSATGRGW WIGRPVELQG SRPLAFEHGR SIGSNLVQWP REHIIKCLVQ FHPDDEPLLR LEQEGQLKGL YEAAQASGHE LLLEVIPPKN HPSTHPDVLY RAIKRLYNIG IHPDWWKIEP QPAEVYSKLD ALITERDPYC HGVVLLGLNA PAEELAEGFR QAAGSQVCRG FAVGRTIFQE PSRAWLAGEI DDETLIARVR ATFEFLIKSW REARGQV
|
| |