Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_23220 |
Symbol | prpC |
ID | 7761238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2317033 |
End bp | 2318196 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643805204 |
Product | methylcitrate synthase |
Protein accession | YP_002799485 |
Protein GI | 226944412 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01800] 2-methylcitrate synthase/citrate synthase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCTA CCGCAGAAAC CACCACTGTT GGCTTCAAGC CGAAGAAGTC AGTGGCCCTG AGTGGCACCG CCGCCGGCAA TACCGCTCTG TGCACCGTGG GTCGCACCGG CAATGACCTG CACTATCGCG GCTATGACAT TCTCGACTTC GCCACCACCT GCGAATTCGA GGAGGTCGCC CATCTGCTGG TGCACGGCAA GCTGCCCAAT GTCGCCGAAC TGAACGGCTA CAAGGCCAAG CTCAAGTCCC TGCGCGGCCT GCCGGCCGGC GTGAAGGCCG CCCTCGAGCA GTTGCCGCCG TCCTCTCACC CGATGGACGT GATGCGCACC GGCGTGTCCG TGCTCGGCTG CCTGTCGCCC GAGAAGGAAG ACCACAACTA CCCCGGCGCC CGCGACATCG CCGACAAGCT GATGGCGTCC CTGGGCTCCA TGCTGCTGTA CTGGTACCAC TTCAGCCACA ACGGCAAGCG CATCGACGTG GAAACCGACG ACGACTCCAT CGGCGGCCAC TTCCTGCACC TGCTGCACGG CAAGAAGCCG AGCGACAGTT GGGTGCGCGC CATGCACACC TCGCTGATCC TCTACGCCGA GCACGAGTTC AACGCCTCCA CTTTCACCTC CCGGGTGATC TCCGGCACCG GCTCCGACAT GTTCTCCTGC ATCACCGGCG CCATCGGCGC GCTGCGCGGG CCGAAGCACG GCGGCGCCAA CGAGGTGGCC TTCGAGATCC AGAAACGTTA CGACACCCCG GACGAGGCCG AGGCCGATAT CCGTGCCCGC GTCGAGCGCA AGGAAGTCGT GATCGGCTTC GGCCACCCGG TCTACACCGT TGGCGATCCG CGCAACAAGG TGATCAAGGA AGTCGCCCGC GAGCTCAGCG TCGAACAGGG CAACACTAAG ATGTACGACA TCGCCGAGCG CCTCGAAAGC GTGATGTGGG AAATCAAGAA GATGTTCCCC AACCTGGATT GGTTCAGCGC CGTCAGCTAC CACATGATGG GCATTCCCAC CGCCATGTTC ACCCCGGTGT TCGTGATCGC CCGCACCTCC GGCTGGGCCG CCCACGCCAT CGAGCAGCGC ATCGACGGCA AGATCATCCG GCCGAGCGCC AACTACGTCG GTCCGGAAAA CCTGAAGTTC GTTCCGCTCA GGGAGCGCAA GTAA
|
Protein sequence | MSATAETTTV GFKPKKSVAL SGTAAGNTAL CTVGRTGNDL HYRGYDILDF ATTCEFEEVA HLLVHGKLPN VAELNGYKAK LKSLRGLPAG VKAALEQLPP SSHPMDVMRT GVSVLGCLSP EKEDHNYPGA RDIADKLMAS LGSMLLYWYH FSHNGKRIDV ETDDDSIGGH FLHLLHGKKP SDSWVRAMHT SLILYAEHEF NASTFTSRVI SGTGSDMFSC ITGAIGALRG PKHGGANEVA FEIQKRYDTP DEAEADIRAR VERKEVVIGF GHPVYTVGDP RNKVIKEVAR ELSVEQGNTK MYDIAERLES VMWEIKKMFP NLDWFSAVSY HMMGIPTAMF TPVFVIARTS GWAAHAIEQR IDGKIIRPSA NYVGPENLKF VPLRERK
|
| |