Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_08480 |
Symbol | |
ID | 7759802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 804331 |
End bp | 806127 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643803766 |
Product | thiamine pyrophosphate protein |
Protein accession | YP_002798068 |
Protein GI | 226942995 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00807055 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACGA CAGTCGGTGA CTTTCTGGTC GAGCGCCTGT ACCAGTGGGG TGTGCGACGC ATCTACGGCT ATCCTGGCGA CGGCATCAAC GGGGTGTTCG GCGCCCTGAG CCGTGCCCGG GAGAAGATCC GCTTCATCCA GGTCCGCCAC GAGGAAATGG CCGCCTTCAT GGCTTCGGCG GAGGCCAAGT TCGGCGGCGG CCTGGGCGTC TGCATCGCCA CCTCGGGACC CGGCGCCTCG CACCTGATCA CGGGGCTCTA CGATGCCCGC CTGGACCACA TGCCGGTGCT GGCGATCGCC GGCCAGCAGG CGCGCACCGC CATGGGCGGG CACTATCAGC AGGAGGTGGA CCTGGTGTCG ATGCTCAAGG ACGTGGCGGG AGCCTTCGTC CAGCAGGCCA GTGCGCCGGA GCAGGTGCGC CATCTGGTCG ACCGGGCGAT CCGCACCGCG CTGGGCGAGC GCCGGGTGAC CGCCATTGTT TTGCCCAACG ACCTGCAGGA GATGGACTAC AGCGAGCCGC CGCACGCGCA CGGTACCCTG CACTCCGGCA TCGGCTACAG CCGCCCCAGG AAGCTGCCCT ACGACGAGGA CCTGCAGCGC GCCGCCGAAG TGCTCAATGC CGGCAGCAAG GTCGCCATCC TGGTCGGCGC CGGTGCCCTG GGGGCCAGCG ACGAGGTCAT CCAGGTCGCC GAAAGGCTCG GCGCCGGCGT GGCCAAGGCG CTGCTCGGCA AGGCGGCGCT GCCCGACGAA CTGCCCTGGG TCACCGGCTC CATCGGCCTG CTCGGCACCG AGCCGAGCTA CAAGCTGATG AGCGAATGCG ACACGCTGCT GATGATCGGC TCCGGCTTTC CCTATTCCGA ATTCCTGCCC AGGGAGGGCC AGGCGCGTGG CGTGCAGATC GACCTCAAGG CCGACATGCT CGGCCTGCGC TACCCGATGG AGGTCAACCT GGAGGGCGAC GCCGCCGAAA CCCTGCGCGC CCTGCTGCCG CTGCTGCTGG AAAAGGAGGA TCGCCGCTGG CGCGCCGACA TCGAGGAGTG GCGCGGCGAC TGGGAGAAGA AGCTCGAGCG CCGCGCCCTG GCCTCGGCCA AGCCGATCAA CCCGCAGCGG GTGGTCTTCG AACTGTCGCC GCGCCTGCCC GAGCGGGCCG TCGTCACCTG CGACTCCGGC TCCTGCGCCA ACTGGTTCGC CCGCGACCTG AAAATCCGCC GTGGCATGAT GTGCTCGCTG TCCGGCGGTC TGGCCTCCAT GGGCGCCGCC GTGCCCTATG CCATCGCCGC CAAGTTCGTC CATCCGGAGC GCGCGGTGGT GGCCCTGGTC GGCGACGGCG CCATGCAGAT GAACAACATG GCCGAACTGA TCACCGTCGC CAAGTATTGG CGGGAATGGC GGGACCCGCG CTGGATCTGC TGCGTATTCA ACAACGAGGA CCTCAACCAG GTCAGCTGGG AGCAGCGGGT CATGGCGGGC GACCCGAAAT TCGAGGCTTC GCAGGACATT CCCGACGTGC CCTATCACCG TTTCGCCGAA TCCATCGGCC TCAGGGGCAT TTACGTCGAC CGCGAGGATC TCGTCGCCGC CGCCTGGGAG GAAGCCCTGG CCGCCGACCG GCCGGTGCTG ATCGAGTTCA AGACCGACCC CGACGTGCCG CCGCTGCCGC CGCACATCAC GCTCGAGCAA GCCGGGAAAT TCGCCGGCAC CTTGCTGTGG GGCGACCCGA ACCAGGCCGG CATCATCGCC CAGACTGCCA AGCAGGTGCT CGGCAGCGTC CTGCCCGGCC GTCACCGCGA TGGCTGA
|
Protein sequence | MATTVGDFLV ERLYQWGVRR IYGYPGDGIN GVFGALSRAR EKIRFIQVRH EEMAAFMASA EAKFGGGLGV CIATSGPGAS HLITGLYDAR LDHMPVLAIA GQQARTAMGG HYQQEVDLVS MLKDVAGAFV QQASAPEQVR HLVDRAIRTA LGERRVTAIV LPNDLQEMDY SEPPHAHGTL HSGIGYSRPR KLPYDEDLQR AAEVLNAGSK VAILVGAGAL GASDEVIQVA ERLGAGVAKA LLGKAALPDE LPWVTGSIGL LGTEPSYKLM SECDTLLMIG SGFPYSEFLP REGQARGVQI DLKADMLGLR YPMEVNLEGD AAETLRALLP LLLEKEDRRW RADIEEWRGD WEKKLERRAL ASAKPINPQR VVFELSPRLP ERAVVTCDSG SCANWFARDL KIRRGMMCSL SGGLASMGAA VPYAIAAKFV HPERAVVALV GDGAMQMNNM AELITVAKYW REWRDPRWIC CVFNNEDLNQ VSWEQRVMAG DPKFEASQDI PDVPYHRFAE SIGLRGIYVD REDLVAAAWE EALAADRPVL IEFKTDPDVP PLPPHITLEQ AGKFAGTLLW GDPNQAGIIA QTAKQVLGSV LPGRHRDG
|
| |