Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_09920 |
Symbol | eutB |
ID | 7759937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 942629 |
End bp | 944023 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643803897 |
Product | Ethanolamine ammonia lyase, large subunit |
Protein accession | YP_002798199 |
Protein GI | 226943126 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4303] Ethanolamine ammonia-lyase, large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.430518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGTT TTGCCCATGC CGTCGGCGGC CAGACCTGGC GCTTCGACAG TCTTCGCGAA CTGCTGGCCA AGGCCAGTCC GGCGCGCTCC GGCGATTATC TGGCCGGAGT GGCGGCACAG AGCGACGCCG AGCGGGTGGC GGCGCAGATG GCGCTGGCCG AGGTGCCGCT CGGGCACTTC CTCAACGAGG CGGTGATCCC CTACGAGGCG GACGAGGTCA CCCGGCTGAT CCTCGACAGC CACGACGCCG TCGCCTTCGC CCCGGTCGCC CATCTCACCG TCGGCGGCCT GCGCGACTGG CTGCTCGGCC CCGAAGCCGA CGAGACGACG CTGGCCGCCC TGGCGCCGGG GCTGACCCCG GAGATGGCCG CCGCGGTATC CAAGCTGATG CGCGTCCAGG ACCTGGTGCT GGTGGCGCAG AAGATCCGCG TGCAGACACG CTTTCGCAAC AGCCTCGGGC TGCGCGGACG GCTCTCCACC CGCCTGCAGC CCAACCATCC CACCGACGAT CCGGCGGGCA TCGCCGCCAG CGTCCTCGAC GGGCTGCTCT ACGGCAACGG CGATGCGGTG ATCGGCGTCA ACCCGGCCAG CGACAGCCTG GAGGCGATCG GCGAACTCTT GAAGATGCTC GACGCGGTGA TCCAGCGCTA CCAGATCCCC ACCCAGGCCT GCGTGCTCAC CCATGTCACC TCGTCGATCG CCGCCATCGA GCGCGGCCTG CCGGTCGACC TGGTGTTCCA GTCCATCGCC GGCACCCAGG CGGCCAACGC CAGCTTCGGC ATCGACCTGG AACTGCTGCG CGAGGGCTAC GAAGCCGGCC TCGGCCTGAA GCGCGGCAGC CTCGGCGACA ACCTCATGTA CTTCGAGACC GGCCAGGGCA GCGCGCTGTC GGCCAACGCC CACCACGGCG TCGACCAGCA GACCTGCGAG GCGCGCGCCT ACGGCGTGGC GCGGCGCTTC CGGCCACTTT TGGTGAACAC CGTGGTCGGC TTCATCGGCC CGGAATACCT CTACAACGGC AAGCAGATCG TCCGCGCCGG TCTCGAGGAC CACTTCTGCG GCAAGCTGCT CGGCCTGCCG ATGGGCTGCG ACATCTGCTA CACCAACCAT GCCGAGGCCG ACCAGGACGA CATGGACATG CTGCTGACCC TGCTCGGCGT GGCCGGGGTC AACTTCATCA TGGGCGTGCC GGGTTCCGAC GACGTGATGC TCAACTACCA GAGCACCTCC TTCCACGACG CCCTCTACGC CCGGCAGACC CTCGGCCTGC GCGCCGCGCC GGAGTTCGAG GAATGGCTCG CGCGGATGGG CATCCTGCGC CAGGACGGCG GCCGGCTGGT CCTCGGCGAC GAGTTGCCGG CGGCCTTCCG CCCGGCGCTG GCGCGCTTGT CCTGA
|
Protein sequence | MSGFAHAVGG QTWRFDSLRE LLAKASPARS GDYLAGVAAQ SDAERVAAQM ALAEVPLGHF LNEAVIPYEA DEVTRLILDS HDAVAFAPVA HLTVGGLRDW LLGPEADETT LAALAPGLTP EMAAAVSKLM RVQDLVLVAQ KIRVQTRFRN SLGLRGRLST RLQPNHPTDD PAGIAASVLD GLLYGNGDAV IGVNPASDSL EAIGELLKML DAVIQRYQIP TQACVLTHVT SSIAAIERGL PVDLVFQSIA GTQAANASFG IDLELLREGY EAGLGLKRGS LGDNLMYFET GQGSALSANA HHGVDQQTCE ARAYGVARRF RPLLVNTVVG FIGPEYLYNG KQIVRAGLED HFCGKLLGLP MGCDICYTNH AEADQDDMDM LLTLLGVAGV NFIMGVPGSD DVMLNYQSTS FHDALYARQT LGLRAAPEFE EWLARMGILR QDGGRLVLGD ELPAAFRPAL ARLS
|
| |