Gene Information |
Locus tag | Avin_39490 |
Symbol | hom |
ID | 7762838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3998793 |
End bp | 4000112 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643806811 |
Product | homoserine dehydrogenase |
Protein accession | YP_002801063 |
Protein GI | 226945990 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase |
TIGRFAM ID | |
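The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codons in the CDS minus one terminal stop codon (assumed unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Avin_39490.
length_bp = gene_length(3998793, 4000112)
print(length_bp)                           # 1320
print(expected_protein_length(length_bp))  # 439
```

The strand does not affect this calculation, since the length depends only on the span of the coordinates on the replicon.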
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.126248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAGTT TCAGCGTGAA ACCGGTCAAA GTAGGCATCT GTGGACTGGG TACCGTCGGC GGCGGCACCT TCAATGTGCT CAAGCGCAAT GCCGAGGAGA TCGCCCGCCG CGCCGGGCGT GACATCGAGA TCGCCCAGAT CGCCATGCGC AGCTCCAACC CGAAATGCGA TATCGCCGGC ACCCCCATCG CCGCCGATGT CTTCGAACTG GTCGACAATC CGGAGGTCGA GGTCGTCGTC GAACTGATCG GCGGTTGCAG TCTGGCCCGC GAGTTGGTGC TCAAGGCCAT CGACAACGGC AAGCACGTGG TCACCGCGAA CAAGGCGCTG ATCGCCGTGC ATGGCGACGA ACTCTTCGCC AAGGCCCGCG AGAAGGGCGT GATCGTCGCC TTCGAGGCGG CGGTGGCCGG CGGCATTCCA GTGATCAAGG CGATCCGCGA GGGTCTGGCC GGCAACCGCA TCAACTGGGT GGCCGGCATC ATCAACGGTA CCGGCAACTT CATTCTCAGC GAAATGCGCG AGAAGGGCCG GGCCTTCGCC GACGTGCTCA AGGAGGCCCA GGCGCTGGGC TACGCCGAGG CCGATCCGAC CTTCGACGTG GAGGGCATCG ATGCCGCCCA CAAGCTGACC ATCCTCGCCT CCATCGCCTT CGGCATTCCG CTGCAATTCG ACAAGGCCTA CACCGAAGGC ATTTCCAGGC TCACCAGCGC CGACGTCGGC TACGCCGAGG CGCTGGGTTA CCGCATCAAG CACCTGGGCA TCGCACGGCG CACCGCCGAA GGCTTCGAGC TGCGCGTGCA CCCGACGCTG ATCCCGGCCG ACCGCCTGAT CGCCAACGTC AACGGCGTGA TGAACGCGGT GATGGTCAAC GGCGATGCCG TGGGCTCGAC CCTGTTCTGC GGCGCCGGCG CCGGTATGGA GCCGACCGCC TCGGCGGTGG TGGCCGATCT GGTGGACGTG GTCCGCGCCA TGACTTCCGA CCCGGAGAAC CGCGTGCCGC ACCTGGCCTT CCAGCCCGAT GCGCTGTCCG CCCACCCGAT TCTGCCGATT TCGTCCTGCG AGAGCGCCTA CTACCTGCGC ATCCAGGCCA AGGACCATCC GGGCGTGCTG GCCCAGGTGG CGACCATCCT TTCCGAGCGT GGCATCAACA TCGAGTCGAT CATGCAAAAG GAGGCCGAGG AGCACGACGG CCTGGTTCCC ATGATCCTGG TCACCCACCG GGTCCGGGAG CGCTGTATCG ACGAAGCCAT CGCCGCCATG GAGGCGCTCG AGGGCGTGGT CGGCAAGGTC GTCCGCATCC GCGTCGAACA GCTCAACTAA
Protein sequence | MGSFSVKPVK VGICGLGTVG GGTFNVLKRN AEEIARRAGR DIEIAQIAMR SSNPKCDIAG TPIAADVFEL VDNPEVEVVV ELIGGCSLAR ELVLKAIDNG KHVVTANKAL IAVHGDELFA KAREKGVIVA FEAAVAGGIP VIKAIREGLA GNRINWVAGI INGTGNFILS EMREKGRAFA DVLKEAQALG YAEADPTFDV EGIDAAHKLT ILASIAFGIP LQFDKAYTEG ISRLTSADVG YAEALGYRIK HLGIARRTAE GFELRVHPTL IPADRLIANV NGVMNAVMVN GDAVGSTLFC GAGAGMEPTA SAVVADLVDV VRAMTSDPEN RVPHLAFQPD ALSAHPILPI SSCESAYYLR IQAKDHPGVL AQVATILSER GINIESIMQK EAEEHDGLVP MILVTHRVRE RCIDEAIAAM EALEGVVGKV VRIRVEQLN
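The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes the gene sequence is shown on the coding strand (it begins with ATG and ends with TAA) and uses the codon assignments of translation table 11, which match the standard table for amino acids and differ only in the permitted start codons. The helper names `translate` and `gc_content` are hypothetical, and only the first 30 bases of the record are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this record starts with ATG, so that is not needed.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence from the record.
fragment = "ATGGGGAGTT TCAGCGTGAA ACCGGTCAAA"
print(translate(fragment))  # MGSFSVKPVK
```

Passing the full gene sequence to `translate` and `gc_content` would allow the protein sequence and the GC content field to be compared against the record.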