Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_00650 |
Symbol | thrB |
ID | 7759032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 68251 |
End bp | 69201 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643802991 |
Product | homoserine kinase |
Protein accession | YP_002797307 |
Protein GI | 226942234 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR00938] homoserine kinase, Neisseria type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.362782 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGTCT TCACCCCTCT CGAACGCCAC GAGCTGGAAA CCTTCGTCGC GCCCTACGGG CTCGGCCGGC TGCTGGATTT CCAGGGCATT GCCGAAGGCA GCGAGAACAG CAACTTCTTC GTCACCCTGG AGCAGGGCGA GTACGTCCTG ACCCTGGTCG AACGCGGGCA GATGGACGAT CTGCCGTTCT TCATCGAACT GCTCGACGTG CTGCACGCGG CCGACCTGCC GGTGCCCTAC GCCCTGCGCA CCGGCGACGG CCTGGCCCTG CGCAAGCTGG CCGGCAAGCC GGCGCTGTTG CAGCCGCGCC TGTCCGGCAG GCACGTGCAG GAGCCCAATG CGCAGCACTG TCGCGAAGTC GGCACGCTGC TCGCCCGCCT GCATCTGGCG ACCTGCAAGA GCCCGCTGCC GCGACCCAGC GACCGCGGCC TGGAATGGAT GCTGGAGAAG GGCCCGGAAC TCGCCCTGCA ACTGCCGGAC GAGCAACTGC CGCTGCTGCG CGAGGCGCTG GCCGAGGTCG CCCGCCTGAA GAGCCGCCTG CTCGCCCTGC CTACCGCCAA CCTGCACGCC GACCTGTTCC GCGACAATGC GCTGTTCGAC GGCCCGCACC TGACCGGAGT GATCGACTTC TACAACGCCT TCGCCGGCCC CATGCTCTAC GACCTGGCGA TCGCCGTGAA CGACTGGTGC TCGAAGGAAG ACGGCAGCCT CGATCCGCAA CTCACCCAGG CCATGCTCGC CGCCTACTCC GCCAAGCGCC CGTTCCGCGC CGCCGAGGCC GAACTCTGGC CGGCCATGCT GCGCATCGCC TGCCTGCGCT TCTGGCTGTC CCGGCTGATC GCCGCCCAGG CCTTCGCCGG CCAGCAGGTG CTGATCCACG ATCCCGAAGT GTTCCGCCGC CGCCTGCAGG CGCGCCGCCA CGTCCAGGTC GCCCTGCCCT TCGCCCTGTA A
|
Protein sequence | MSVFTPLERH ELETFVAPYG LGRLLDFQGI AEGSENSNFF VTLEQGEYVL TLVERGQMDD LPFFIELLDV LHAADLPVPY ALRTGDGLAL RKLAGKPALL QPRLSGRHVQ EPNAQHCREV GTLLARLHLA TCKSPLPRPS DRGLEWMLEK GPELALQLPD EQLPLLREAL AEVARLKSRL LALPTANLHA DLFRDNALFD GPHLTGVIDF YNAFAGPMLY DLAIAVNDWC SKEDGSLDPQ LTQAMLAAYS AKRPFRAAEA ELWPAMLRIA CLRFWLSRLI AAQAFAGQQV LIHDPEVFRR RLQARRHVQV ALPFAL
|
| |