Gene Avin_00650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_00650 
SymbolthrB 
ID7759032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp68251 
End bp69201 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content69% 
IMG OID643802991 
Producthomoserine kinase 
Protein accessionYP_002797307 
Protein GI226942234 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR00938] homoserine kinase, Neisseria type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.362782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGTCT TCACCCCTCT CGAACGCCAC GAGCTGGAAA CCTTCGTCGC GCCCTACGGG 
CTCGGCCGGC TGCTGGATTT CCAGGGCATT GCCGAAGGCA GCGAGAACAG CAACTTCTTC
GTCACCCTGG AGCAGGGCGA GTACGTCCTG ACCCTGGTCG AACGCGGGCA GATGGACGAT
CTGCCGTTCT TCATCGAACT GCTCGACGTG CTGCACGCGG CCGACCTGCC GGTGCCCTAC
GCCCTGCGCA CCGGCGACGG CCTGGCCCTG CGCAAGCTGG CCGGCAAGCC GGCGCTGTTG
CAGCCGCGCC TGTCCGGCAG GCACGTGCAG GAGCCCAATG CGCAGCACTG TCGCGAAGTC
GGCACGCTGC TCGCCCGCCT GCATCTGGCG ACCTGCAAGA GCCCGCTGCC GCGACCCAGC
GACCGCGGCC TGGAATGGAT GCTGGAGAAG GGCCCGGAAC TCGCCCTGCA ACTGCCGGAC
GAGCAACTGC CGCTGCTGCG CGAGGCGCTG GCCGAGGTCG CCCGCCTGAA GAGCCGCCTG
CTCGCCCTGC CTACCGCCAA CCTGCACGCC GACCTGTTCC GCGACAATGC GCTGTTCGAC
GGCCCGCACC TGACCGGAGT GATCGACTTC TACAACGCCT TCGCCGGCCC CATGCTCTAC
GACCTGGCGA TCGCCGTGAA CGACTGGTGC TCGAAGGAAG ACGGCAGCCT CGATCCGCAA
CTCACCCAGG CCATGCTCGC CGCCTACTCC GCCAAGCGCC CGTTCCGCGC CGCCGAGGCC
GAACTCTGGC CGGCCATGCT GCGCATCGCC TGCCTGCGCT TCTGGCTGTC CCGGCTGATC
GCCGCCCAGG CCTTCGCCGG CCAGCAGGTG CTGATCCACG ATCCCGAAGT GTTCCGCCGC
CGCCTGCAGG CGCGCCGCCA CGTCCAGGTC GCCCTGCCCT TCGCCCTGTA A
 
Protein sequence
MSVFTPLERH ELETFVAPYG LGRLLDFQGI AEGSENSNFF VTLEQGEYVL TLVERGQMDD 
LPFFIELLDV LHAADLPVPY ALRTGDGLAL RKLAGKPALL QPRLSGRHVQ EPNAQHCREV
GTLLARLHLA TCKSPLPRPS DRGLEWMLEK GPELALQLPD EQLPLLREAL AEVARLKSRL
LALPTANLHA DLFRDNALFD GPHLTGVIDF YNAFAGPMLY DLAIAVNDWC SKEDGSLDPQ
LTQAMLAAYS AKRPFRAAEA ELWPAMLRIA CLRFWLSRLI AAQAFAGQQV LIHDPEVFRR
RLQARRHVQV ALPFAL