Gene Avin_12940 details


Gene Information

Locus tag: Avin_12940
Symbol: hisC
ID: 7760236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1257673
End bp: 1258728
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 61%
IMG OID: 643804196
Product: histidinol-phosphate aminotransferase
Protein accession: YP_002798495
Protein GI: 226943422
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
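
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1257673
end_bp = 1258728
gene_length_bp = 1056
protein_length_aa = 351

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which is not translated into an amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1056 bp is 352 codons, of which 351 encode amino acids and one is the stop codon.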


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCAAAT TCTGGAGTCC CTTCGTCAAG AACCTGGTGC CCTATGTGCC AGGCGAACAG 
CCGAAGCTGA GCAAGCTGGT CAAGCTCAAT ACCAACGAAA ATCCCTACGG GCCGTCGCCG
CGGGCGATCG CCGCCATGCA GGCCGAGCTG AACGATGGCC TGCGTCTGTA TCCCGATCCC
AACGGCGAGC GCCTGAAGCA GGCGATCGCC GACTACTATG GAGTGCGGAG CAGTCAGGTG
TTCGTCGGCA ATGGCTCCGA CGAGGTGCTG GCGCACATCT TCCATGGGCT GTTCCAGCAT
GGCCGGCCGC TATTGTTCCC GGATGTGACC TACAGTTTCT ATCCGGTGTA CTGTGGTCTC
TACGCAATTC CCTTCGAGAC CGTTGCGCTG GACGAGGAGT TTCGCATTCG CGTCGAAGAC
TTTGTGCGTC CCAACGGAGG CATCATCTTC CCCAATCCCA ATGCTCCGAC AGGCTGCCTG
CTGCCGCTCG AGTCCATCGA GCGGCTGCTC GAGAGCAATC CGGACTCGGT GGTGGTGGTG
GATGAGGCCT ATGTGGATTT CGGTGGCGAG ACGGCGATCG GTCTCGTTGA CCGGCATGCC
AATCTCCTGG TGACCCAGAC CCTGTCCAAG TCGCGTTCCC TGGCCGGGCT GAGGGTCGGT
CTGGCGGTCG GTCACCCGGA ACTGATCGAG GCGCTGGAGC GGATCAAGAA CAGCTTCAAT
TCCTATCCGC TGGATCGTAT GGCCATAGCT GGAGCGGCGG CGGCCTTCGA GGATCGTGCC
TACTTCGAGC AGACCTGCCG GCAGGTGATC GACAGCCGCG AACGCCTGGT CGGTGAGTTG
CAGCGTCTGG GCTTCGAAGT ATTGCCATCG GCAGCCAACT TCGTCTTCGC TCGCCACCCT
GTTCACGATG CCGAACGATT GGCGGCAGGG CTGCGCGAGC AGGGGGTGAT AGTTCGCCAT
TTCAAGCAGG AACGGATTCG TCAGTTCTTG CGCATCACCG TCGGAGCCCC GGAACAGAAC
CGGGCGCTGA CCGATGTGCT GGCGGCCCTT TGCTGA
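
The gene sequence above is laid out in space-separated groups of ten bases, so it needs to be joined before any computation such as the GC content reported in the record. The following is a minimal Python sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool associated with this record. For brevity it is applied to the first line of the sequence only, so the printed value describes that 60-base fragment and not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence block by removing all whitespace and upper-casing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above (a fragment, not the full gene).
fragment = clean_sequence(
    "GTGAGCAAAT TCTGGAGTCC CTTCGTCAAG AACCTGGTGC CCTATGTGCC AGGCGAACAG"
)

print(len(fragment))                   # 60
print(f"{gc_fraction(fragment):.0%}")  # 55%
```

To compare against the GC content field of the record, pass the complete gene sequence block to `clean_sequence` instead of the single line used here.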
 
Protein sequence
MSKFWSPFVK NLVPYVPGEQ PKLSKLVKLN TNENPYGPSP RAIAAMQAEL NDGLRLYPDP 
NGERLKQAIA DYYGVRSSQV FVGNGSDEVL AHIFHGLFQH GRPLLFPDVT YSFYPVYCGL
YAIPFETVAL DEEFRIRVED FVRPNGGIIF PNPNAPTGCL LPLESIERLL ESNPDSVVVV
DEAYVDFGGE TAIGLVDRHA NLLVTQTLSK SRSLAGLRVG LAVGHPELIE ALERIKNSFN
SYPLDRMAIA GAAAAFEDRA YFEQTCRQVI DSRERLVGEL QRLGFEVLPS AANFVFARHP
VHDAERLAAG LREQGVIVRH FKQERIRQFL RITVGAPEQN RALTDVLAAL C
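
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last lines of the gene sequence. The following is a minimal Python sketch under stated assumptions, not the method used to produce this record. `clean_sequence` and `translate_cds` are hypothetical helpers. The codon map is the standard one, which translation table 11 shares for elongation codons. As a simplification, the first codon of a gene is always emitted as M when `is_start` is true; this covers the GTG start codon seen here but does not validate the codon against the set of start codons permitted by table 11.

```python
from itertools import product

# Codon-to-amino-acid map built from the conventional TCAG ordering.
# "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Join a sequence block by removing all whitespace and upper-casing."""
    return "".join(block.split()).upper()


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate an in-frame coding fragment up to the first stop codon.

    Simplification (assumption): when is_start is true, the first codon
    is emitted as M regardless of which codon it is.
    """
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        if i == 0 and is_start:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; both begin in frame.
first_line = "GTGAGCAAAT TCTGGAGTCC CTTCGTCAAG AACCTGGTGC CCTATGTGCC AGGCGAACAG"
last_line = "CGGGCGCTGA CCGATGTGCT GGCGGCCCTT TGCTGA"

print(translate_cds(clean_sequence(first_line)))                  # MSKFWSPFVKNLVPYVPGEQ
print(translate_cds(clean_sequence(last_line), is_start=False))   # RALTDVLAALC
```

The two outputs match the beginning and end of the protein sequence above. The leading GTG is read as M because it is the initiation codon, while the internal GTG codons are read as V.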