Gene Avi_9607 details


Gene Information

Locus tag: Avi_9607
Symbol: hisC
ID: 7381947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Agrobacterium vitis S4
Kingdom: Bacteria
Replicon accession: NC_011991
Strand:
Start bp: 75240
End bp: 76346
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 51%
IMG OID: 643653282
Product: histidinol-phosphate aminotransferase
Protein accession: YP_002551453
Protein GI: 222109188
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase
TIGRFAM ID: [TIGR01141] histidinol-phosphate aminotransferase
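
The coordinate and length fields above can be checked against each other with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 75240
end_bp = 76346
gene_length_bp = 1107
protein_length_aa = 368


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming the last codon is an uncounted stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1107
print(expected_protein_length(gene_length_bp))  # 368
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.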


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAATT CATGCCAACT CGTTTTTCCA TCTCACATTG AAAGGTTGCC GCGTTATAGG 
CCAGCAGCAG ATTTAGCTGT AGTCTCCGAG ACCTCAGCTG AACCTTTGGT CAACCTGGCA
TCAAACGAAA ATCCTTATGG TACGAACCCG GGGTTTGCTG ATGCCTTAAA TGAAATCACG
CGTTTTAACC TCGCGGAGTA CCCTGACCCC GATGCGCTCC GTCTGAAGAC CGCTATCGCG
GCGAAAAACC ATGTTTCGAT CGATCAGCTA ATTATCGCGA ACGGCTCCGA TGAACTAATT
GACTTGTCAG CTCGGACGCT GCTGGCGCCA GGAACGAACG CGATCTTCGA TGAGTATTCC
TTTGTAGCCT ATCGTAAGGC GACTTACCTC GCAGGAGCTA CCGGTGTCAG TGTCCGACCT
TCAGGCTGGA ATGCAGATCT CAATGAGATG CTTCGTGTCA TTGACACCAA TACTCGGATG
ATATTTTTAG CAAATCCGAG CAATCCGACT CCAGGTTTTA TTTCAACGGC AGAATTTGAC
AGCTTCATCA GCAGGGTTCC TGCTACCGTC CTTGTAGTGC TCGATGAAGC CTATATCGAT
TTTGTTGAGC CGAACGAGCG GATCGATTGT AAGCTGCTGC TCCAATCAAG AAGCAATGTC
TTTATAACGC GCACCTTCTC CAAGGCATAC GGACTTGCAG GTGTGCGGGT CGGCTATGGC
ATCGGTTCGC CGACACTTAT TAACATGATG AACAGGATCA GGCAGCCCTT CTCCGTTGGC
GTGTTGCCAC AACTGGCCGC GGTAAACGCG CTGGCCAATG AAGGCTTCGT GAACGAAACT
AGAGCAAAAA ACATCGAGCA GAAGGCCAGA CTATCAGAAG GATTGAGCGA CCTTGGAATT
GAGCACGCGG CATCCAAAGG AAATTTCATC ATTGTAAAGT TGCGTGCTCC ATCAGCAGCG
CACGAGGCGC TGCAGGCGAA GCGAATCCTT GTGCGCCGCT TGGCTTCCTA TGGCCTGAGC
GACTGGCTGC GTTTGACAAT TGGCACAGAA TCTCAAAACC GGATTGTACT TGACGCATTT
CGAACCCTGA CACAACAAGC AAACTGA
 
Protein sequence
MSNSCQLVFP SHIERLPRYR PAADLAVVSE TSAEPLVNLA SNENPYGTNP GFADALNEIT 
RFNLAEYPDP DALRLKTAIA AKNHVSIDQL IIANGSDELI DLSARTLLAP GTNAIFDEYS
FVAYRKATYL AGATGVSVRP SGWNADLNEM LRVIDTNTRM IFLANPSNPT PGFISTAEFD
SFISRVPATV LVVLDEAYID FVEPNERIDC KLLLQSRSNV FITRTFSKAY GLAGVRVGYG
IGSPTLINMM NRIRQPFSVG VLPQLAAVNA LANEGFVNET RAKNIEQKAR LSEGLSDLGI
EHAASKGNFI IVKLRAPSAA HEALQAKRIL VRRLASYGLS DWLRLTIGTE SQNRIVLDAF
RTLTQQAN
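
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is a minimal, dependency-free illustration rather than the method used by the source database. It assumes the sequence is read in frame from its first base and that translation stops at the first stop codon. The amino-acid assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (rightmost base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 27 bases of the gene sequence above.
first_block = "ATGAGCAATT CATGCCAACT CGTTTTTCCA"
last_block = "CGAACCCTGA CACAACAAGC AAACTGA"

print(translate(first_block))  # MSNSCQLVFP
print(translate(last_block))   # RTLTQQAN
```

The two printed fragments match the first ten and last eight residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the GC content field to be checked in the same way.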