Gene Rpic12D_3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpic12D_3021 
Symbol 
ID8020704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia pickettii 12D 
KingdomBacteria 
Replicon accessionNC_012856 
Strand
Start bp3190549 
End bp3191616 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content65% 
IMG OID644831818 
Producthistidinol-phosphate aminotransferase 
Protein accessionYP_002982962 
Protein GI241664602 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGAT ATTGGAGCGA TATCGTTCAG CAACTTGTGC CCTATGTCCC GGGCGAACAG 
CCGGCCATCG CGCGGCCCAT CAAGCTGAAC ACCAACGAGA ACCCGTACCC GCCGTCGCCG
CGCGTGGTGG CCGCGATCGC TGCAGAGCTG GGAGAGACGG GCGACAGCCT GCGCCGTTAT
CCCGATCCGC TCTCGCGCCG GCTGCGCGAG ACGGTCGCCG CGCAGGTCGG CCTCAAGCCG
GAACAGGTTT TTGCCGGCAA CGGGTCGGAC GAAGTGCTGG CGCACGTCTT TCAGGCGCTG
CTCAAGCACG ACAAGCCGCT GCGCTTTCCC GACATCTCGT ACAGCTTCTA TCCGACCTAC
GCGCGGCTGT ACGGCGTGCA GCACGAAGTG GTGCCGCTGG CAGACGATTT CTCCATCCGC
GCCGATGACT ATCTTGGCGA CGCAGGCGGC GTACTGTTCC CGAACCCGAA CGCTCCGACC
GGCCACGCGC TGCCGATGGC AGAGGTCGCG CGGATCGTCG CCGCGAATCC GTCGTCGGTG
GTCGTGGTGG ATGAGGCCTA TGTGGATTTC GGCGCCGAGT CCGCCATTGC GCTCATCGAC
CGCTATCCCA ACCTGCTGGT TGTGCACACG ACGTCGAAGT CGCGCTCGCT GGCCGGCATG
CGGGTCGGCT TCGCGTTCGG GCACGCGGCG CTCATCGAAG CGCTCAACCG CGTGAAAGAT
AGCTTCAATT CGTATCCGCT GGACCGCCTT GCGCAGGCGG CGGCCCAAGC GGCTTACGAA
GACGATGCGT ACTTCCGCAC CACGTGCGCG CGCGTGGTGG CAAGCCGCGT GCGACTGACG
CAAGCGCTGC AGGCGCTGGG GTTCGAAGTC GTGCCGTCGA TGGCGAACTT TGTCTTCGCC
CGGCATCCGG TCCACGATGC GGCCACGCTC GCGGCGCGCC TGAAAGAGCA AGCCATCTTC
GTGCGCCACT TCAAGCTCGC ACGCATCGAC CAGCATCTGC GCATCACCGT CGGCACCGAC
GATGAATGCG ACGCATTCTT AAATGCGTTG CGAGGGCTGC TGAAATAA
 
Protein sequence
MSRYWSDIVQ QLVPYVPGEQ PAIARPIKLN TNENPYPPSP RVVAAIAAEL GETGDSLRRY 
PDPLSRRLRE TVAAQVGLKP EQVFAGNGSD EVLAHVFQAL LKHDKPLRFP DISYSFYPTY
ARLYGVQHEV VPLADDFSIR ADDYLGDAGG VLFPNPNAPT GHALPMAEVA RIVAANPSSV
VVVDEAYVDF GAESAIALID RYPNLLVVHT TSKSRSLAGM RVGFAFGHAA LIEALNRVKD
SFNSYPLDRL AQAAAQAAYE DDAYFRTTCA RVVASRVRLT QALQALGFEV VPSMANFVFA
RHPVHDAATL AARLKEQAIF VRHFKLARID QHLRITVGTD DECDAFLNAL RGLLK