Gene Hoch_5512 details


Gene Information

Locus tag: Hoch_5512
Symbol:
ID: 8547925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7559080
End bp: 7560468
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 71%
IMG OID: 646390185
Product: argininosuccinate lyase
Protein accession: YP_003269888
Protein GI: 262198679
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
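
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7559080
END_BP = 7560468
GENE_LENGTH_BP = 1389
PROTEIN_LENGTH_AA = 462


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP), "bp")
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP), "aa")
```

Under these assumptions the computed span equals the listed gene length, and the listed gene length gives the listed protein length.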


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.269359
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGACG CAAAAGGCGC ACGGGGCCGC CGCTTCGCCG GCTCGCTCGC ACCGGACGCC 
ACCGAGGTCA ACGCCTCGGT GGGCTTCGAC TGGCGGCTCT TGCCGCACGA CGTGGCCGGC
TCGCTCGCCC ACGCCCGCAT GTTGGCCAAG CAGGGCATCA TCTCCGCCGA CGACCTGGCG
CGCATCGAGG CCGGCATCCA GCGCGCGGCC GAGCGCCTGC AGAGCGGCGA GGTGCCCTGG
GATCCCGCGC TCGAGGACGT GCACATGAAC GTCGAGGCGC GGCTCATCGA GGAGGTCGGC
GAGCCCGGCC GCCGCCTGCA CACGGCGCGC AGCCGCAACG ATCAAGTGGC CACGGATCTG
CGCCTGTACG CGCGCGCGCA GGCCGCGGTG CTGATCGAGC GCATCGACGA GCTGCGCCGC
GCCCTGGCCG ACAAGGCCGC GTGCTACCTC GACGTGTTCA TGCCCGGCTA CACCCACCTG
CAGCGCGCCC AGCCGGTGCG CCTGGCCCAT CACCTGCTGG CCTACGACGC CATGCTGCGC
CGCGATCGCG GGCGCATCGA GGACGCGTCC GCGCGCGCCG GTGAGTGTCC GCTCGGCGCC
GGCGCCCTGG CCGCGACCAC CTTCCCCATC GACCGCGAGG CCACGGCCCG CGAGCTGGGC
TTCACCGGGG TCACGCGCAA CAGCCTCGAC GCGGTCGGCG ATCGCGACTT CGCGGTCGAG
CTGGTGGCCG CGATCGCGCT GTGCCAGGTG CACCTGTCGC GCCTGGGCGA GGAGATCGTG
CTGTGGCTGT CGCAGGAGTT TGGCTTCGCG CGCCTCGACG AGTCGTACTG CTCGGGCTCG
AGCATCATGC CGCAGAAGAT GAACCCCGAC CTGGCCGAGC TCATCCGCGG GAAAACCGGA
CGCGTGGTCG GTCACTGGGT GTCGCTGGTG ACCGTGCTCA AGGGATTGCC GCTGGCTTAT
AACAAGGATC TCCAGGAGAG CCAGGAGCCG CTCTTCGACG CGGTCGAGAC CCTCGACGCC
AGCCTGCGCG TGGCCCGCGG CATGATCGAC AACCTGGTGT TCGACGAGCA GCGCCTGGCG
CGCGCCGTCA CCCAGGGCTT TTTGCTGGCC ACCGAGGTCG CCGACTACCT GGTCACCAAG
GGCATGTCCT TCCGCGAGGG CCACCACATC GCCGGCGCGC TGGTGCGCAC GGCGCTCGAG
CGCCAGGTCG GTCTCGAGGC GCTGCCGCTC GAGGTCTTTC GCGGCGAGAG CGAGCTGTTC
GAGGACGATA TCTTCTCGTG GCTGGAAGTC GGACGCGCGG TCGATCGTCG CGATGTCGTC
GGCGGCCCGG CGCGCTCGCA GATCGAAGCC GAGCTGGTGC GCATCCGCGC CGAGCTGGAG
ACGCGATGA
 
Protein sequence
MTDAKGARGR RFAGSLAPDA TEVNASVGFD WRLLPHDVAG SLAHARMLAK QGIISADDLA 
RIEAGIQRAA ERLQSGEVPW DPALEDVHMN VEARLIEEVG EPGRRLHTAR SRNDQVATDL
RLYARAQAAV LIERIDELRR ALADKAACYL DVFMPGYTHL QRAQPVRLAH HLLAYDAMLR
RDRGRIEDAS ARAGECPLGA GALAATTFPI DREATARELG FTGVTRNSLD AVGDRDFAVE
LVAAIALCQV HLSRLGEEIV LWLSQEFGFA RLDESYCSGS SIMPQKMNPD LAELIRGKTG
RVVGHWVSLV TVLKGLPLAY NKDLQESQEP LFDAVETLDA SLRVARGMID NLVFDEQRLA
RAVTQGFLLA TEVADYLVTK GMSFREGHHI AGALVRTALE RQVGLEALPL EVFRGESELF
EDDIFSWLEV GRAVDRRDVV GGPARSQIEA ELVRIRAELE TR
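
The relationship between the gene and protein sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as starts, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_percent` and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 9 bases of the gene sequence shown above.
    print(translate("ATGACAGACG CAAAAGGCGC ACGGGGCCGC"))
    print(translate("ACGCGATGA"))
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the listed protein sequence and GC content.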