Gene Hoch_4065 details


Gene Information

Locus tag: Hoch_4065
Symbol:
ID: 8546466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5584667
End bp: 5586256
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 69%
IMG OID: 646388742
Product: Pyridoxal-dependent decarboxylase
Protein accession: YP_003268457
Protein GI: 262197248
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0076] Glutamate decarboxylase and related PLP-dependent proteins
TIGRFAM ID:
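
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5584667
end_bp = 5586256

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1590 bp) and Protein Length (529 aa).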


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.125159
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.183613
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATATTCT CGAATTGCGA GTTGCCGGCT CTGTTCTCTG ACGAAGAACT GGATTTCGAC 
GAGGATTTCA CGCGCGAGGC GCCCGCGAGC ATCGCGCCCG AGCGCTTCCG CGACATGGGC
CATCGTCTCG TCGATCGCAT TTCTGGACAC ATGTCCGGTA TGCGCGAGCG GCCGGTCGCC
AAGCAGCACG GTCTCGAGAC CGTGCGCGAG GCGCTGGGTT CGATGCCGCT GCCCGAGAGC
GGGATGGCGG CCGAGGAGGT GCTCGAGCGC GCCACCTCGC TGCTCATGGA GCACTCGGTG
ATGACCGCGC ACCCGCGCTT CTGGGCCTAC GTGCACGGCG CGCCCAGCCC GCTCGGCGCG
CTCGCCGATT TCCTGGCCTC GGCCATCAAC TCGCCGGCGA CCAGCTTCCA GACCGGCCCC
ATGGCCTCGG CCATCGAGAA GCAGGCGGTC GAGTGGATGG CCGAGCTGGT CGGTTTCCCG
CAGGATTGCG GCGGCATCTT CCTGTCCGGC GGCTCGATGG CGACCATCAC CGCGGTCACC
ACCGCGCTGC GCCGCAAGGC CGGCTGGGAC GTGCGCGGCG AGGGCGTCAC CGGCATGCGC
GGCCGCCTGC TGCGGCTCTA CGCCACCGCC CAGACCCATA GCTGCGTGCG CACGGCGGCC
GATATGTGCG GCCTCGGCGA GAGCGCCATC CACCGCGTGC CCACCGACGC CATGGGCCGC
ATGGATCCCA ACGCGCTGGC CGCCTGCATC GAGCTCGACC GCCGCTCGGG TCTGCGTCCC
TTCCTGGTGG TCGGCACCGC GGGCACGACC TCGACCGGCA GCATCGACCC GCTGGCCGAG
ATCGCGGCGG TGTGCCGCGA GAGCGACCTG TGGTTTCACG TCGATGGCGC CTACGGTGCC
GTCGCCGTGG TCTCGCCCGA CGCGCCCGAG GCGCTCAAGG GCCTGCGCGA GGCCGACAGC
CTGGCGCTCG ATCCCCACAA GTGGATGTAC ATGCCGCTCG AGGCCGGCTG CCTGCTGATG
CGCGATCGCC ACGCGCTCTA CGACACCTTC TGCTACCGCG CCGACTACTA CTCGCACAAC
CAGGCGGCGC CCGATGACGC GCTGCCCTTC CGCGACCAGA GCGCGCAGAC CTCGCGCGGC
CTGCGCGCGC TCAAGGTGTG GCTGGCGCTG CAGAGCATCG GCCGCGACGG CTACCGGCAG
ATGATCAGCG ACGACATGGC CCTGGCCAAG CGCCTGTATC GCAAGGTCGC GGCGCATCCC
GCGCTCGAGG CGCTGTCGCA CAGCCTGAGC ATCACCACCC TGCGCTACGC GCCGCCGGAG
CTGGCCGCCA ACGTGAGCCC GGCGTATCTC GACCTGCTCA ACGAGCGCAT CCTCAAGCGC
CTGCAGAGCA CCGGCATGGC CTATCCCTCG CACACCTATG TCGATGGCAA GTACGTGCTG
CGCGTGTGCA TCGTCAATCA CAACACCCAG GTCTGCGACG TCGATGCGCT GCCGCAGATG
GTCGCGACCC TGGGCGACGC GCTGCACCAA GAGGCGCTCA CCGAGCTGGC CGCGCGCATG
GCTGACGAGA GCAGCGGCAT GCTTCTGTGA
 
Protein sequence
MIFSNCELPA LFSDEELDFD EDFTREAPAS IAPERFRDMG HRLVDRISGH MSGMRERPVA 
KQHGLETVRE ALGSMPLPES GMAAEEVLER ATSLLMEHSV MTAHPRFWAY VHGAPSPLGA
LADFLASAIN SPATSFQTGP MASAIEKQAV EWMAELVGFP QDCGGIFLSG GSMATITAVT
TALRRKAGWD VRGEGVTGMR GRLLRLYATA QTHSCVRTAA DMCGLGESAI HRVPTDAMGR
MDPNALAACI ELDRRSGLRP FLVVGTAGTT STGSIDPLAE IAAVCRESDL WFHVDGAYGA
VAVVSPDAPE ALKGLREADS LALDPHKWMY MPLEAGCLLM RDRHALYDTF CYRADYYSHN
QAAPDDALPF RDQSAQTSRG LRALKVWLAL QSIGRDGYRQ MISDDMALAK RLYRKVAAHP
ALEALSHSLS ITTLRYAPPE LAANVSPAYL DLLNERILKR LQSTGMAYPS HTYVDGKYVL
RVCIVNHNTQ VCDVDALPQM VATLGDALHQ EALTELAARM ADESSGMLL
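
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a rough illustration rather than the method the database used: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in which codons may act as start codons, which this sketch does not model).

```python
# Build the 64-codon table from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1 and drop any trailing stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    return "".join(protein).rstrip("*")


# First line of the gene sequence from this record.
first_line = "ATGATATTCT CGAATTGCGA GTTGCCGGCT CTGTTCTCTG ACGAAGAACT GGATTTCGAC"
print(translate(first_line))  # MIFSNCELPALFSDEELDFD
```

Passing the full gene sequence to `gc_content` and `translate` should give values comparable to the GC content field and the protein sequence listed above.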