Gene Hoch_0020 details


Gene Information

Locus tag: Hoch_0020
Symbol:
ID: 8542390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 23070
End bp: 24242
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 69%
IMG OID: 646384808
Product: Saccharopine dehydrogenase (NAD(+), L-glutamate-forming)
Protein accession: YP_003264555
Protein GI: 262193346
COG category: [S] Function unknown
COG ID: [COG3268] Uncharacterized conserved protein
TIGRFAM ID:
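
The start, end, and length values listed above are mutually consistent, which a little arithmetic confirms. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record states neither, though both assumptions reproduce the listed values.

```python
# Values copied from the Gene Information fields above.
start_bp = 23070
end_bp = 24242

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1173 390
```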


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.947878
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGACA AACGCGACTT CGACGTGGTG GTTTTCGGCG CCACCGGCTT CACCGGACGG 
CTCGTGGCCG AGTATTTGAC CCGCAAGGCC ATGCCCGAGC TGCGCTGGGC CATCGCCGGC
CGCAGCCGCG ACAAACTCGA GCGCGTGCGC GCCGAGCTGG CCAAGATCGA CCCCGGCGCC
GCCGACATCG GCGTGCTCGA GGCCGACGCC CGCGACTGGG CCTCGCTGGC GGTGATGGCC
AACAAGACCC GCGTGGTGCT GACCACCGTC GGGCCCTACA TCGACGACGG CATCCAGCTC
GTGCGCGCCT GCGTGGCCAG CGGCACCGAC TACGTCGACA TCACCGGCGA GCCCCTGTTC
GTGAATGAGG TCGTGTCCAA GTACGACGCG CCCGCGCGCG AGCAGGGCGT GCGCATCGTC
AACTGCTGCG GCTTCGACAG CATCCCGCAC GACCTCGGCG TGATGTACAC GATCGACCAG
CTCGAGGCCA AAGGCCCGGT CGAGATCGAG GGCTTCGTGC GCGTGCGCGG CAACTTCTCG
AGCGGCACCA TCCGCTCGGC CATCAAGTCG ATGGCGCAGA TGAACAAGCT CAAGGGCGAC
GCCTCGGTGC GGCCGCAGCC GAGCACCGAG GGCCGCCGCG TGCGCAAGCT GCGCGCGCGC
TTGCACCACG ATCCGCGCAT GCAGTCGTGG ACCATGCCGA TGATGACCAT CGACTCGTGG
ATCGTGCGCC GCAGCGCGGC CATGCTCGAC AGCTACGGAT CCGACTTCGC GTACGCGCCC
TACATCTGCC AGACCAAGCT GAGCAGCGTC GGCAAGCTCA CGCTCGGTGT CGGCGCCGTG
ATGCTGCTGT CGCAGTTCCG GCCCACGCGC GAGATGCTGC TGGCGCGCTT TCCCTCGGGC
AAAGGCCCCA GCGAGGAGGA CATCGCCCAC GGTCGCTTCG AGCTCACCTT CTTCGCCCGC
AGCGGCGACA GCGAGCTGAT CACGCGCGTC TCGGGCGGCG ACCCGGGCTA CGGCGAGACC
AGCAAGATGG TCGCCGAGTC GGCCCTGTGC CTGGCCTTCG ACCGCGACCG CCTGCCCGAA
CGCACCGGCG TGCTCACGAC CGCGACCGCC ATGGGGCAGC CGCTGCTCGA GCGCCTGCAA
GCGGCCGGCA TCGACTTCGA AGTCGTCGGC TAA
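
The GC content field can be recomputed from the gene sequence once the grouping spaces and line breaks are removed. The sketch below is illustrative only: `clean_sequence` and `gc_percent` are hypothetical helper names, and for brevity it runs on the first line of the sequence rather than on the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence printed in groups of 10."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste the full block to check the whole gene.
first_line = "ATGGCAGACA AACGCGACTT CGACGTGGTG GTTTTCGGCG CCACCGGCTT CACCGGACGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```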
 
Protein sequence
MADKRDFDVV VFGATGFTGR LVAEYLTRKA MPELRWAIAG RSRDKLERVR AELAKIDPGA 
ADIGVLEADA RDWASLAVMA NKTRVVLTTV GPYIDDGIQL VRACVASGTD YVDITGEPLF
VNEVVSKYDA PAREQGVRIV NCCGFDSIPH DLGVMYTIDQ LEAKGPVEIE GFVRVRGNFS
SGTIRSAIKS MAQMNKLKGD ASVRPQPSTE GRRVRKLRAR LHHDPRMQSW TMPMMTIDSW
IVRRSAAMLD SYGSDFAYAP YICQTKLSSV GKLTLGVGAV MLLSQFRPTR EMLLARFPSG
KGPSEEDIAH GRFELTFFAR SGDSELITRV SGGDPGYGET SKMVAESALC LAFDRDRLPE
RTGVLTTATA MGQPLLERLQ AAGIDFEVVG
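
The protein sequence can be cross-checked against the gene sequence by translating codons. The following is a small sketch that translates only the first ten codons. The `CODONS` dictionary is a deliberately partial table containing just the codons in this fragment (these assignments are the same in translation table 11 and the standard code), and `translate` is a hypothetical helper name.

```python
# Partial codon table: only the codons that occur in the fragment below.
CODONS = {
    "ATG": "M", "GCA": "A", "GAC": "D", "AAA": "K",
    "CGC": "R", "TTC": "F", "GTG": "V",
}


def translate(dna: str) -> str:
    """Translate complete codons in dna using the partial CODONS table."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, with grouping spaces removed.
fragment = "ATGGCAGACAAACGCGACTTCGACGTGGTG"
print(translate(fragment))  # prints: MADKRDFDVV
```

The result matches the first ten residues of the protein sequence. A full check would need a complete codon table for translation table 11.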