Gene Hoch_5271 details


Gene Information

Locus tag: Hoch_5271
Symbol:
ID: 8547683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7249017
End bp: 7250123
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 73%
IMG OID: 646389945
Product: peptidase M20
Protein accession: YP_003269649
Protein GI: 262198440
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01900] succinyl-diaminopimelate desuccinylase
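
The length fields in this record can be cross-checked against the coordinates. The short Python sketch below does that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 7249017
end_bp = 7250123

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 1107 bp and protein length of 368 aa, and the gene length is a whole number of codons.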


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCGCGT CTGCCGTCCC CGAGCAGGCG CTGCGCGACA CCCTGCTGGC GCTCACGGCC 
ATCGCCAGCC CCATCGGCGA GGAGCAGGCG CTGTGCGACG CGGTCGAGCG CCGGCTGCGC
GCCAGCCTGG GCGAGGCCGC GGTGACGCGT CATGAGCACA GCCTGGTGGT GCACGCGGCG
CCGCGTCCCG GGCTGCCGCG CATCGGCCTC ATCGGCCACC TCGACACCGT GCGCACCGAA
CACGACGGAC CCGCGCGCAT CGACGGCGAG CGGCTCTACG GCGCCGGTGC GGCCGATATG
AAATCCGGCC TGGCGGTGAT GATCGAGCTC AGCGAGCGGC TCGAGCGCGC GTCCTTGCCC
TGCGACCTCA CCCTGGTGTT CTACGAGCGC GAGGAGGGGC CCTTCGAGGA GAACATGCTG
GGCCCGCTGC TCGAGCGCTT CGACGCCCTG CGCCAGCTCG ACCTGGCCAT CTGCCTCGAG
CCCAGCGACA ACAAGCTGCA GCTCGGCTGC ATGGGCTCGG TGCACGCCAC CGTGCGCTTT
CTCGGACGCA CGTCCCACAG CGCCCGGCCG TGGCAGGGCG AGAACGCCAT CACCGGCGCG
GCCGACTTCC TGGCCCTGCT GCGCGACCGC GCGCCCAACG ATGTCGTGCT CGACGGCCAC
CACTTCCGCG AGGTGGTGTC GCCGACCATG GCCAGCGGCG GCCGCGGTCG CAACATCATC
CCCGACAGCT TCGAGATCAA CGTCAACTAC CGCTTCGCCC CCGGGCGCAC GCCCGAGCAG
GTGGTCGACG AGCTGCGCGC GCTGGTGCGC GAGTGCGCGG GCGATCGCGC CGAGCTCGTG
CCCACCGATC TCAGCCCGGC CGGGCGTCCG CACGCCAGCC ATCCGCTGGT CCTGCACCTG
CGCGACTGCG GCGTGAGCGC GCTCGAGACC AAGCAGGCGT GGACCGATGT GGCCCGCTTC
GACGCCGCCG GCGTGCCGGC CGTCAACTTC GGGCCCGGCA CCCAGGCCCA GGCCCATCAG
CGCAACGAAT ACACCGAGCT GCCGCCGCTC TACGCCGGCT ACGCCATCCT CGAGCGCTTC
CTCAGCAGCG TGCCGGCGCC CGCCTGA
 
Protein sequence
MSASAVPEQA LRDTLLALTA IASPIGEEQA LCDAVERRLR ASLGEAAVTR HEHSLVVHAA 
PRPGLPRIGL IGHLDTVRTE HDGPARIDGE RLYGAGAADM KSGLAVMIEL SERLERASLP
CDLTLVFYER EEGPFEENML GPLLERFDAL RQLDLAICLE PSDNKLQLGC MGSVHATVRF
LGRTSHSARP WQGENAITGA ADFLALLRDR APNDVVLDGH HFREVVSPTM ASGGRGRNII
PDSFEINVNY RFAPGRTPEQ VVDELRALVR ECAGDRAELV PTDLSPAGRP HASHPLVLHL
RDCGVSALET KQAWTDVARF DAAGVPAVNF GPGTQAQAHQ RNEYTELPPL YAGYAILERF
LSSVPAPA
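
The relationship between the gene sequence, the protein sequence and the GC content can be illustrated with a small Python sketch. It is not the method used to produce this record. It assumes the standard amino acid assignments shared by NCBI translation tables 1 and 11, and the table 11 set of alternative start codons, any of which is read as M at the first position. The function names `clean`, `gc_fraction` and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Assumption: start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is still read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above, followed by its final stop codon.
fragment = "TTGAGCGCGT CTGCCGTCCC CGAGCAGGCG"
print(translate(fragment + "TGA"))
print(gc_fraction("TTGAGCGCGT"))
```

Applied to the opening of the gene sequence, the sketch reproduces the first ten residues of the protein sequence (MSASAVPEQA), with the initial TTG read as M. Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content and protein sequence listed in the record.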