Gene Hoch_3939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3939 
Symbol 
ID8546335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5432686 
End bp5433945 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content72% 
IMG OID646388611 
Productlysine 2,3-aminomutase YodO family protein 
Protein accessionYP_003268331 
Protein GI262197122 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1509] Lysine 2,3-aminomutase 
TIGRFAM ID[TIGR00238] KamA family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0153502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0100362 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGCCCC GCCGCCCGCT GCCCACCCTG CCCTCGCCGC CCGCTACCCA GCCGGCGGCG 
CCGCCGCTGC GCCGCGAACA GGTCGCTGAC GAGGACTGGA ACGACTGGCG CTGGCAGGCG
CGCAATATGC TCACCACGGC CGAGGAGTTT GCCCGCGTGG TCGAGCTGAG CGACGAGGAG
CGCGCTGCGC TCGTGGATAC GGCGCCGATG TTTCGCACCG GCGCCACGCC GTATTACGCC
AGCCTGATGG ACCCGGCGCG CGCCGACTGC CCGATCCGCA AGCAGGCCAT CCCCTCGCGG
CGCGAGCTCG ACTTCGCGCC CGAGGAGCTG CGCGACCCGC TGGGCGAGGA CAGCCAGAGC
CCGGCGCCGT GCGTGGTGCA CAAGTACCCG GACCGGGTGC TGCTGCTGGT GCTCGACCGC
TGCGCGATCT ACTGCCGCCA CTGCAACCGC CGGCGCCTGG TCGGCGGTGA CGCGCCGCCC
GCGCGCGACG ACATCGACGC CGGCATCGAC TACATCGCGC GCACGCCGCA GATCCGCGAT
GTGCTGCTCT CGGGCGGCGA TCCGCTGCTC TTGTCGAATG CGCGCCTGGC CCACATCCTG
GGCCGCCTGC GCGCGATCGA GCACGTCGAG ATCATCCGCA TCGGCACGCG CCTGCCGGTG
GTGCTGCCGA TGCGCATCGA CGACGAGCTG TGCGCGACCT TGCGCCGCTT CCACCCGCTG
TACATCAACA CGCACTTCAA TCACCCCAAG GAGATCACGA GCGAGGCGCG CGCGGCCTGC
GAGCGCCTGG TCGACAGCGG CATCCCGGTG GGCAACCAGG CGGTGCTGCT GCGCGGGGTC
AACTCGTCTG TGCGCTGCAT CCGGGCGCTG ATGCGCGCGC TGCTGCGCAT GCGCGTGCGT
CCGTATTACC TGTTTCAGGG GGATACGGTG CTGGGCACGG ACCATATGCG CACGCCGGTG
GACGCGGCCA TCGCGCTGAT GGAGGGGCTG CGGGGCTGGA CCAGCGGCAT GGCGATTCCG
CACATGGTCA TCGACGCGCC CGGCGGCGGC GGCAAGCTGC CCTTTGGCCC CGAGTACGTG
CTCGAGCGCC ACCCGGACCA CGTGCTGGTG CGTACCTACC GCGGCCGCGT GGTGCGCTAT
CCCGAGCCGC GTGAGCGCGA CTGCCGGGTG AGCTACGACG AGGTGTTCTT CGCCGACGCT
GGGGACACGG ACGGGGACGA CCCCGGCCTG CACACGCTCG ACGCCGCGTC CGAGGCATGA
 
Protein sequence
MSPRRPLPTL PSPPATQPAA PPLRREQVAD EDWNDWRWQA RNMLTTAEEF ARVVELSDEE 
RAALVDTAPM FRTGATPYYA SLMDPARADC PIRKQAIPSR RELDFAPEEL RDPLGEDSQS
PAPCVVHKYP DRVLLLVLDR CAIYCRHCNR RRLVGGDAPP ARDDIDAGID YIARTPQIRD
VLLSGGDPLL LSNARLAHIL GRLRAIEHVE IIRIGTRLPV VLPMRIDDEL CATLRRFHPL
YINTHFNHPK EITSEARAAC ERLVDSGIPV GNQAVLLRGV NSSVRCIRAL MRALLRMRVR
PYYLFQGDTV LGTDHMRTPV DAAIALMEGL RGWTSGMAIP HMVIDAPGGG GKLPFGPEYV
LERHPDHVLV RTYRGRVVRY PEPRERDCRV SYDEVFFADA GDTDGDDPGL HTLDAASEA