Gene Hoch_3416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3416 
Symbol 
ID8545804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4716202 
End bp4717338 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content73% 
IMG OID646388083 
Productamidohydrolase 
Protein accessionYP_003267811 
Protein GI262196602 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.729562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.139217 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGC GCTTTGTCTC TTTGCCGCTT GGCGGTGTCG GCTGCTTCGG TTACAGGTCG 
AACATGCCCC CTCACTCGCG CCGCGGTCAC GTCAACGCTC ACACGCACAT CTACAGCGCC
CTGGCGCCCT TCGATATGCC GGCACCGCAG CCGCCGCCCG AGAGCTTTTT GCAGATCCTC
GAGCGGGTGT GGTGGCGTCT CGACCGCGCG CTCGACGAGC AGGCGCTGGC CGCGGGCGCC
GACTACTACG TGGCCGAGGC GCTGATGGCG GGCACCACCT GCCTCATCGA TCATCACGAG
TCGCCCAACT TCATCGACGG CTCGCTCGAT GTGCTGGCCG AAGCCTGCCA GCGCCTGGGG
ATGCCGGCGG TGCTGTGCTA CGGCGCCACC GAGCGCAACC GCGGCCGCGA CGAGGCCCGC
GCCGGTCTGG CCGAGTGCGA GCGCTTCCTG CGCACCAATG AGCGCCCGCT GGTGCGCGGC
GCGGTCGGCC TGCACGCGTC GTTCACGGTC TCGGACGACA CCATCGGCGA GGCCGCGGCC
CTGGCCCGCT CGCTGGGCGC GGTGCTGCAC CTGCACGTGG CCGAGGGTCC CGAGGATGTC
GCCGACGCCC GCCGCCGCGG CGACGCCAGC CCGCTGGCGC GCCTGCGCCG GCTCGACGCG
CTGGTGCCCG GGTCGATCCT GGTCCACGGC GTGTACCTCA CGGCCGAGGA GGTGGCCGAG
TGCGAGCAGC GCGGGCTGTG GCTGGTGCAG AACCCGCGCT CGAACCGCGG CAACGGGGTC
GGCTATCCCA GGGCGCTCAC GCACAGCCGG TGCGTGGCGC TGGGCACCGA CGGCTACCCG
GCGGATATGA ACGACGAGGT CGCCGCGCTG TTCGCCGAGG CAGAGGAGGT CGAGGATGAA
TCGCCGCGTC TGGGCAACCG CCTGGGCGCC GGCCACGCTC TGTGCGCGGC CCTGTTCGGC
GGCGAGCCGC CCGAGGTCGA CGTCCACGAG CCCATGGGCA GCCCCGAGAT GCGCGTGGAC
GTCGCCGGGC GCGAGGTGGT CGCCGGCGGC GAGCTGCTCA CCGGTGATCG CGCGGCCTTC
GAGGCCCGCG CCCGGACCCA GGCCGAGCGG CTGTGGCAGC GCATGGCCGC GCTGTGA
 
Protein sequence
MSARFVSLPL GGVGCFGYRS NMPPHSRRGH VNAHTHIYSA LAPFDMPAPQ PPPESFLQIL 
ERVWWRLDRA LDEQALAAGA DYYVAEALMA GTTCLIDHHE SPNFIDGSLD VLAEACQRLG
MPAVLCYGAT ERNRGRDEAR AGLAECERFL RTNERPLVRG AVGLHASFTV SDDTIGEAAA
LARSLGAVLH LHVAEGPEDV ADARRRGDAS PLARLRRLDA LVPGSILVHG VYLTAEEVAE
CEQRGLWLVQ NPRSNRGNGV GYPRALTHSR CVALGTDGYP ADMNDEVAAL FAEAEEVEDE
SPRLGNRLGA GHALCAALFG GEPPEVDVHE PMGSPEMRVD VAGREVVAGG ELLTGDRAAF
EARARTQAER LWQRMAAL