Gene Hoch_3764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3764 
Symbol 
ID8546157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5173697 
End bp5175328 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content72% 
IMG OID646388434 
Producthypothetical protein 
Protein accessionYP_003268157 
Protein GI262196948 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.256312 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTGT ACCCGAGACT GCGTAAAATG TGTAGCACCT ACAATATTCT CGTCATCAGC 
GATCTGCACC TGGGCGAGGA TCTCAATCCC ACGGCCACGC CGCAGATCGC GCGCGACGTG
GCGCTCGCGG AGCGCAATCT GGTCGCCTTT CTCGGCCACC ACACGCGGCG CCGCGTGGAC
GGTCTGCCGT GGCGTCTGGT GATCAACGGC GACATGCTCG ACATCCTGAC CGTGTGCGTG
TTCCCAGAGG ACGCGCGCGC GGACGGTGCC GGAGCCGGTG CCGGGCGCGA TGCGGGGCGC
AATTCGGACA GTTGGCCGGT CATGCACCCC GAGGAGCACA AGCACGGCCT GGGCCGGCGC
ATGGACGCCT CGTGCGCCAA GGTCGGCGCG GTCATCGACC GCCATCTGCC CTTCGTGCAG
AGCATGGCGC GCTTTCTGGC GGCGGGAAAT CGCATCGAGA TCATCTGCGG CAACCACGAC
GCCGAGCTGT TCTGGCCCGA GGTCAAGAGC GCCTTCCGCC GCGGGCTCGA GCGGGTGTGG
CGGAGCATGC CGGAGGCCGC GCGCCCGGGC GCGCCCGACG CGACCAGCCT GGGGCAGCGG
CTCGGCTTCC ACGACTGGTT CTTCTACGAG CCCGGCGTCG CCTGGATCGA GCACGGCCAC
CAGTATGACG AGTGCTGCTC GTTCGAGTAT CTGCTCAACC CGGTGGACGC CGAGGGCCGG
TTCCTGGTCG ACAACGTCGA TTCGGCCAGC CTGCGCTACA TCACCAACAT GGTGCCCGAT
CTCGTGCCCC ACGGCAGCGA GGACTGGACC ATGGCCGGCT ACCTGCGGCT CACCTACGAC
GCCGGGCCCC GCGCCGGGCT GCAGATCGCG CGCGGCTTCC TGCTGGCGTC GATCCGCCTG
CTGCGCGAGT GGCGGGCGTC GCGGCGGCTG CGCGAGGGGC GGCGGCGGCG CGAGCGCCAT
CTCGACGGCC TGCGCGAGCT CAGCTCGCGG GTGCCGCTGT CCTACGAGCA GCTGCGCACG
CTCAGCGGCC TGCAGCGGCG CCCGGTGGTG ACCAGCCTGC GCCGTCTGAT GCAGGTGCTG
ATGCTCGACA AGCTGGCGCT GGCGGTGACG GCCATCGTGC TGAGCGCGCT GGCGCTGTGG
CTGCTGCCGC TCGGCTGGGC GCTGCTGGGC GTGGCGTTCA CGGCGCTGGG CACCGGGCTG
GCCGGCCGGC TGCTCAGCCG CGGGCGCATG GTCGATCCCG CGCTGCCGCT GGACCTGGTG
CCCGCGCGCA TCCTCGAGCA CGTCGACGCC CGCTTCGTGG TCTTCGGTCA CACGCACATG
CCGGTGGCGC GGCCGCTGCC GGGCGACGGC TGGTACTACA ACACCGGCAC CTGGGTGCCG
AGCGGCAAGC CCGGGCTGCT GCGCTGCTTC ACGCATCTGC GCATCCTGCA GCGTCCCGAT
GGGCCGGTCG CGGCGCTGTG CCAGTGGCGC GACGGCAGCA GCCAGGAGTT CTCGCCGGCG
CCGGCGCAGC GGCCGGGCTC GGCCAGCGAC GCCAACCCGC TCGGTCGTCG CGCGGACTCG
GTGCCGCTGC TGCCGCTGGG CAGCCCGCCG GGCGCCGCGC CCGGCATCGG TTACGGCACG
CCCGTCGCCT AG
 
Protein sequence
MHVYPRLRKM CSTYNILVIS DLHLGEDLNP TATPQIARDV ALAERNLVAF LGHHTRRRVD 
GLPWRLVING DMLDILTVCV FPEDARADGA GAGAGRDAGR NSDSWPVMHP EEHKHGLGRR
MDASCAKVGA VIDRHLPFVQ SMARFLAAGN RIEIICGNHD AELFWPEVKS AFRRGLERVW
RSMPEAARPG APDATSLGQR LGFHDWFFYE PGVAWIEHGH QYDECCSFEY LLNPVDAEGR
FLVDNVDSAS LRYITNMVPD LVPHGSEDWT MAGYLRLTYD AGPRAGLQIA RGFLLASIRL
LREWRASRRL REGRRRRERH LDGLRELSSR VPLSYEQLRT LSGLQRRPVV TSLRRLMQVL
MLDKLALAVT AIVLSALALW LLPLGWALLG VAFTALGTGL AGRLLSRGRM VDPALPLDLV
PARILEHVDA RFVVFGHTHM PVARPLPGDG WYYNTGTWVP SGKPGLLRCF THLRILQRPD
GPVAALCQWR DGSSQEFSPA PAQRPGSASD ANPLGRRADS VPLLPLGSPP GAAPGIGYGT
PVA