Gene Hoch_4838 details

Gene Information

Locus tag: Hoch_4838
Symbol:
ID: 8547245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6621156
End bp: 6622478
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 71%
IMG OID: 646389511
Product: Protein of unknown function DUF2183
Protein accession: YP_003269220
Protein GI: 262198011
COG category: [S] Function unknown
COG ID: [COG4850] Uncharacterized conserved protein
TIGRFAM ID:
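
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(6621156, 6622478)
    print(length, protein_length(length))
```

Under these assumptions the values agree with the record: 1323 bp and 440 aa.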


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.140965
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAAGG GAATCGTGAA ACGAGCATTG ATACTGGTGG CGCGGGCGCT GGCGCGCGTG 
CTGCAATTTC TCATGCTACC GCTGCGGCGG CGGGCCGGGC AGCGGCCGGC CGCGGTGCTC
GCCTATCGCG GCTACGGCAA CCGCGAGCGC CTGCACGTGC TCGGCCGCGT CGTCGAGGAT
ACCGGCGGAC CGTCGACGCG CGCGTCGTAC TCGCTGTGGG CCGAGCTCAA GCGGCTGTAC
CGCGCGCTGG CGAGCTGGCC GGTGGCCGAT GCCGAGCTGC GCCTGCACTG CGACGGCGTG
AGCGTGCCGG CGCGGAGCGA CGAGAACGGG TTCTTCCGCG CCATCGTGCA CCGACCCGAG
GCGGCCATGG CCACGCCTGC CCCGGGCGGC CGTGACGAGG CGCTGCGGGA CTCTCCGTGG
ATCGATGTCG AGGTCGAGCT GGTCGCGCCC AGCGGGCCGA CGCCGGTGCG CGAACGCTCG
CGCGTGCTGG TGCCGACCTC GCGTTCGCGG CGCGTGATCA TCAGCGATAT CGACGACACG
CTGATGGTGA CCGGGGTGGC CAATCGCCTG CTGATGTTCT GGCGGCTGTT CGCCAAGGCG
GCCGATAGCC GGGTGCCCTT TCCCGGGGTG GCGGCGTTCT ATCGCGCCCT GCACGGCGGC
GCCGCGGGCG GCGAGGACAA CCCGATGCTG TACGTCTCGC GCGGTCCGTG GACCATCTAC
GCCGAGCTCG ACAAGTTCTT CGGGCTGCAC GACATCCCCG TCGGGCCGGT GCTGCGGTTG
CGCGACTGGG GCATGTCGTT TCACCACCCG CTGCCCAAAC AGCAGAAGGG CCACAAGCTG
GACCTCATCG AGGACATGCT GTCGGTGTAT GGCGATCTGC CCTTCGTGTT GCTCGGCGAC
AGCGGGCAAC ACGATCCCGA GATCTACAGC GAGATCGTGC GCCGCCATCC GCGCCGGGTC
GAGGCCGTGT ACATTCGCGA CGTCACCGGC GATGAGGGTC GCGCCAGCAC CATCCACGGG
CTCGGGCGCG AGCTGACGAA GGCGGGCGCC GATCTGATTT TGTCGGACAG CACGCTGGAG
ATGGCCGAAC ACGCGCTGGC GCGCGGCTGG ATTGATGAGG ACGCGCTGCG CGAGATGCGT
CGGGAGGTGT CGCGCGAGGG CCGCCCCGAG CAGGCCGATG TCGCCGCGGC CGAGGCAGCG
GACGCCGCGG CCGCAGCGCA AGCGGAAGCC GACGCCGAGG CGCGTAATAG CGACGACGTC
GACGATGACG ACGACGAGCG CGGCGCCGGG CGACGACGCT ACCCGCCCGT GCCCGTGCCC
TGA
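
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any calculation. As a rough sketch, the GC percentage reported in the record could be recomputed as shown below. The helper names are hypothetical, and the record does not say how its own figure was rounded.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for the 10-base display blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three display blocks of the gene sequence above.
    first_blocks = "ATGGTGAAGG GAATCGTGAA ACGAGCATTG"
    print(f"{gc_content(first_blocks):.1f}% GC over {len(clean_sequence(first_blocks))} bases")
```

Passing the full gene sequence to `gc_content` gives the value to compare with the GC content field.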
 
Protein sequence
MVKGIVKRAL ILVARALARV LQFLMLPLRR RAGQRPAAVL AYRGYGNRER LHVLGRVVED 
TGGPSTRASY SLWAELKRLY RALASWPVAD AELRLHCDGV SVPARSDENG FFRAIVHRPE
AAMATPAPGG RDEALRDSPW IDVEVELVAP SGPTPVRERS RVLVPTSRSR RVIISDIDDT
LMVTGVANRL LMFWRLFAKA ADSRVPFPGV AAFYRALHGG AAGGEDNPML YVSRGPWTIY
AELDKFFGLH DIPVGPVLRL RDWGMSFHHP LPKQQKGHKL DLIEDMLSVY GDLPFVLLGD
SGQHDPEIYS EIVRRHPRRV EAVYIRDVTG DEGRASTIHG LGRELTKAGA DLILSDSTLE
MAEHALARGW IDEDALREMR REVSREGRPE QADVAAAEAA DAAAAAQAEA DAEARNSDDV
DDDDDERGAG RRRYPPVPVP
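
The protein sequence corresponds to the gene sequence read under translation table 11. Table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts. A minimal sketch of the translation follows. It does not handle alternative start codons or ambiguous bases, and `translate` is an illustrative name rather than a function from any particular library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove display spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three display blocks of the gene sequence.
    print(translate("ATGGTGAAGG GAATCGTGAA ACGAGCATTG"))  # prints MVKGIVKRAL
```

The output matches the first block of the protein sequence above.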