Gene Hoch_5640 details

Gene Information

Locus tag: Hoch_5640
Symbol:
ID: 8548054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7744062
End bp: 7745666
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 69%
IMG OID: 646390308
Product: chromosome segregation and condensation protein, ScpB
Protein accession: YP_003270010
Protein GI: 262198801
COG category: [K] Transcription
COG ID: [COG1386] Predicted transcriptional regulator containing the HTH domain
TIGRFAM ID: [TIGR00281] segregation and condensation protein B
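
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 7744062
end_bp = 7745666
reported_gene_length = 1605     # bp
reported_protein_length = 534   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon
# that does not appear in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1605 534"
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.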


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.90406
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.854183
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGCAA AGCGCAGGAA GAAGAAGAAG CGCGTGCGCG CCGCCGCTGT AAAGCGCGAG 
GTCGGCGAAG CGAACACCCA GGCGACCGTC GAGACGACGG CCGACGACAG CGCCGAGAGC
GCGGCCAGCG AGGACGCCGA AGCGTCCCAG GTCGCCGACG AGGCGCAGGC CGAGGACACG
CCCGGGGATG ACGCGGAGCC GCGCGCCGAG GACGAGCGCG GTGGCGAGGA CGCCGACGAG
GCCGACGAGG CCGAGAGCGA GGCCGCCGAT GTCGTGGCCG AAGGCGACGA GAGCGACGAG
GGCGACGCGG ATGAGGTAGG CCAGGAAGAG GCCGCCGAAG GTGAGGACGT CGACGCCGAG
AGGGAGGACG ACGCGGGCGA CGCGGGCGCT GTGGACGAAA ACGCCGACGC TGCCGCAGAG
GACGAGGCAG AGGACGAGGA CGGGGACGGG GACGACGGTG CAGACGAACG AGACGAGGGC
GAGCGGCGCC TGCACAGCCT GCTCGAGAGC CTGCTGTTCG CGACCGACAA GCCGCTGTCG
ATCAAGCAGC TCACCCAGAT CGCCGGGGTC AAGGAGATGG CGCGCATGCG CGCGGCGCTC
GATGTCTTGA GCGCGCACTA TCAGGGCCGT GGCATCGAAC TCGCCGAGGT CGCCGGCGGC
TGGCAGTTCC GCACCGCGCC GGAGAACTCG CGCTGGGTGC AGCAGCTCGT CGGCGGCAAG
CCCGTGCGCC TCAGCCGCGC GCAGCTCGAG ACCTTGGCCA TCATCGCCTA CCGGCAGCCG
ATCACGCGGC CCGAGATCGA CGAGATTCGC GGCGTGGATT CGGGCGGCAC GCTCAAGGTG
CTGCTCGATC GCAGCCTGAT CCGGGTGCTC GGCAAGAAGG AAGAGCCCGG TCGTCCGCTG
CTCTACGGCA CCACCAAGGA CTTCCTCGCC TTCTTCAATC TCAACGACCT GCGCGAGCTG
CCGACCTTGC GCGAGTACCA CGAGCTGACC GAGGACAGCC GCCGCGTGGT CGAGAAGATG
GGTATGGCGC TCGACGATCT GCCTGGCCGC GGCACAGTCG CGCACAGCGC GGACGATGAC
GACGAGGCGG TCGAGCAGGA CGCCGACGCA TCCGATGACG CCACCGACGG CGATGGCCCG
AGCGAGGACG AGAGCGGGAG CGACGATTCG TCCGAAAAAG ACGCGACGGC AGAAGGCGCG
GCTGCAGAGG GCGAGGGCGA GAGCCAGGGC GAAGACGAGG ACGGGGACGA GGACGGGGAC
GGGGACGGGG ACGGGGGCGA GAGCGCGAGC CAGGGCGAAG CGGAAGATGG CGAAGCCGAT
GTGCTGGACG ATGGCGAGGT CCCCGAGGCC GAGGCCGGGG GGGACGATGG CACCGAGGAT
GATCTCGGAG CGGATGAAAC GATTTCGGAC ACTGAGGAGT CCGAAGAAAT GACCGCCGAT
CCGGCGTTAC CGGACGATGC GGTTGCCGAC GAGATGTCGG CGGAGGAAGA GGCTGCGAGC
GAGGAGGGGG ACGCCGGCGA ACCGGCCGCT GACCCCTCGA CTGCGGAGCT TCTAGACGAT
GCGAGCGATG ACGGCGAGGA GCCAGAGCTC AGAGAGAAGG ACTAG
 
Protein sequence
MTAKRRKKKK RVRAAAVKRE VGEANTQATV ETTADDSAES AASEDAEASQ VADEAQAEDT 
PGDDAEPRAE DERGGEDADE ADEAESEAAD VVAEGDESDE GDADEVGQEE AAEGEDVDAE
REDDAGDAGA VDENADAAAE DEAEDEDGDG DDGADERDEG ERRLHSLLES LLFATDKPLS
IKQLTQIAGV KEMARMRAAL DVLSAHYQGR GIELAEVAGG WQFRTAPENS RWVQQLVGGK
PVRLSRAQLE TLAIIAYRQP ITRPEIDEIR GVDSGGTLKV LLDRSLIRVL GKKEEPGRPL
LYGTTKDFLA FFNLNDLREL PTLREYHELT EDSRRVVEKM GMALDDLPGR GTVAHSADDD
DEAVEQDADA SDDATDGDGP SEDESGSDDS SEKDATAEGA AAEGEGESQG EDEDGDEDGD
GDGDGGESAS QGEAEDGEAD VLDDGEVPEA EAGGDDGTED DLGADETISD TEESEEMTAD
PALPDDAVAD EMSAEEEAAS EEGDAGEPAA DPSTAELLDD ASDDGEEPEL REKD
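
The gene sequence begins with GTG, while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation, using only the first and last codons listed above. The helper `translate_cds` and the `START_CODONS` subset are illustrative assumptions, not part of the record or of any library.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of the table 11 initiation codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a CDS fragment, reading an initiator first codon as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


first_codons = "GTGACAGCAA AGCGCAGGAA GAAGAAGAAG"  # start of the gene sequence
last_codons = "AGAGAGAAGG ACTAG"                   # end of the gene sequence

print(translate_cds(first_codons))  # prints "MTAKRRKKKK"
print(translate_cds(last_codons))   # prints "REKD"
```

Both fragments reproduce the corresponding ends of the listed protein sequence. The final TAG codon is a stop and contributes no residue.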