Gene Hoch_3855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3855 
Symbol 
ID8546248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5306885 
End bp5308501 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content68% 
IMG OID646388524 
ProductNusA antitermination factor 
Protein accessionYP_003268247 
Protein GI262197038 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0719337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0942338 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCCA ATCTCAACAT GGTCATCGAC CAGGTCGGTC GCGACAAGAA CATCGAGCGC 
GATGTCCTGG TTCAGGCGCT CGAGCAGGCG ATCCTCACCG CTGCGAAGAA GACCTTCGGG
GCCAGTCGTG AGCTCGAGGC GCAGTACAAC GAGGATACCG GCGTGGTCGA CCTCTTCCTC
ATCGTCAATG TGGTCGAGGA TGAAGAGGAC GCCATCTACG GTCGCGAGAT CACGGCTCCC
GACGCTGAGA CCCACGGCCT CGAGGCCGAG ATCGGCGACG AGCTGCTGTT CCAGGTCTTC
TACCGCGCCG AGGACAATGA GCGCGCGTCC GAGCAGGACG CCAAGTTCGG CGACCTGATC
GACCTCAAGA ACGCCCACAA GCGCTTCGGC CGCATCGCCG CGCAGACCGC CAAGCAGGTG
ATCTACCAGC GCGTGCGCGA GGCCGAGCGC GACAACGTCT ACAACGAGTA CAAGGACCGC
AAGGGCGAGC TCATCACCGG CATCGTGCGC CGCTTCGAGC GCGGCAGCAT CGTCGTCGAT
CTCGGCCGCG CCGAGGCCAT CTTGCCGACC CGCGATCAGG TGCCGCGCGA GTCGTATCGC
GTGGGCGACA GCATCAAGGC CTACGTGCTC GACATCGACC GCAACGCGCG CGGCCCGCAG
ATCATCCTCT CGCGCACGCA CAAGGGCCTG CTCGAGAAGC TGTTCGAGCA AGAGGTGCCC
GAGATCTACG AGAAGATCGT GCGCATCGAG TCGTCGGCTC GCGAGCCCGG CGCCCGCGCC
AAGATCGCGG TGTCCTCGCG CGACCGCGAC GTCGATCCCG TGGGCGCCTG CGTCGGCATG
AAGGGCTCGC GCGTCCAGGC CGTGGTCCAG GAGCTGCGCG GCGAGAAGAT CGACATCGTG
CCCTACGACG AGGATCCGGC GCGCTTCGTG TGCAACGCGA TCGCGCCCGC CGAGGTCTCG
CGCGTGCTCA TCGACGCCGA CGGCCACCGC ATGGAGCTGG TGGTGCCCGA CGACAAGCTG
TCGCTGGCCA TCGGCAAGAA GGGCCAGAAC GTGCGTCTGG CCTCGCAGCT CACCGGCTGG
CGCATCGATA TCCACTCGGA GTCGAAGATC CAGGATCTCG AGCGCCGCGC CAAGGAGCAG
CTCGCCGCGG TCGAGGGCAT GGACGACGAT CTCGCCGACA CCGTGTTCCG CCTCGGCTGG
CGCTCGGTGG GCGAGCTGTC GCGGGCCGCG CCCGAAGAGC TCGCCGGCGT GCCCGGCATC
GACGGTGTCG AGGTCGGCCG CCAGGTGGTC GCCGGCGCGC GCGCGTTCCT CGAGGAGGAG
AAGCTGCGCC AGGAGCACGC TCGCCGTGAG GCCGATCGCC GCAACAGCCT CAGCGATCGC
GAGCGCCTGC TCGAGGTCCG CGACATGAGC GAGGCGATCG CCGACCAGCT CGCCGAGGAG
GCGCAGGTGA TGCGCGTCGA GGATCTGGCC CGCTGGCCGC TCGACCGCCT GACCATGGCC
GACATCGACG AGGATACTCT GCGCACGCTG CGCCACTGGG CGCGGGTGTG GCTGGGCGAC
ATCTCGGCCG ACGCGCCGCC GCCCAAACCC CGCCGCAGCG AAGAGTCCGA GGCCTAG
 
Protein sequence
MQPNLNMVID QVGRDKNIER DVLVQALEQA ILTAAKKTFG ASRELEAQYN EDTGVVDLFL 
IVNVVEDEED AIYGREITAP DAETHGLEAE IGDELLFQVF YRAEDNERAS EQDAKFGDLI
DLKNAHKRFG RIAAQTAKQV IYQRVREAER DNVYNEYKDR KGELITGIVR RFERGSIVVD
LGRAEAILPT RDQVPRESYR VGDSIKAYVL DIDRNARGPQ IILSRTHKGL LEKLFEQEVP
EIYEKIVRIE SSAREPGARA KIAVSSRDRD VDPVGACVGM KGSRVQAVVQ ELRGEKIDIV
PYDEDPARFV CNAIAPAEVS RVLIDADGHR MELVVPDDKL SLAIGKKGQN VRLASQLTGW
RIDIHSESKI QDLERRAKEQ LAAVEGMDDD LADTVFRLGW RSVGELSRAA PEELAGVPGI
DGVEVGRQVV AGARAFLEEE KLRQEHARRE ADRRNSLSDR ERLLEVRDMS EAIADQLAEE
AQVMRVEDLA RWPLDRLTMA DIDEDTLRTL RHWARVWLGD ISADAPPPKP RRSEESEA