Gene Hoch_3155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3155 
Symbol 
ID8545543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4338370 
End bp4341120 
Gene Length2751 bp 
Protein Length916 aa 
Translation table11 
GC content73% 
IMG OID646387822 
ProductUTP-GlnB uridylyltransferase, GlnD 
Protein accessionYP_003267550 
Protein GI262196341 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.620128 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.567988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAACCAT CGCAGGCAAC TCCGCATAGC GACGCGCTCG CCGCTCGCTT CACCGAGGCG 
CGCGCCCACG CTCTGCGCGC GCTCGACGAG GGCGTGGGCG GACGCGCGGG CTGCGGACTG
CTGGCCGGGG CGTGCGACGA TATCGTCGCC GAGCTGTGGT CCGCGGCCGA GGCCGCCGAG
CCCTCGGAGG CGCCGCTGGC TCTGGTGGCC ACCGGCGGCT GGGGACGCCG GGCCGTGTGT
CCGTTTTCCG ATCTCGACTT CATCCTGCTG TCGAAGCCGC GCGCGCGAGA CGCGGCGCGC
CGTCGCGCCG ACGCGCTGAT CTATCCCATG TGGGATGCGC GCATGCGCGT GGGACACGCG
GTGCGCACGC CGCGCGAGGC CGCGCAGCTC GCGGGCGGCG ATCTGGCCAC GGCGACCGCG
TTGCTCGACC TGCGCCACGT CGCCGGCGAC CCCGCGCTCA CCGAGGAGCT GACGCGCAGC
GCCCGGCAGA TCCTGGCGCC CGGCGGCAAC GCCAACGAGT TCGTCACCCG CCTGGCCGAG
GAGCGCACCC GGCGCCACGA CAAGTTCGGC GACTCGCTGT ATCTGCTCGA GCCCAACCTC
AAGCACGGCA TCGGCGCGCT GCGCGACCTC GACACCGCGC TGTGGGCGGC CAAGGCGCGC
TGGTCCACGG GCGTGCCCGC GGAGCTGGTG ACCTTTGGCG AGCTCACCCA CCGCCAGGGC
CGGCTGCTCG AGGACGCGCT GGATTTTCTG CTCATGCTGC GCTTTCGCCT GCAGGCCCAG
GCCAAGCGCG CCACTGACCA GCTCAGCTTC GAGAGCCAGG AGGCCATCGC CGCGCACCTG
CACCCCGACG CCACCTTGCC CGAGGGCGGC ATCCGGCCCG CGGTCGCGCC CGCGGTCGAG
CTGCTGATGC GCCAGTACTA TCTTCACGCG CGCGATGTCG TGCGGCTCAC CGACCGCGTG
CTCGAGATGG CCCGGGTGCC GGCGCGGCGC AAGCCGCGGG TGCGCCGCGT GGACTCGGTG
TTCTTGCTGT TCAACGGCAA GCTCGCGGTC AAAGACCCGG ACATGCTGCG CCACCGCCCA
GCCGAGATGC TGCGCCTGTT CCGCGTCGCG CTCGACCTCG AGGTGCCGGT CTACAGCCAC
ACCAAGGAGC TGGTGGCCGA GCACCTGGCC GCGCACGAGG GTCTGCTCTC GCGCACCCGC
GGCGCCGCCG GGCACTTTCT CGACGCCCTG GTCGACACCC GCGACAGCGG CCAGCCCTCG
CTGCTCGAGC AGATGAATCA GATCGGCGTG TTGGCCTCGG TGATGCCCGA GTTCGCGCCC
TGCACCTGCC GCGTGCAGCA CGACCTCTAC CACGTGTACA CGGTCGACCA GCACCAGCTC
CTGACCGTGG CCATGCTCAA GCGCCTGGCC CGCGGCGAGC TGCGCGACAT CCACCCCACG
GCGACCGCCG CCATCGCCCA GGTCGAGCGG CCCGCGGCGC TGTTCCTCGG GACCCTGCTG
CACGACGTCG GCAAGCCGCT CGGCCACGGC CACGCCGAGA AGGGCGCGGT GCTCACGCGG
CGCATCGCCC TGCGCCTGGG CATGGACGAG CGCGACGTGA GCACGGCCGA GTTCCTCGTC
CGCCAGCACC TGACCATGGC CCACCTGTCG CAGCGCCGCG ACCTCTCCGA CCCCGACGTC
ATCAGCCGCT TCGCCGAGCG CGTGGGCGAC GCCCAGCGGC TCATCCAGCT CTACCTGTGC
ACGCTGTGCG ACACCGCCAT GACCGCGCCC GGCAACCTCA GCGCCTGGAA GGAGCAGCTC
CTCGAGGAGC TGTACACGCG CGCGCTGCGC TTTCTGCGCG GCCACGCCGG CGCCGCCGAG
ACCGACGTCG ACGAGCAGAT CCAAAGGACC CGCGAGCGCG TGCGCGTGCT CGTCGCCCAG
CCCGAGCCCG GACCGCCGGC GCCGGACGCG CCGGGCGAGG CCCCGAGCCC CGAGCCGCGC
AACAGCCGCG ACATCGAGGC CGCGCTGGGC GGCGTCGACG AGCGCCTGTT CGCGTCGCTG
TCGCCGCGGC AGCTCGCGCG CCTGATCAAG CTCGGTTGGG CCTGCGTCGA CTCCGGCAGC
TTGGCCGAGA GCGCGGTCGC CTTCTATCCG CTCAAGGGCC ACAGCGAGCT GGCCGTGGTC
GCCGAAGATC GCCACGGCCT GCTGTCGACC ATCGCGGCGG CGCTGTCGGC CGCGCGCATC
AGCGTGCTCG GCGCGGTCAT CGGCGCCGGC CACTTCGCCA GCACCGCCGA GGGCGACGAC
AGCGTGCGCA CCCTGGGCCT CGACATGTTC TTCGTGCGCG ATCTCGCGGG CGAGGCCATC
CCGGTCAACG ACGCCCGCTG GGCCAAGTTC AACGCCGAGC TGCTCAGCCT GCTCAGCGGC
GAGCAGGTGC GCGAGGCCGA GCGCCGCCTG CTCGCGCGCC GCCAGCAGTC GGGACTGGCG
CCGCGGGTGA CGCCCGGCGT GGGCACCAGC ATCCGCATCG ACAACAGCGC CTCGGCCGAC
GCGACCGTGA TCGACGTGCT CACCCAGGAC CGGGTCGGCG TGCTCCACGC CATCAGCCGC
ACGCTCTCGG ACTTCGGCCT CGATATCCAC CTGTCCAAAG TGTCCACCCA GGGCGAGCAG
GTCGCCGACA TCTTCTACGT GGTCAGTACC TCCACGCAGC GCAAGCTCGA GGACGACAGC
GCCATCGCCG ATCTCGAGCT GCGCCTGCAG GTCGCCCTCG AACAGGTATA A
 
Protein sequence
MQPSQATPHS DALAARFTEA RAHALRALDE GVGGRAGCGL LAGACDDIVA ELWSAAEAAE 
PSEAPLALVA TGGWGRRAVC PFSDLDFILL SKPRARDAAR RRADALIYPM WDARMRVGHA
VRTPREAAQL AGGDLATATA LLDLRHVAGD PALTEELTRS ARQILAPGGN ANEFVTRLAE
ERTRRHDKFG DSLYLLEPNL KHGIGALRDL DTALWAAKAR WSTGVPAELV TFGELTHRQG
RLLEDALDFL LMLRFRLQAQ AKRATDQLSF ESQEAIAAHL HPDATLPEGG IRPAVAPAVE
LLMRQYYLHA RDVVRLTDRV LEMARVPARR KPRVRRVDSV FLLFNGKLAV KDPDMLRHRP
AEMLRLFRVA LDLEVPVYSH TKELVAEHLA AHEGLLSRTR GAAGHFLDAL VDTRDSGQPS
LLEQMNQIGV LASVMPEFAP CTCRVQHDLY HVYTVDQHQL LTVAMLKRLA RGELRDIHPT
ATAAIAQVER PAALFLGTLL HDVGKPLGHG HAEKGAVLTR RIALRLGMDE RDVSTAEFLV
RQHLTMAHLS QRRDLSDPDV ISRFAERVGD AQRLIQLYLC TLCDTAMTAP GNLSAWKEQL
LEELYTRALR FLRGHAGAAE TDVDEQIQRT RERVRVLVAQ PEPGPPAPDA PGEAPSPEPR
NSRDIEAALG GVDERLFASL SPRQLARLIK LGWACVDSGS LAESAVAFYP LKGHSELAVV
AEDRHGLLST IAAALSAARI SVLGAVIGAG HFASTAEGDD SVRTLGLDMF FVRDLAGEAI
PVNDARWAKF NAELLSLLSG EQVREAERRL LARRQQSGLA PRVTPGVGTS IRIDNSASAD
ATVIDVLTQD RVGVLHAISR TLSDFGLDIH LSKVSTQGEQ VADIFYVVST STQRKLEDDS
AIADLELRLQ VALEQV