Gene Hoch_3722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3722 
Symbol 
ID8546112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5122635 
End bp5124917 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content72% 
IMG OID646388389 
Productputative transcriptional regulator, Crp/Fnr family 
Protein accessionYP_003268115 
Protein GI262196906 
COG category[T] Signal transduction mechanisms 
COG ID[COG0664] cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.44099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.232019 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTCA GGAATTTGAT CAAAGATCTC CGGCGCAAGC TTCGCAGAGA CGAAGACAAC 
CTCGCTCTGC GCCTCGAGCT GGCGGATCTC TGTCGCGACG ACGGCAACCA CGCCGAGGCG
CTCGAGACCT ATCGCGAGGT AGCCATCAGC GCCTGGCAGG CGGGGCGTCT GCAACAGGCG
CAGGATGCGT GCGAGCGCGC GCTCACCATC GCGCCACAGG ACGTCGAACT CAACGCCCTC
ATCGGCGACA TCGACGCCGC CCTGGGGGCG CCGCCCAACG ACGGCGAATC GCGCAATCCC
GGCCCGAAAC GCCGCGCCAA GGTGTGGCAC ACGGCGGCCG GTCCGGCCAC GCCCGCGCCC
GTCGAGGCCG ACAAGGGCCA GCCGCGGCGC AGCAGCCAAC AACGCGCGGC CGCGCCGGAG
CCGCAGAGGC GCGGCACCTC GGAGCTCACG TACACGCCGA CGCCGCTGCC CGCGCCGCTG
CCGCTGCACG ACGCCGCCGA CGACAGCCTG CTGCTGCATC AGCCCGTGGG CATCAAGGCG
CAGCCCTCGG GCCCCGTGCC CACGGTGTCG ATGCCGGCCG ATATCGACAC CGGCGACGAC
GACGTGCTGG CGCTCGCCGG CACCGGCGAT GAGGTCGGCC AGGGCGGCGC CGTCGTGCCA
TCCTTTCACC CGCCCGGCAT CCAACAGCTC GCGACCACGG TGGACGCGAT GTCGACGCTC
GGCGACAGCG CCTCCTCGTC CTTCGGTGAC GACGACGAGG AGGAAGAGAT CACCGACGTC
CGCGGCCTGG GCGCGGAGAC CGGACTGCAT CAGCTCCCGC AGCAGCATGC CGCTATCCCG
CTTCCCAGCG ACGGCGTGCC CAGCGGCGAT AACCGCGATA GTCAGGATCT CACCGCGCCG
CTCGCGCGCG TCAACGCACG CGTCGAGGCG ACCCCGCAGA GCAAGCCATC GCGCTCGCGC
GTGGTGCGCG ACTCGCGTCC CGTCGCCGTC AGCGGCGAAA TCTCTATCTC GGACAGACAG
TCGGAAGCGG CCCCGGCCGA ACGCGCGCCC GACGCCAGCG CGGAGCCACG AGAGGCCTGG
TACGCGGCCG GAGCCGAAGG CGCTGGCAAC GAGGACGACG AAGAAGAGGT CACCGATACC
GGCGCGACCA CGCAGCTCCG ACAGCAGGAC TACGCCGAGG CCCACGCAGC GCCGCAGCGG
GCGTCCACGC TCGCGGGCGC CGGCGTCGAT GACACCGGCC GCGTCGCCGC GCCGCTCGAC
GACGACGAGG ATCCTTCGCC GACGCTGAGC GACACCCAGC CCGTCGATGT CGAGTCGGTT
GATGAGGACA AGCTCGTTGC CGAGCAGGCG GCGCCACGCG GCCGCCGCCG CCGCCGGGCG
GCCGCGAGCA CCCTGGTCGA CATGGCGCCC CCCGTGCAGC CCGCGATGCC CACGCAGAGC
GCGCGAGCGG CCGCGCTGGC GACCATCTTC CCCTCGTTTC CCGCTTACGT GCTCGACGAG
CTGGCCGAGC GCATCACCCT GCGCGAGTTT CGCGACGGCG AGGAGATCCT GCAGCAAGGG
CAGGAGAGCC GCGCCTGTTA CCTGGTGATG TCCGGGTCTG TGCGGCTGTC GCGGCGCGCG
CGCCCGGGCG TGGGCGACGA GATCGCTGAG ACCGGCCTGC TGACCCGCGG CGCGCTGTTC
GGCGTGCGCT CGCTGCTGCC CGAGCGCACC AGCGTGGCGG CGGCCCACGC GGCCGGCCCG
TGCCGCGTCT ACGAGGTGCC GCGGCGCGCG CTGCGCGAGC TGGCGGCCAT CCACCACACC
CTGGGTCCGC TGCTCGAGGT CTTCTACCGC GAGCACCTCA CCGCCATGCT GCTGCACAGC
GCGCCGTTTC TCAGCTCGCT GCCCGCGGCC TGGCGCGATG GCCTGCAGGG CCGCTTTAGG
CCCCTGCGGC GCGCGGCCGG CGAGTCGATT CTGCGCCAGG GCGAACGCAC CGGTGGCCTG
TATCTCGTGG TGCTCGGCGC CGTCGAGATC GTGCAGCGCC TGTCGCCGAG CCGCGCCAAG
CTGCTGGCCA CCATCGGCGA GGGCTGCTTC TTCAGCGATA TGTCCGTGCT GGCCGGCAAA
GACGAGACCG CGGGCGCCTC GGTGTCGGCC GCCGGCCCGC TGGAATTGGC GGTGCTACCG
GCCGAGGAGT TCAGCCGGGT GCTGGCCGAG GCGCCGGCGC TGTGGCGCGA GCTGTGCGAG
CAAGACCCGG ACAGCGAGCT GCTGCGCTGC TCACTGCTCA CCGGCCGAAC TCGCGCCATC
TGA
 
Protein sequence
MKLRNLIKDL RRKLRRDEDN LALRLELADL CRDDGNHAEA LETYREVAIS AWQAGRLQQA 
QDACERALTI APQDVELNAL IGDIDAALGA PPNDGESRNP GPKRRAKVWH TAAGPATPAP
VEADKGQPRR SSQQRAAAPE PQRRGTSELT YTPTPLPAPL PLHDAADDSL LLHQPVGIKA
QPSGPVPTVS MPADIDTGDD DVLALAGTGD EVGQGGAVVP SFHPPGIQQL ATTVDAMSTL
GDSASSSFGD DDEEEEITDV RGLGAETGLH QLPQQHAAIP LPSDGVPSGD NRDSQDLTAP
LARVNARVEA TPQSKPSRSR VVRDSRPVAV SGEISISDRQ SEAAPAERAP DASAEPREAW
YAAGAEGAGN EDDEEEVTDT GATTQLRQQD YAEAHAAPQR ASTLAGAGVD DTGRVAAPLD
DDEDPSPTLS DTQPVDVESV DEDKLVAEQA APRGRRRRRA AASTLVDMAP PVQPAMPTQS
ARAAALATIF PSFPAYVLDE LAERITLREF RDGEEILQQG QESRACYLVM SGSVRLSRRA
RPGVGDEIAE TGLLTRGALF GVRSLLPERT SVAAAHAAGP CRVYEVPRRA LRELAAIHHT
LGPLLEVFYR EHLTAMLLHS APFLSSLPAA WRDGLQGRFR PLRRAAGESI LRQGERTGGL
YLVVLGAVEI VQRLSPSRAK LLATIGEGCF FSDMSVLAGK DETAGASVSA AGPLELAVLP
AEEFSRVLAE APALWRELCE QDPDSELLRC SLLTGRTRAI