Gene Hoch_2099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2099 
Symbol 
ID8544485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2905878 
End bp2909057 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content72% 
IMG OID646386806 
Producthypothetical protein 
Protein accessionYP_003266537 
Protein GI262195328 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.094651 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGGAT ACACGTGGGC CTGCGCTGCG GCGCTGTGTG CGCTGGGCGC GAACGCGCTG 
CCCGGCTGCG GCGATAACCT CGGCGCCGAA TGCGAACCTG GCGCCCCCGG CTGCGACGCC
GGCGCGCCCC CGGCGCCCAG CCTCGACCAG GTGCTGCCGG CCGTGCCCGC GCCCACGGGC
GAGCCGCAGG GAGCCTGGGC CGGCCGCATC GGGACTTCGG ACAGTCCGGA CATCATCCCC
GGCCCGGGCT CCATCGGCCG CGCGGGCGAC TACGTCGTGC GCAACCAGCG CGCGCGCTTC
GTGGTCCAGG CGCCGGGGCG GGCCATCGGC GTGGTGCCCT ACGGCGGCAA CCTGGTCGAT
ATCGCCGCGC TGGATGATCG CGGCGCGGCG CGGGCCGAGG ATCACTTCGG CGAGCTGTCG
CTGGTGTATC AGCTCGGACG GACCTGCGAG CACACGAGCG TAGAGATCGT CCAAGACGGC
AGCGGCGGCG GCGTGGCCGC ACTGCGCGCC CGCGGACGCG CCGCGGTCAA CGACTACATC
AACCTGCGCG GCGTCGGCTT GTTCGACGTC GCCGAGTCGC TGGACCCCGA GCGCGAGGAC
GAGGTCGCCT GCGCCACCAC CTATCTGCTG GCGCCCGGCA GCGCCCATCT CGAGGTCTGG
TTCACGCTGC TCAATCCCAC CGATACGCCG CTGAGCGCGC CCCTGGGCCT GCTCATCGAC
TCCGGCGCCA GCACCCACGC CTGGTCACCG GGCCCCGGCT TCGAGGGCTC GGTCGGGCTC
GACGAATTGC TGCGCGACGC CGGCACCGAG GTGCCGTATC TGGTGCAACA GGGACCCGGC
GTCGCCTACG GCGTGGCCCC GCGCCACGCC ACCGCGGGCA CGTCCAACGC GACCTTCAGC
GTCACCGGGG TGTCGCTGTT GCTGTTCGGC GCGCGCCGCG CGCTCGAGAT CTTCGACCCA
TCACGAAATT ATCTGAGCCT GGCGCCGCGC ACGGGCGACT CGTGGCGCGT GGACGTCGCC
GTGGGCCGCG ACGCCGCCGA GATCGGTGCC CACCTGCGCG CGGCGGCCGC CAGCGCGGAC
GCGGGAGATG GCATCGCCGA GGTCGAGCTC GCGGCCCAGG TGACCTGGTC CGATGGCAGC
GCAGCCGAGC ACGCGCGTGT CGGCCTGTAT CGCGACGACG ATGACGACGG CGCGATCACG
AGCGACGACG CGCTGGTCAC GTACCTCGAC GCCGACGCAG CCGGAGCCGT GCGCGGCGCC
GTGCCCGCGG GCAACTATCT GGCGCGCGCC GAGGTCTCAG ATCGCGGACG CTCGTCCGTT
ATCACCGTTG CGCTGGCCGC GGACCAACCC CCGGCCGCGC TCAGCTTCCT CCTGCCCGCG
CCCGTGCTCC TCGACTACAC CATCCGCGAC AGCGACGGCG CGGTGATTCC CGCGCGCCTG
TCGATCATCG GCGCGCATCC CGCGGCACCG GACGCGCGCC TGTTCCCGGT CGGCGACCGC
GCGCCCGGCA CCATCACCAC GGTGCACGCG CTGCGCGGCA CCAGCATCGA TCGCGGCGAC
GGCGCCGACC CCGCGCTGCG CCTGCCCGTC GATGCCGACG GGCGCGCTCG CTATCGCGTC
GAAGCATCGC GCGGTACCGA GTGGAGCGCG TTCGGCACCA CCGTCGAGCT CGCGGCCGAC
AGCCCGCCCG CGCCGCTCGA GATCACGCTC GAACGCGTGG TCGACAGCGC GGGCTACGTG
GCCTCCGAAT ATCACGTGCA CCAGCTCGCT TCGACCGATT CGGTGGTGAG CCAGATCGAG
CGCGTCGCGT CGATGGCGGC CGAGGGCGTC GAGCTGTTCG CGGCCACCGA CCACGACGCG
GTCAGCGATC TGCAACCCGT GGTCGAAGCC CTGGGCATAA GCGAGCTGGT GCGCGCCATC
CCGGGCCTGG AGATCACGCC GTTTAGCTAC GGCCACTTCA ACGCCTGGCC GATCGCGCCC
GACGGCACGC CGCGCGGTGG TGCCATCGAC TGGGCGCGTG GCGGGGGTGG CTACGCCATG
CGTCCAGTCG AGATCTACGC CGCGGCCCGC GCCCGCGGGG CCGCGCTGGT GCAGGTCAAT
CATCCGCGCG CCGACATCGG CGGCGGCCGC AGCGACTTCC AGGCGCACTT CGACCGCATC
GGCCTGCGCT TCGACTACGC CCGCGGCGTC ATCGAAAGCG GACAGGGGCC GGTGCCCAAC
GCCTGGCTGC GGCTTCCCGA GGGCGCGCTG TGGAGCGACG ACTTTCAGGC GCTCGAGGTC
TGGAATGGCA TGCAGGTCGC CGATACCAAC GGCGACGGCG TGCGCGAATT TCCCGGCCTC
GACCTGGTGA TGCGCGACTG GTTCAATTTC CTGTCGCTCG GCTTCGACGT GACGCCGCTC
GGCAACTCCG ATACCCACGA GCGCTTTCGC GACGTCGCCG GCATGCCGCG CACCTACGTC
CGCGTGGACG ACGACAGCCC GCAGGCGCTG GCCACGGGCG CGATCGTGGA CGCGGTACTC
GAGACCCTCG GCGGCCGGAT AGCGCGCGAC GTGCTCGTGA GCAACGGGCC CTTCTTGCGG
CTCACGCGCG CCGATGACAG CCAGTCCCGC TCGGTGATCG GGGCCGTGCT CGAGGCGGAC
GGCGATGGCC GCGTGCGCCT CGAGCTAAGC GTCGAAGCGC CGCATTGGGC ACAGTTTGAC
ACCGTCGAGG TGTTTGCCAG CGCCACGCCC GAGGTGCCGC GCCCGGGCGA GCCGCTCGAC
GAGACCGCGC TCGTGCCGCA TCTCTGCTTC ACCGCGCGCC CGCCGGCCGA GCTCGCGGAC
AACGACCCCT GCGCGCTGGC TCGCGGCGGC GCGCAGCCGC TGCAGGTGAA CCAGACAGAG
AGCGCGTATC TCGCGCAGCT TCGCATCGAG GTCGCGGGCG AGAACATCAT CACGCGCCCC
GAGGCGCGCG GCCTCGACGC CTGGCTGGTG GTGCGCGTGC GCGGCAATCG CGGCATCTAT
CCGCTGCTGC TCGGCGGCAT AGTCCAAGAC CAGGAGGCCA TCGACACCTT GCTCGACGGC
GACGCAGACG CCATCGACGC GCTTCTCGAT GGACGCGGAG CCCCGGCAAC CGCGTTCACC
GCGCCCGTAT TCGTGGATTT CGACGGCGGT GGATACCACG CGCCGTTCGG TCCAGAATAG
 
Protein sequence
MRGYTWACAA ALCALGANAL PGCGDNLGAE CEPGAPGCDA GAPPAPSLDQ VLPAVPAPTG 
EPQGAWAGRI GTSDSPDIIP GPGSIGRAGD YVVRNQRARF VVQAPGRAIG VVPYGGNLVD
IAALDDRGAA RAEDHFGELS LVYQLGRTCE HTSVEIVQDG SGGGVAALRA RGRAAVNDYI
NLRGVGLFDV AESLDPERED EVACATTYLL APGSAHLEVW FTLLNPTDTP LSAPLGLLID
SGASTHAWSP GPGFEGSVGL DELLRDAGTE VPYLVQQGPG VAYGVAPRHA TAGTSNATFS
VTGVSLLLFG ARRALEIFDP SRNYLSLAPR TGDSWRVDVA VGRDAAEIGA HLRAAAASAD
AGDGIAEVEL AAQVTWSDGS AAEHARVGLY RDDDDDGAIT SDDALVTYLD ADAAGAVRGA
VPAGNYLARA EVSDRGRSSV ITVALAADQP PAALSFLLPA PVLLDYTIRD SDGAVIPARL
SIIGAHPAAP DARLFPVGDR APGTITTVHA LRGTSIDRGD GADPALRLPV DADGRARYRV
EASRGTEWSA FGTTVELAAD SPPAPLEITL ERVVDSAGYV ASEYHVHQLA STDSVVSQIE
RVASMAAEGV ELFAATDHDA VSDLQPVVEA LGISELVRAI PGLEITPFSY GHFNAWPIAP
DGTPRGGAID WARGGGGYAM RPVEIYAAAR ARGAALVQVN HPRADIGGGR SDFQAHFDRI
GLRFDYARGV IESGQGPVPN AWLRLPEGAL WSDDFQALEV WNGMQVADTN GDGVREFPGL
DLVMRDWFNF LSLGFDVTPL GNSDTHERFR DVAGMPRTYV RVDDDSPQAL ATGAIVDAVL
ETLGGRIARD VLVSNGPFLR LTRADDSQSR SVIGAVLEAD GDGRVRLELS VEAPHWAQFD
TVEVFASATP EVPRPGEPLD ETALVPHLCF TARPPAELAD NDPCALARGG AQPLQVNQTE
SAYLAQLRIE VAGENIITRP EARGLDAWLV VRVRGNRGIY PLLLGGIVQD QEAIDTLLDG
DADAIDALLD GRGAPATAFT APVFVDFDGG GYHAPFGPE