Gene Hoch_3637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3637 
Symbol 
ID8546027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5003710 
End bp5009253 
Gene Length5544 bp 
Protein Length1847 aa 
Translation table11 
GC content75% 
IMG OID646388306 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003268032 
Protein GI262196823 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00719781 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGACCCA TCGAGTTACG CGGCGCCCGC CAGAACAACC TGTGCAGCGT CGACCTCACC 
CTGGAGCCGG GCACGCTGGT GGCCGTGACC GGGCCCTCGG GCGCGGGCAA GTCCTCGCTG
GCCTTTGGCA CGCTGTACGC CGAGGGTCAG CGCCGCTACG TCGAGAGTTT TAGCGCCTAC
GCGCGGCAGT TCCTCGAGCG CCTGGCGCGG CCCGAGGTCG ACGCGCTCGA CCCGGTGCCC
GCGGCCGTGG CCGTGGATCG CAGCGCGCCG GTGCGCACCA GCCGCTCGAC CGTGGGCACG
ATGACCGAGC TGTGCGACTA CGCGAAGTCG CTGTGGGCGC ACAGCGCGCG CCTCTGCTGC
CCGGGCTGCG GCGCCGAGGT GCAGCCGGAC GAGCCCGGAG CCGCGGCCGA GAGCGTGATC
GAGGCGCATT CGGGCGCGCG CGTGGCCGTG AGCTTCGCGG TCCGCGCCAG CTCGGCCGAC
GCCCTGGCGC TGGCCCGCGA CGAGCTGCGC GCGCAGGGCT ACGGCCGGGT GCTGGCCGGC
GGCGCGCTGG CCAAGCTCGA CGAGCTCGAC GACGACGCGC TGGCCGCGGC CGCGGCCGAG
GACAACGCGC ATCACGGCGG CGGCGCGCTG GCCGTGATCG CCGACCGGAG CACGGCCACG
GCGCGCAATC GCCGGCGCCT GAGCGAGTCG CTGGCCGCCG CCATGCACCG CGGCGGCGGC
CGCGCCCGCG TGCACGTATT CGACGCCGAG GGCGGCCTCG GCGTCACCTT GCGCTTCTCC
GACCGGCTGC ACTGCGCCGA CTGCGAGCGC GACTTCCGCC CGGCCACGCC CGGGCTGTTC
TCGTTCAACA GCCCCATCGG CGCGTGCCCG AGCTGCCGCG GCTTCGGCCG CGTCATCGGC
ATCGATGTCC GCAAGGTCCT GCCCGATCCT TCGCTGTCTC TGAGCGCGGG CGCCATCCGC
CCGTGGCGCG GCAAGAAGTA CGCGTGGGAG CGGCGCGAGC TGGCCAGGCA CGCCAAGCGC
GCCGGCATCC CGTGGTCCGC GCCGGTGAGC GAGCTGAGCG CCGAGCAGCT CGCCTGGCTG
GTCGAGGGCG AGCCCGGCGG CTACGAGGGC GGCGGCTGGT GGGGCCTGCG CGGCTGGTTC
TCGTGGCTCG AGAGCAAGAC CTACAAGCTG CACGTGCGGG TGCTCCTGGC CCGCTATCGC
GCCTACGACG AGTGCCCCGC GTGCGCGGGC GCGCGGCTGC GCCCCGAGGC GCTGTGGTGG
CGCGTGCGCG GGCTCGATAT GGCAGGCTTT TTGGCGCTGT CGGTGGCCGA TGCGCGGCAG
TTCCTGCTCG ACTTCGATCG CGATGACGAC GGCGACGGCC ACGGCGACGG CGCAGCGGCT
GGCTCTGCGC GCGCTCGCGG CGCCCGCGGT GATGGTCGCG AACGCGATGA CGCCGCCGCG
CTGCTGCGCG CCGAGTGTCT GCGCCGGCTG AGCACGCTGG CCGATGTCGG CCTGGCCTAT
CTCACCCTCG ACCGCGCCTC GCGCACGCTC TCGGGCGGCG AGACCCAGCG CGTGGCGCTC
ACGGGCGCTC TGGGCGCGTC GCTGGCCGGC GCGTTGATCG TCATGGACGA GCCCTCGGTG
GGCCTGCACC CGCACGATGT CGGACGCCTG GCCGAGGTCG TCCAGCGCCT GGCCGCGGCC
GACAACACCG TGCTGGTGGT CGAACACGAT TGGGCATTGA TCCAGCGCGC CGACCGCGTG
GTCGAGCTCG GACCCGGCGC CGGTCGCGAG GGCGGCCGCG TGGTCTTCGA CGGCGCCCCC
GAGGAGCTGC TGAGCAGTGA CACCGCCACC GGCCGCGCCC ACCGCGGCGC GCGCGCGGCC
GCCTTGGTGC GGCGCGAGCC GAGCGCGTGG ATCGAGCTGC GCGCGGCCAC CGGCCACAAC
CTGCGCGGCG TCGATGTCGC CATCCCGCGC GGGCTGCTCA CCTGCGTCAC CGGCGTGAGC
GGCTCGGGCA AGAGCTCGCT GATCCTGGGC ACGCTGGCCC CGGCCGTGAT CGCGGCCCTG
GGCGGCGAGG TCGAAGAGCC CGCCCTGCCC CACGCCGCGC TCGCGGGCGC CGACGCGCTG
GCCGACGCCG TGGTCGTCGA TCAATCGCCG CTGGGGCGTA CGGCGCGCGG CAACCCGGCG
ACTTACGTGA AGGCCTGGGA CTGGATCCGG GGCGCGCTGG CCAAGACCGA GATGGCCGCG
GCCCGGGGGC TGAGCGCGGG CGCGTTCTCG TTCAATGTGC CCGGCGGCCG CTGCGAGTCG
TGCAAGGGCG AAGGCGCCGA GACCGTGGAG ATGCAGTTTC TGGCCGATGT CTCGTTCTCG
TGCCCGGACT GCGGCGGCAA GCGCTTCGTC GGCCCGGTGC TCGAGGTCCG CTACCAGGGC
AAGAACGTGG TCGACATCCT CGAGATGAGC GTCGATGAGG CCCTGGCCTG CTTCTCCTCG
CGCACGCTCA CGCGCCGCCT GCAGCCGGTG GTCGATGTCG GCCTGGGCTA TCTGCGCCTG
GGCCAGCCGC TCAACACGCT CTCGGGCGGC GAGGCGCAGC GGCTCAAGCT GGCCGAGGCG
CTAGCGCGCA CCAAGGCCGG CGGGCTGGTC ATCCTCGACG AGCCCACGGC CGGGCTGCAC
GCCCAGGACG TGGTGCCGCT GCGGCGCTCG CTCGAGGCGC TGGTGGCGCG CGGCGACACC
GTGGTCGTGG TCGAGCACGA CATGGCGCTG GCCGCGCACG CCGATTGGAT CGTCGACCTC
GGGCCCGGCG CCGGCGCCCA CGGCGGCACC ATCGTCGCGA GCGGGACCCC CGAGCAGGTG
GCCGCGGCGC AGGGCTCGAG CACGGCGCCG CACTTGGCCG CGGCGCTGGC CGCACACAGT
TCGTCCACGC GGACGAAATC GTCCCCGGCC GGCCGCGCGT CGGCGTCGGT GGGCGCGAGT
CGCGCGGCGC AGCCCGGGCC GGTGTGGCAG CGCGCGCCCA TCGCGGGCGC CGATATCGAG
ATTCTCGGCG CCCGCGAGCA CAACCTGCGC GATCTCAGCC TGCGGCTGCC GCGCGAGCAG
CTCGTCGTGG TCACCGGGCC CAGCGGCAGC GGCAAGAGCA CCCTGGCCTT CGACGTGCTC
TACGCCGAGG GCCAGCGCCG CTATCTCGAG ACCCTGTCGC CGTACGCGCG CCAGTACATG
CCGCAGTTGC CGCGGCCCGC GGTCGATCGC GTGCTCGGCG TGCCGCCCAG CGTGGCCCTC
GAGCAGCGCG TGCACCGCGG CGGCGCGGGC TCGACCGTGG CCACCATCAC CGAGGTCGCC
CACTACCTGC GGGTGATGTA CGCGCGCGCC GGGCTGCTGC ACTGTCCCGA CTGTGAGGTG
GCCATCGCGC CGCGCGCGCC CGAGCTGCTG GCGCGCGATC TCGCGGCGCT GGCCCGGCCC
GACGGCGACG GCGACGGCGA CGGCGACGCC TGGCTGGTGA TGGCGCCCGT GGTGCGCGGC
CGCAAGGGCC TGCACCGCGA GCTGCTGGCG CGGGCGCGCG AAGACGGCAT CGAGCGCGCG
CGCATCGACG GCGCGTTCAC GGCGCTGCGC GCGGGCATGA AGCTCGACCG CTACCGCGAG
CACGATGTCG AGCTGGTCAT CGCCGAGTTG CCGGCCGAGG ACGCCGCCAT GCCGGCTGCC
CTGGGTCGCG CCGCGGCGCT CAGCGGCGGC AGCGTGCGCG TGCGCCGCGG CCAGCAGGAG
CTGCTGCTGT CGACCCAGCG CGCGTGCCCC TCGTGCGGCA CCGGCTTCCC CGAGCTCGAC
CCGCGCATGT TCTCCTTTCA CACCCGCCAG GGCGCCTGCC CCGCGTGCGA GGGCCGCGGC
GTCATCGAGC CGGCCACGCG CCGCAAGCGC GGCAAACGCG CGGCCGCCGA GGCGCCGCCC
AAGACCTGCC CGGACTGCGC CGGCACCCGG CTCTCGCCCC TGGCCCGGGC CGTGACCGTG
GCCGGCTGGT CGATCGCCGA GCTCTTCGGC CGCAGCGTGC TCGAGGCCGG CCGCGCGCTC
GCCGACATGC AGCTCGACGG TCGCGACGCC ACCATCGCCG CCGTGCCCCT GGCCGAGGCC
CGCGCCCGGC TGTCGTTTCT GGCCTCGGTG GGCGTCGGCT ACCTCGAGCT CGACCGTCCC
GCGGCCACGC TCTCGGGCGG CGAGACCCAG CGCGTGCGCC TGGCCGCCCA GCTCGGCAGC
GGCCTCACCG GCATCCTGTA TGTGCTCGAC GAGCCCACCA TCGGCCTGCA CCCGCGCGAC
ACCGGCGTGC TGCTGGACGC GATGCGCGCG CTGGTCGAGC GCGGCAACTC GCTGGTGGTG
GTCGAGCACG ATCCCGACAC CATCCGCGCC GCCGACTTTC TCGTCGACAT CGGCCCGGGC
GGCGGCCACC ACGGCGGCCG CCTGCTGGCC TGCGGCGCCG CCGCCGAGGT GCTGGCCGAG
GACCGCGCGC CGACCGCGGC CGCGCTGCGC CGCCCGCCGC CCGTGCCGGC CCAGCGGCGC
GTGCCCGACG ACGCGGCCCA GCTCGAGCTG CGCGGCGCCC GGCGTCACAA CCTGCGCGAT
CTCGACCTGC GCCTGCCGCT CGGCTGCCTG GTCGCGGTCA CCGGCGTGAG CGGCTCGGGC
AAGTCCACCC TGATCCGCGA GGTGCTGCTC GAGGCCGTGG GCGACGCCCT GGCCCGGCCC
GCGCGCACTC CACCCAGGAG CCAGCGCGCC TACCGCGAGC TGCGTGGCGC CGAGGCGCTG
CGGCGCGCGG TGGAGATCGA CCAGAGCCCC ATCGGCCGCA CCTCGCGCTC GGTGCCGGCG
ACCTATGTCG GCGTGTGGAA CCACATCCGC GCCCTGCTCG CGAACACGCC CGAGGCGCGC
GCGCGCGGCT ACGGGGCCGC GCGCTTCTCA TTCAACACCG CCGAGGGCCG CTGCCCCACC
TGCGAGGGCC AGGGCCGGCT CAAGGCCGAG ATGGCGTTTT TGCCCGACGT CGAGATGGAC
TGCGAGGCCT GCGCCGGCAT GCGCTTCGAC CCCGACACCC TCGACATCAC CTGGCGCGGC
CGCAACGCCG GCGAGATCCT GGCGCTCGAG ATCGGCGAGG CCGCCGAGGT CTTCGCCTCG
GTGTCCAAGG TCGCGCGGCC CCTGGCCCTG CTCGACGAGC TCGGCCTCGG CTACCTCAAG
CTGGGCCAGC CCTCGAGCAC GCTCTCGGGC GGCGAGGCCC AGCGCCTCAA GCTGGTCTCC
GAGCTGGGCG CGCGCAGCCC GGGCGGCACC CTCTATGTCA TGGACGAGCC CACCACCGGC
CTGCACCGCG ACGATGTCGT CCGCCTGCTG GCCTTGCTCG ACCGATTGGT CGAACGCGGC
GACACCGTGG TGGTGATCGA ACATCACACC GACGTGATGC TGGCCGCCGA CTGGATCGTC
GATCTCGGCC CCGAGGGCGG CGCCGAGGGC GGCCGCGTGC TGGTGAGCGG CCCGCCCGAG
GACGTTGCCG CCTGCGCCGA GAGCCACACC GGCGCCGTGC TGGCCGCCGA ACTGCAGCGC
AGTGCGCACG GATCTGACAC ATGA
 
Protein sequence
MRPIELRGAR QNNLCSVDLT LEPGTLVAVT GPSGAGKSSL AFGTLYAEGQ RRYVESFSAY 
ARQFLERLAR PEVDALDPVP AAVAVDRSAP VRTSRSTVGT MTELCDYAKS LWAHSARLCC
PGCGAEVQPD EPGAAAESVI EAHSGARVAV SFAVRASSAD ALALARDELR AQGYGRVLAG
GALAKLDELD DDALAAAAAE DNAHHGGGAL AVIADRSTAT ARNRRRLSES LAAAMHRGGG
RARVHVFDAE GGLGVTLRFS DRLHCADCER DFRPATPGLF SFNSPIGACP SCRGFGRVIG
IDVRKVLPDP SLSLSAGAIR PWRGKKYAWE RRELARHAKR AGIPWSAPVS ELSAEQLAWL
VEGEPGGYEG GGWWGLRGWF SWLESKTYKL HVRVLLARYR AYDECPACAG ARLRPEALWW
RVRGLDMAGF LALSVADARQ FLLDFDRDDD GDGHGDGAAA GSARARGARG DGRERDDAAA
LLRAECLRRL STLADVGLAY LTLDRASRTL SGGETQRVAL TGALGASLAG ALIVMDEPSV
GLHPHDVGRL AEVVQRLAAA DNTVLVVEHD WALIQRADRV VELGPGAGRE GGRVVFDGAP
EELLSSDTAT GRAHRGARAA ALVRREPSAW IELRAATGHN LRGVDVAIPR GLLTCVTGVS
GSGKSSLILG TLAPAVIAAL GGEVEEPALP HAALAGADAL ADAVVVDQSP LGRTARGNPA
TYVKAWDWIR GALAKTEMAA ARGLSAGAFS FNVPGGRCES CKGEGAETVE MQFLADVSFS
CPDCGGKRFV GPVLEVRYQG KNVVDILEMS VDEALACFSS RTLTRRLQPV VDVGLGYLRL
GQPLNTLSGG EAQRLKLAEA LARTKAGGLV ILDEPTAGLH AQDVVPLRRS LEALVARGDT
VVVVEHDMAL AAHADWIVDL GPGAGAHGGT IVASGTPEQV AAAQGSSTAP HLAAALAAHS
SSTRTKSSPA GRASASVGAS RAAQPGPVWQ RAPIAGADIE ILGAREHNLR DLSLRLPREQ
LVVVTGPSGS GKSTLAFDVL YAEGQRRYLE TLSPYARQYM PQLPRPAVDR VLGVPPSVAL
EQRVHRGGAG STVATITEVA HYLRVMYARA GLLHCPDCEV AIAPRAPELL ARDLAALARP
DGDGDGDGDA WLVMAPVVRG RKGLHRELLA RAREDGIERA RIDGAFTALR AGMKLDRYRE
HDVELVIAEL PAEDAAMPAA LGRAAALSGG SVRVRRGQQE LLLSTQRACP SCGTGFPELD
PRMFSFHTRQ GACPACEGRG VIEPATRRKR GKRAAAEAPP KTCPDCAGTR LSPLARAVTV
AGWSIAELFG RSVLEAGRAL ADMQLDGRDA TIAAVPLAEA RARLSFLASV GVGYLELDRP
AATLSGGETQ RVRLAAQLGS GLTGILYVLD EPTIGLHPRD TGVLLDAMRA LVERGNSLVV
VEHDPDTIRA ADFLVDIGPG GGHHGGRLLA CGAAAEVLAE DRAPTAAALR RPPPVPAQRR
VPDDAAQLEL RGARRHNLRD LDLRLPLGCL VAVTGVSGSG KSTLIREVLL EAVGDALARP
ARTPPRSQRA YRELRGAEAL RRAVEIDQSP IGRTSRSVPA TYVGVWNHIR ALLANTPEAR
ARGYGAARFS FNTAEGRCPT CEGQGRLKAE MAFLPDVEMD CEACAGMRFD PDTLDITWRG
RNAGEILALE IGEAAEVFAS VSKVARPLAL LDELGLGYLK LGQPSSTLSG GEAQRLKLVS
ELGARSPGGT LYVMDEPTTG LHRDDVVRLL ALLDRLVERG DTVVVIEHHT DVMLAADWIV
DLGPEGGAEG GRVLVSGPPE DVAACAESHT GAVLAAELQR SAHGSDT