Gene Hoch_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3337 
Symbol 
ID8545725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4616472 
End bp4619498 
Gene Length3027 bp 
Protein Length1008 aa 
Translation table11 
GC content70% 
IMG OID646388004 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003267732 
Protein GI262196523 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.690978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.507312 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGCA CCGCTCAGCG CCGCAACGCC ATGCCCGACA CCATCGTGAT CAAGGGCGCG 
CGCGAACACA ACCTCGACGT CCCCCTGCTC GAGCTGCCCA AGCACGCCCT CATCGTGGTC
ACCGGCGTGA GCGGCTCGGG CAAGTCCTCG CTGGCCTTCG ACACCCTGTA CGCCGAGGGC
CAGCGCCGCT ACGTCGAGTC GCTCTCGGCC TACGCGCGCC AGTTCCTCGG GCAGATGGAA
AAACCCAAGT ACGACCACAT CCGCGGCCTG TCGCCGACCA TCGCCATCCA GCAGAAGGCG
GCCTCGTCCA ACCCGCGCTC GACCGTCGGC ACGGTCACCG AGATCTACGA CTACCTGCGC
GTGCTCTACG CGCGCATCGG CGAGCAGCGC TGCCACCAGT GCGGCGGCCC GGTGAGCGCG
CGCTCGGCCG AGGAGATCGT CAACGAGCTG GCCGCGCTGC CCGAGGCCAG CAAGGTCACG
CTGCTGGCGC CCAAGGCCGA GAACCGCAAG GGCGAGTTCC GCGAGCTATT CGCCGAAGCG
CGCAAGGCCG GCTTCGTGCG CGTGCGCATC GACGGCATGG TGGTGCGGCT CGAGGACGTC
ACCGCGCTCG AGAAGCAGAA GAAGCACACC ATCGAGCTGG TGATCGACCG CGTGGTCATA
AAAGACGAGA ACCGCGCCCG GCTCACCGAC TCGGTCGAGA CCTCGCTGCG CGAGGGCGAG
GGCAAGATCA TCTGCCTGGT CGAGGGCGAG CGCACGCCGC GCGCCTACTC GCGCGACAAC
GCCTGCGCGA CCTGCGGCAT CGGCTTCCCC GACCTGGCGC CGCAGTCGTT CTCGTTCAAC
TCGCCGCTGG GCATGTGCGA GGACTGCAAC GGCCTGGGCG AGCGCATGCA GGTCGATCCC
GAGCTGATCA TCCCCGACAC CACGCGCAGC CTGCGCGACG GCGCCATCGC CGCCTGGGGC
GAGAACATCA TCGAGGACAG CGGCTGGACG GCCAAGATCA TCGGCGCCCT GGCCGAGGCC
TACAAGATCG ACCTCGACAA GCCGTGGAAC AAGCTGAGCA AGCGCCAGCG CACGGTGCTG
CTGCACGGCA CCGGCGACCG CCGCGTGCAG GTCACCTGGG ACGGCAGACA CAGCCAGGGC
GCCTGGGACA TGCGCTTCGA GGGCATCATC GGCCAGCTCG AGCGGCGCTG GCGCGAGACC
AGCTCGGAGC GCGCGCGCGC CAGCTACGAG CGCTTCTTCC GCGCCATCGC CTGCGCCACC
TGCGAGGGCT CGCGGCTGCG GCCCGAGTCG CGCGCGGTGC TGGTCGGCGG GCGCAACATC
TCCGAGCTCA CGGCCATGAC CGTGGCCAAC GCCAGCGCGC ACGTGCGCGA GCTCGGGCTG
CGCGGCGCCC AGGCCAAGAT CGCGGTCGAG GTGCTCAAGG AGATCCGCGC CCGGCTGTCG
TTTCTGCTCG ACGTCGGCCT CGACTACCTC ACCCTCGAGC GCAACGCGGC CACGCTCTCG
GGCGGCGAGG CCCAGCGCAT CCGCCTGGCC TCGCAGCTCG GCTCCGAGCT CTCGGGCGTG
CTCTACGTGC TCGACGAGCC CTCGATCGGC CTGCACCAGC GCGACAACGA GCGCCTCATC
AAGACCCTGC GCCGGCTGCG CGACCTGGGC AACACCGTGC TCGTGGTCGA GCACGACGAG
GCCACCATCG AGGCCGCCGA CCACGTGGTC GATTTCGGAC CCGGCGCCGG TCGTCACGGC
GGCCGCGTCA TCGCCCACGG CAGCCCGGCC CAGGTGCGGC GCGCCAAGGA CTCGCTCACC
GGCCGCTACC TGTCCGGCAA AGAGCGCATC GAGATCCCGA GCGAGCGGCG CCCGGCCCAG
GGCTGGATCG AGCTGCGCGG CGCCCGCGAG CACAACCTCA AGGGCGTCGA CGCCGACGTG
CCCCTGGGCG TGCTGGTGGC CATCACCGGC GTGTCCGGCG CCGGCAAGTC GTCGCTGATC
AACGCCACCC TGTACCCGGC CCTGCGCCGC ATCCTGCACG GCGCCACCGG CCACGTCGGC
CCGCACGAGT CGCTGCGCGG CCTCGAGCAG ATCGACAAGG TCATCGTCAT CGACCAAAAA
CCCATCGGCC GCACGCCGCG CTCCAACCCG GCCACCTACA CCAAGTGCTT CGACCTGGTG
CGCGAGGTGT TCGCGAGCAC GCCCGAGGCC CGCGCCTTTG GCTACAAGCC CGGGCGCTTC
TCGTTCAACG TCACCGCCAA GAACGGCGGC GGGCGCTGCG AATCGTGCGA GGGCGCGGGC
GTGCGCGAGG TCGAGATGCA CTTCTTGCCC AACGTCTTCG TCATCTGCGA GGGCTGCCAG
GGCAAGCGCT ACAACGAGGC CACGCTGCGG GTGCAGTTCA AGGGCAAGAC CATCGCCGAC
ATCCTCGAGA CGCCCATCGA CGAGGCCCTG GTGCTGTTCG AGCACCACAA GAAGCTGGCC
CGCATCCTCC AGACCCTGGT CGATGTCGGC CTCGGCTACG TGGCCCTGGG CCAGGCCGCG
ACCACGCTCT CGGGCGGCGA GGCCCAGCGC GTCAAGCTGG CCCGCGAGCT GGCCAAACAG
CAGACCGGGC GCACCCTGTA CCTGCTCGAC GAGCCCACCA CCGGCCTGCA CTTCCACGAC
GTGCGCAAGC TGCTCGATGT TCTCGGACGC TTGGTCGAAA CCGGTAACAC CGTGCTGGTC
ATCGAGCACA ACCTCGACGT CATCAAGACC GCCGACTGGA TCGTCGACCT CGGCCCCGAA
GGCGGCGCCG GAGGCGGCGA GATCATCGCC GTGGGCACGC CCGAGCAGGT GGCCGCGGTC
CCCGCGTCGT TCACCGGCCG CTTCCTGGCC GAGATGCTGC CGGCCGCAGC GGCGCCGGCC
AAGGGGACGA GCAAAACCGC GACCAAGAAG TCGGCGACCA AGAAAACGAC GACCAAGAAA
ACGGCGACCA AGAAAACGGC GACCAAGAAG CGGGCGAAGA CCACCACGGC CCGCCGGGGC
GTAGGTTCCG CGCAGGCCGG CAACTGA
 
Protein sequence
MTRTAQRRNA MPDTIVIKGA REHNLDVPLL ELPKHALIVV TGVSGSGKSS LAFDTLYAEG 
QRRYVESLSA YARQFLGQME KPKYDHIRGL SPTIAIQQKA ASSNPRSTVG TVTEIYDYLR
VLYARIGEQR CHQCGGPVSA RSAEEIVNEL AALPEASKVT LLAPKAENRK GEFRELFAEA
RKAGFVRVRI DGMVVRLEDV TALEKQKKHT IELVIDRVVI KDENRARLTD SVETSLREGE
GKIICLVEGE RTPRAYSRDN ACATCGIGFP DLAPQSFSFN SPLGMCEDCN GLGERMQVDP
ELIIPDTTRS LRDGAIAAWG ENIIEDSGWT AKIIGALAEA YKIDLDKPWN KLSKRQRTVL
LHGTGDRRVQ VTWDGRHSQG AWDMRFEGII GQLERRWRET SSERARASYE RFFRAIACAT
CEGSRLRPES RAVLVGGRNI SELTAMTVAN ASAHVRELGL RGAQAKIAVE VLKEIRARLS
FLLDVGLDYL TLERNAATLS GGEAQRIRLA SQLGSELSGV LYVLDEPSIG LHQRDNERLI
KTLRRLRDLG NTVLVVEHDE ATIEAADHVV DFGPGAGRHG GRVIAHGSPA QVRRAKDSLT
GRYLSGKERI EIPSERRPAQ GWIELRGARE HNLKGVDADV PLGVLVAITG VSGAGKSSLI
NATLYPALRR ILHGATGHVG PHESLRGLEQ IDKVIVIDQK PIGRTPRSNP ATYTKCFDLV
REVFASTPEA RAFGYKPGRF SFNVTAKNGG GRCESCEGAG VREVEMHFLP NVFVICEGCQ
GKRYNEATLR VQFKGKTIAD ILETPIDEAL VLFEHHKKLA RILQTLVDVG LGYVALGQAA
TTLSGGEAQR VKLARELAKQ QTGRTLYLLD EPTTGLHFHD VRKLLDVLGR LVETGNTVLV
IEHNLDVIKT ADWIVDLGPE GGAGGGEIIA VGTPEQVAAV PASFTGRFLA EMLPAAAAPA
KGTSKTATKK SATKKTTTKK TATKKTATKK RAKTTTARRG VGSAQAGN