Gene Hoch_5240 details

Gene Information

Locus tag: Hoch_5240
Symbol:
ID: 8547652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7198115
End bp: 7201318
Gene Length: 3204 bp
Protein Length: 1067 aa
Translation table: 11
GC content: 66%
IMG OID: 646389914
Product: preprotein translocase, SecA subunit
Protein accession: YP_003269618
Protein GI: 262198409
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0653] Preprotein translocase subunit SecA (ATPase, RNA helicase)
TIGRFAM ID: [TIGR00963] preprotein translocase, SecA subunit
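
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 7198115
end_bp = 7201318
gene_length_bp = 3204
protein_length_aa = 1067

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = span // 3
coords_consistent = (
    span == gene_length_bp
    and span % 3 == 0
    and codons - 1 == protein_length_aa
)

print(span, codons - 1, coords_consistent)  # prints: 3204 1067 True
```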


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0916466
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.569995
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATTCA TAGAAAAACT CTTCGGCAGC AAGAACGACC GCGACATCAA GAAGCTACGG 
CCCTTGGTGC AGCGCATCGG CGACCTCGAG CCCGACATGA AGGCCAAGAG CGACGCCGAG
CTCCAGGGCA TGACCGCCGT CTTCAAGGAG CGCCTCGACC AGGGCGCGTC GCTCGACGAC
CTGCTGCCCG AGGCCTTCGC CACCGTGCGC GAGGCCGGCT GGCGCGTGCT GGGCATGCGC
CACTTCGACG TACAGCTCAT CGGCGGCATG ATCCTGCACC GCGGCAAGAT CGCCGAGATG
CGCACCGGTG AGGGCAAGAC CCTGGTCGCC ACCCTGCCCG CCTACCTCAA CGCGCTGCCC
GGCCGCGGCG TCCACGTGGT CACGGTCAAC GACTACCTGG CCCGACGCGA CGCCGCCTGG
ATGGGTCGCC TCTACGGCTT CCTCGGTCTC GAGGTCGGCG TCGTGATCCA CGGCCTCGAC
GATTACGCCC GTCAGCGCCA GTACAACGCC GATATCACCT ACGGTCAGAA TAACGAGTTC
GGCTTCGACT ACCTGCGCGA CAACATGAAG ATGTCGCCCG ACCGCATGGT GCAGCGCCAT
CACGCCTACG CCATCGTCGA CGAGGTCGAC TCGATCCTCA TCGACGAGGC CCGCACCCCG
CTCATCATCT CGGGCCCGGC CGAGCGCTCG GCCGATCTCT ACAAGACCGT CGATCGCGTG
GTGCCCAAGC TCAAGCGGGA TATCGACTTC ACGGTCGACG AAAAAGCCCA CTCGGCGATG
CTCACCGACG CCGGCGTCGA GAAGATCGAA GAGCTGCTCG ACATCGAGAA CCTCTACGAC
CCGGCCAACA TCGCCTACAA CCACCACGTG GCCCAGTCGC TGCGCGCGCA CACGCTGTAC
AAGCGCGACG TCAACTACCT GGTGCAAAAC AGCAAGATCG TCATCGTGGA CGAACACACC
GGCCGCACCA TGCCGGGCCG TCGCTGGTCC GACGGTCTGC ACCAGGCCAT CGAGGCCAAA
GAGGGCGTCA AGATCGAGGA GGAAAACCAG ACCCTGGCGA CGATTACGTT CCAGAACTAT
TTCCGCCTCT ACGACAAGCT CTCGGGCATG ACGGGTACCG CCGACACCGA GGCGCCCGAG
TTCCACCAGA TCTACAAACT CGACGTCACC GTCATCCCGA CCAACAAGCC CATCGCCCGC
GATGACGCTC CCGACCTGGT GTACAAAAAC GAGCGCGGCA AGTTCCACGC AGTGATCGAC
GAGATCAAAG AGGCGCACGA GAAGGGCCAG CCGGTGCTGG TCGGTACGGT CTCGGTCGAG
AAGTCCGAGG TCGTGGCCAA CCAGCTCAAG AAGGCCAAGC TGCCCTTCCA CGTCCTCAAC
GCCAAGCATC ACCAGAGCGA GGCGTCGATC GTCGCGCAGG CGGGCCGCAA GGGCTCGATC
ACCATCTCGA CCAACATGGC CGGTCGCGGT ACCGACATCG TGCTCGGCGG CAACGCCGAG
GCCATGGCCA AGGACGAGCT CGAGCAGGAG CGCGCCGCCT TCGTCGCCGA GCTGGCGGAC
AAGCGCAAAG AGCAGCGCAA GAGCCTGCAA AAGGCCGAGG ACAGCGAGGC CCTGGCCGCG
CTCGACGAGG CCCTCGAGGA CGAGGGCTTC GACGAGGAGG GCCGCCTCGC CGAGCTGCTC
GCCAAGTACG AGAAGCAGTG TTCGGCCGAG CGCGAAGAGG TGCTCGAGGC CGGCGGCCTC
AAGATCGTCG GCACCGAGCG CCACGAGTCG CGGCGCATCG ACAACCAGCT CCGCGGCCGC
GCCGGCCGTC AGGGTGACCC GGGCGCCTCG CGCTTCTACC TGTCGCTCGA GGACGATCTA
CTGCGCATCT TCAATGCCGA CTTCGTCACC CGCTGGATGG AGCGCCTGGG CATGGAAGAG
GACGTGCCCA TCGAGAGCGG CATGGTCACG CGCGCCATCG AGAAGGCGCA GAAGCAGGTC
GAGGGGCGCA ACTTCGACAT GCGCAAAAAC CTGCTCGAGT ACGACGACGT GATGAACCAG
CAGCGCAAGA CCATCTACGG CTTGCGCCGC CAGATTCTCG AAGGTCGCTA CGCGCCCGAG
CTGAGCGACG AGGAGCGCAA GGCCGGCAAG ACGCCCGAGG TCCCGACCGA GAGCGGCGAC
TGGACCGTGG CCAAGCTGGA GAAGGAGACC CAGGAGGAGA TCCAGACCCT GGTCACGCGC
GTCGCCGAGG CCTACAAAGC GCAGCACGAA AAGCTCGCCG AGCAGGAAGA CGAGGACGGC
GACGACCTGC CGCCGCTGTG GCGCGTGCTG CGGCACGAGC TGTGGCGCAC CACGGGCACC
CTGTGCGATG TGGAGCGGCT CTACGGCAAG GCCGGGGCCT CGCTCAGCGA GGAAGAGATC
GCCCGGATGG CGGCCTCCAC CGAGGTGGTG TCGCAGATCG CGGCCTCGCT GGTGCAGCAG
CGCGAGCGCC TCTACGACCT GTGCGATACG GTGATCGGAC AGTTGGTCGA CAGCAAGTGC
CCGCCCGGCA GCAACGACGA CGACTGGGCG CTCGACGAGC TCCAGGACAG CCTGCGCGAG
CACTTCCACA CCGCCGTCGA GGTGCCGCGC AACGCCGCCA GCCAGGAGGA GATCGCCCAG
AAGGTGTGGG GCCAGGTCGA GCGCCGCATC GACGAGCGCA TCGAGGAGCT GGGGCGGCCG
TGGCTGCTGT ACTTCGTGCG CCACTTCTTC CTCGAGGAGA TCGACCAGCA GTGGGTCGAT
CACCTCAAGA CCATGGACCA GCTCCGCGAG GGTATCGGCC TGCGCGGCTA CGGCCAGAAG
GATCCCAAGA AGGAGTACAA GAAAGAGGGC TTCGACCTCT TCGGCGGCAT GATGGAGCGC
ATCCAGAGCA ACGTGTGCTC GAAGATCTTC CGGGTGCAGA TCCGCCGCGA GGAGGACGAG
ATTCCCGAGC TGCAGGCCAA GCAGCGCCGC ACCACGGCCG TGCATCCCAC CGCCGGCACC
GGCGCCGCCG AGCCCAGCAC CGAGGCCGAG GCGAAGTCCT CGACCTACGG CGACGCCGCC
GACGGCGGCA AGGAGCCGGT GCAGAAGCAG CAGACCGTGC GCCGCGACCG CCCCAAGGTC
GGCCGCAACG ACCCCTGCCC CTGCGGCAGC GGCAAGAAGT ACAAGAAGTG CCACGGCCGG
CCGGGCGCTG AAGCCTCGGC CTGA
 
Protein sequence
MGFIEKLFGS KNDRDIKKLR PLVQRIGDLE PDMKAKSDAE LQGMTAVFKE RLDQGASLDD 
LLPEAFATVR EAGWRVLGMR HFDVQLIGGM ILHRGKIAEM RTGEGKTLVA TLPAYLNALP
GRGVHVVTVN DYLARRDAAW MGRLYGFLGL EVGVVIHGLD DYARQRQYNA DITYGQNNEF
GFDYLRDNMK MSPDRMVQRH HAYAIVDEVD SILIDEARTP LIISGPAERS ADLYKTVDRV
VPKLKRDIDF TVDEKAHSAM LTDAGVEKIE ELLDIENLYD PANIAYNHHV AQSLRAHTLY
KRDVNYLVQN SKIVIVDEHT GRTMPGRRWS DGLHQAIEAK EGVKIEEENQ TLATITFQNY
FRLYDKLSGM TGTADTEAPE FHQIYKLDVT VIPTNKPIAR DDAPDLVYKN ERGKFHAVID
EIKEAHEKGQ PVLVGTVSVE KSEVVANQLK KAKLPFHVLN AKHHQSEASI VAQAGRKGSI
TISTNMAGRG TDIVLGGNAE AMAKDELEQE RAAFVAELAD KRKEQRKSLQ KAEDSEALAA
LDEALEDEGF DEEGRLAELL AKYEKQCSAE REEVLEAGGL KIVGTERHES RRIDNQLRGR
AGRQGDPGAS RFYLSLEDDL LRIFNADFVT RWMERLGMEE DVPIESGMVT RAIEKAQKQV
EGRNFDMRKN LLEYDDVMNQ QRKTIYGLRR QILEGRYAPE LSDEERKAGK TPEVPTESGD
WTVAKLEKET QEEIQTLVTR VAEAYKAQHE KLAEQEDEDG DDLPPLWRVL RHELWRTTGT
LCDVERLYGK AGASLSEEEI ARMAASTEVV SQIAASLVQQ RERLYDLCDT VIGQLVDSKC
PPGSNDDDWA LDELQDSLRE HFHTAVEVPR NAASQEEIAQ KVWGQVERRI DERIEELGRP
WLLYFVRHFF LEEIDQQWVD HLKTMDQLRE GIGLRGYGQK DPKKEYKKEG FDLFGGMMER
IQSNVCSKIF RVQIRREEDE IPELQAKQRR TTAVHPTAGT GAAEPSTEAE AKSSTYGDAA
DGGKEPVQKQ QTVRRDRPKV GRNDPCPCGS GKKYKKCHGR PGAEASA
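
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the tool that produced this record. It builds the codon table by hand, on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons (this gene starts with ATG, so that difference does not matter here). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used as input.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks and the last four codons of the gene sequence above.
first_block = "ATGGGATTCA TAGAAAAACT CTTCGGCAGC"
last_block = "GCCTCGGCC TGA"

print(translate(first_block))  # prints: MGFIEKLFGS
print(translate(last_block))   # prints: ASA
```

The two outputs match the start (`MGFIEKLFGS`) and the end (`...AEASA`) of the protein sequence listed above. Translating the full gene sequence would work the same way, since the function strips the spaces and line breaks used in the listing.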