Gene Hoch_3985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3985 
Symbol 
ID8546381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5487497 
End bp5488843 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content64% 
IMG OID646388657 
Productpreprotein translocase, SecY subunit 
Protein accessionYP_003268377 
Protein GI262197168 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00342615 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAGCT CCATCTCAAA CATCGGCAAG ATTCCCGAGC TCAGGAAACG CATCCTGTTC 
ACGCTCGGGA TGCTCGCGGT TTACCGCATC GGCATCTTCG TCACCCTGCC GGGCGTCAAC
CGCGTCGAGA TGCAGAATCT CATCTCGACC GGCGGCACCG GCACCTTCCT GGGACTCTTC
AGTGTGTTCT CGGGCGGGGC CCTGGAGCAG CTGTCGATCT TCGCGCTGGG CATCATGCCC
TACATCAGCG CGTCGATCAT TTTGCAGCTG CTCACGGTGG TAGTGCCAAA GCTCGACCAG
CTCAATAAAG AGGGCGAGCA GGGCCGCAAG AAGATCAACC AGTACACGCG CTACGGCACC
ATCGGCCTGT CGATCGTTCA GGCGGTCTTC CTGTCGTCGT GGCTCGTGGG GCTGGGACGC
GGCGGCGGTG GTTACTCCGC AGGCGAGGTC GTCAACGACC CCGGCATGTG GTTCCACATC
ATGACCGTGT TGACGCTGAC CACGGGGACC GCTTTCCTGA TGTGGCTCGG TGAGCAGATC
ACCGAACGCG GCATCGGCAA CGGCATCTCC CTGCTCATCT TCGCCGGCAT CGTCGCCGGC
ATGCCCGACG CGCTCACGCA GCTCTTCGGA CGCAGCTCCA CGGGCGACTA CGACCTGTTC
ATGCTGCTCG TGCTGCTGCT CATCGCCGTC GGCACGATCG CCACCATCTG CTACTTCGAG
CGCGCCCATC GCCGCATCCC GGTGCAGTAC ACCAAGCGCA TGGTGGGCCG GAAGATGTAC
GCGGGGACGC AGACCCACCT GCCGCTCAAG ATCAACGTCT CGGGCGTCAT CCCGCCGATC
TTCGCCTCGT CGATCCTCAT GTTCCCGGCG CAGATCGCCA ACATGGTCGG CACGCCGTGG
ATGCAGTCGG TGGCCAACGC GCTCAATCCC AACGACTGGC GCTACAACGT CATCTACGTC
GGGCTGATCG TGTTCTTCAC CTTCTTCTAC ACCGCCGTGA CCTTCAACCC GGTGGACGTC
GCCGACAACC TCAAGAAGAG CGGCGGCTTC ATCCCCGGCA TCCGTCCCGG CAAGAAGACG
GCCGAGTACA TCGACTACGT GCTCACCCGC ATCACGGCCG CAGGCGCCGT GTATCTGTCG
GCCGTGTGCC TGGTGCCCGC GCTGCTGCAG AACTGGATGC AGGTGCCGTT TTACTTTGGC
GGCACCGGCC TGCTCATCGT GGTCGGCGTC GCGCTCGACA CCGTCCAGCA GATCGAGAGC
CATCTCATCA CCCGTAACTA CGAGGGATTC ACCGGTCCCA AGGGGCCGCG CATCCGCGGC
CGCGTGACCA GTGGCCGCGG GCGCTGA
 
Protein sequence
MASSISNIGK IPELRKRILF TLGMLAVYRI GIFVTLPGVN RVEMQNLIST GGTGTFLGLF 
SVFSGGALEQ LSIFALGIMP YISASIILQL LTVVVPKLDQ LNKEGEQGRK KINQYTRYGT
IGLSIVQAVF LSSWLVGLGR GGGGYSAGEV VNDPGMWFHI MTVLTLTTGT AFLMWLGEQI
TERGIGNGIS LLIFAGIVAG MPDALTQLFG RSSTGDYDLF MLLVLLLIAV GTIATICYFE
RAHRRIPVQY TKRMVGRKMY AGTQTHLPLK INVSGVIPPI FASSILMFPA QIANMVGTPW
MQSVANALNP NDWRYNVIYV GLIVFFTFFY TAVTFNPVDV ADNLKKSGGF IPGIRPGKKT
AEYIDYVLTR ITAAGAVYLS AVCLVPALLQ NWMQVPFYFG GTGLLIVVGV ALDTVQQIES
HLITRNYEGF TGPKGPRIRG RVTSGRGR