Gene Hoch_3003 details


Gene Information

Locus tag: Hoch_3003
Symbol:
ID: 8545391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4157656
End bp: 4160412
Gene Length: 2757 bp
Protein Length: 918 aa
Translation table: 11
GC content: 64%
IMG OID: 646387675
Product: ABC transporter related protein
Protein accession: YP_003267403
Protein GI: 262196194
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID:
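
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4157656
END_BP = 4160412
GENE_LENGTH_BP = 2757
PROTEIN_LENGTH_AA = 918


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4160412 - 4157656 + 1 gives 2757 bp, and 2757 / 3 - 1 gives 918 aa, which agrees with the listed values.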


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.102792
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.46573
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCTAT TCGGCAAGAA CAAGCTGCCC ACGCTCTATC AGACCGAGAG CAGCGAGTGT 
GCGCTCGCCT GCCTGGCGAT GGTCGCCGGC TATCACGGCC TCGACATCTC CATGCTCGAG
CTGCGCGAGC GCTTTCCCAT CTCGATGAAG GGCGCCACCC TGCGCGACGT GGTCGAAGTC
GCCAACCAGA TCGGCTTCTC GTCGCGACCG GTGCGCTGTG AACCGGCCGG TCTGCGGCGC
ATCGCGCTAC CGGCCCTGCT GCATTGGGAC TTCGAGCACT TCGTGGTGCT CGAGCGCGCC
GACAAGCGCG GCTATCGCAT CCACGACCCC GCCATCGGTG TCGTCCATCT ATCCGAAAAC
GAGCTATCCG ATCATTTCAC CGGCGTCGCC GTCATCCTGT CACCGACCGA CGACTTTGCC
GGCGGCGAAC TCGGCGAGAA ACTATCGCTG TGGCAACTGC TCAAACGCTC GCGCGGGATG
GTGCCGTTTG TGGCCCAGGT CCTGTGGCTC ACGGCGTTTC TCGAGCTCTT CGCGCTGCTC
GGGCCGCTGT TCCTCAAAGA GGTCATCGAC ACCGGCCTGG CCCATCGCAG CTTTGACCTG
ATCACGGCCA TCGCCGTCGG CATCGGCGCC ATCGGCCTGT TCCAGGGGCT GCTGTCGTTC
TTGCGCGACT ACGTCATTCT GTATTTCGGC ACGTCCTTCA ACCAGCAGAT GATGAATAAC
CTATTCCGTC ATCTGCTGCG ATTGCCCATG CACTTCTACG AGAAGCGCAT CACGGGCGAT
CTCATCGACC GCTATCAGTC GACCGACGTC ATCCGGCGGG TGTTCACCAG CAACCTGCCC
ACCATCCTGC TCGACGGCCT GGTCACCGTG ATCGCGCTCT CGGCCGTGTT CCTCATCTCG
CCCATCCTGG CCGCGATCGC GCTCGCCAGC TTCGCCGTGT ACCTGGGCAT GCGCATCTAC
TTCTACAGCT CGATGCGCAC GCTGACCGAG AAGGCCGTGC GGGCTCGCTC CGAGGAAAAC
GGCCACGTCA TCGACACGCT GCGCGGCATG CAGCCCATCA AGATCTTCGC CAAAGAGCTC
GAGCGGCTCA ACATCTGGGG CAACTTCTAC GCCCGTCTGA TCAACGCCGA AAAAGACGTC
GGGGTGCTGG CGGCCACGCA GTCGGGGTTC AAGCTGTTCA TCCTGGGCGT GGACACCGCC
CTGTGCGTGT ACTTCGGCGC CAACCTGGTG GCGCAGGGCG AGCTGTCGCT CGGCATCCTG
CTGGCGTTCT TCTTCTACAA GGCGCATTTC ACGCAAAAGT CGGTCAACTT CGCCGAGCGC
CTCATGGACC TGCGCCTCGT CGCCGTGCAC GTGGACCGAC TGTCCGATAT CGCGCTCAGC
GAACCCGAGC AACAGGTCCA GGACAAACAG CCGGTCACGC GCGAGGCGTT CGCCGACTTT
CGCGTGGCGT TTGCCAATGT CGGCTTTCGC TACGCGCCGC TCGAGCCCGA CGTCGTGCAG
GGCGCCTCAT GCGAGCTCCG GCGCGGCGAG TTCGTCGCGC TGGTCGGCCC ATCGGGCGGC
GGCAAGACCA CGCTCTTCAA GCTGCTGCTC GGCCTACTGC AGCCGAGCGA GGGCCATATC
GAGTTCAACG GCACACCGCT GAGCGAGCTC GACATCCGCC AATATCGCCG CCACTTCGGC
GTGGTCATGC AGGAAGATCT GCTGCTGACC GGCACGCTGC TCGACAACAT CGCCTTTTTC
GAGGCCAGCC CGGACGAGAA CAAGGCCCGT CGTTGCGCCG AGATCGCGCT CATCCTCGAC
GAGATCGAGG CCATGCCCAT GAAGCTCAAC ACGCGCATCG GCGACCTCGG CTCGGCGCTC
TCGGGCGGCC AGAAGCAGCG CATCCTGCTC GCGCGCGCCC TCTACGGCGA GCCCGAAGTG
ATGTTGCTCG ACGAGGGCAC CGCCAACCTC GATCAGGCCG TCGAGCGCCA GCTCCTCGAC
AACCTCACCG CGCTGGGCAT CACGTGCATA TCGATCGCCC ACCGACCGGA GACCATCTAT
CGAGCGACCA AGGTGTTGCG GCTGGAGAAT GGTACGCTCA CCGACGTCAC AGATGCCTAC
GCCGATGCGC AGACGCCACC ACAGAGAGAG GAACACGAGA TGAAGGTTCG CTACCTGGAG
CCGCGCCCCA AAAACCACTC CAGCAACGTG GCCCTGCTGA TGAAGCTCTG GGACACACCG
CTCACGGGCG AGCAGCAGGA GCGGCTCGCC CAGACCGCGC CCGTGAAGCA GCAGCGCTCG
GAGTTTGGCA ACCTCAACAA CGAGGGCACG CCCTACCCGT CCCAGAGCTG CCTGGTCGCT
CGCTTCCACC CCGATTTTGA GTCGGTGATC GAACCCGGGG TCAAGGAGCT GCTGGCGGTG
GTGGCCATCG ACCTCGATCT GGTGACGTAC ACGAGCTGTC AGGGGCACCG CTACGAGAAT
CCCGACACCC CGACCGACGA GCGCCACGTG GGTATCATCG CCCGCAGCGC CGAGGAGCAT
CAGCGCGTGC GTGGGCTGTT CGAGGACGTC GCCCGTGAGC TCAACCCGGG CTTGGCCGAC
AGCGCGGTCG AAATCGCCAT CATGGACCAT ACCGTACGCG ACGGCGACAC CATCTACCCG
GCGCTGGATC TCTACCTCAG CCAGCGCGAG GGCCACTCTC TCGAGTCCTA TTTCGCAGAA
CTCGACCAGG CGTCCGACAC GCTCATCACC GCGCTCCGCA GCAGGGCCGA GGCGTAG
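
The gene sequence above is printed in blocks of ten bases, so it needs to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch under that assumption. `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C, ignoring whitespace."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = (
    "ATGTCGCTAT TCGGCAAGAA CAAGCTGCCC ACGCTCTATC AGACCGAGAG CAGCGAGTGT"
)
print(len(clean_sequence(first_line)))  # 60 bases on one full line
print(f"{gc_fraction(first_line):.0%}")
```

Passing the complete sequence text to the same functions would give values that can be compared against the 2757 bp and 64% listed in the record.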
 
Protein sequence
MSLFGKNKLP TLYQTESSEC ALACLAMVAG YHGLDISMLE LRERFPISMK GATLRDVVEV 
ANQIGFSSRP VRCEPAGLRR IALPALLHWD FEHFVVLERA DKRGYRIHDP AIGVVHLSEN
ELSDHFTGVA VILSPTDDFA GGELGEKLSL WQLLKRSRGM VPFVAQVLWL TAFLELFALL
GPLFLKEVID TGLAHRSFDL ITAIAVGIGA IGLFQGLLSF LRDYVILYFG TSFNQQMMNN
LFRHLLRLPM HFYEKRITGD LIDRYQSTDV IRRVFTSNLP TILLDGLVTV IALSAVFLIS
PILAAIALAS FAVYLGMRIY FYSSMRTLTE KAVRARSEEN GHVIDTLRGM QPIKIFAKEL
ERLNIWGNFY ARLINAEKDV GVLAATQSGF KLFILGVDTA LCVYFGANLV AQGELSLGIL
LAFFFYKAHF TQKSVNFAER LMDLRLVAVH VDRLSDIALS EPEQQVQDKQ PVTREAFADF
RVAFANVGFR YAPLEPDVVQ GASCELRRGE FVALVGPSGG GKTTLFKLLL GLLQPSEGHI
EFNGTPLSEL DIRQYRRHFG VVMQEDLLLT GTLLDNIAFF EASPDENKAR RCAEIALILD
EIEAMPMKLN TRIGDLGSAL SGGQKQRILL ARALYGEPEV MLLDEGTANL DQAVERQLLD
NLTALGITCI SIAHRPETIY RATKVLRLEN GTLTDVTDAY ADAQTPPQRE EHEMKVRYLE
PRPKNHSSNV ALLMKLWDTP LTGEQQERLA QTAPVKQQRS EFGNLNNEGT PYPSQSCLVA
RFHPDFESVI EPGVKELLAV VAIDLDLVTY TSCQGHRYEN PDTPTDERHV GIIARSAEEH
QRVRGLFEDV ARELNPGLAD SAVEIAIMDH TVRDGDTIYP ALDLYLSQRE GHSLESYFAE
LDQASDTLIT ALRSRAEA
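
The protein sequence above should correspond to translating the gene sequence with translation table 11. The sketch below shows one way to check this without external libraries. It assumes the standard NCBI codon ordering (bases in the order T, C, A, G), whose amino acid assignments table 11 shares with the standard code. Table 11 differs mainly in its permitted start codons, which are not handled here because this gene starts with ATG. The function name `translate` is a hypothetical helper, not a library call.

```python
# Codon table built from the NCBI ordering (T, C, A, G); the amino acid
# assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence; matches the first 20 residues listed.
first_line = (
    "ATGTCGCTAT TCGGCAAGAA CAAGCTGCCC ACGCTCTATC AGACCGAGAG CAGCGAGTGT"
)
print(translate(first_line))  # MSLFGKNKLPTLYQTESSEC
```

Applying `translate` to the full gene sequence would be expected to reproduce the 918-residue protein, with the final TAG codon read as the stop.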