Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3003 |
Symbol | |
ID | 8545391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 4157656 |
End bp | 4160412 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 646387675 |
Product | ABC transporter related protein |
Protein accession | YP_003267403 |
Protein GI | 262196194 |
COG category | [V] Defense mechanisms |
COG ID | [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.102792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.46573 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTAT TCGGCAAGAA CAAGCTGCCC ACGCTCTATC AGACCGAGAG CAGCGAGTGT GCGCTCGCCT GCCTGGCGAT GGTCGCCGGC TATCACGGCC TCGACATCTC CATGCTCGAG CTGCGCGAGC GCTTTCCCAT CTCGATGAAG GGCGCCACCC TGCGCGACGT GGTCGAAGTC GCCAACCAGA TCGGCTTCTC GTCGCGACCG GTGCGCTGTG AACCGGCCGG TCTGCGGCGC ATCGCGCTAC CGGCCCTGCT GCATTGGGAC TTCGAGCACT TCGTGGTGCT CGAGCGCGCC GACAAGCGCG GCTATCGCAT CCACGACCCC GCCATCGGTG TCGTCCATCT ATCCGAAAAC GAGCTATCCG ATCATTTCAC CGGCGTCGCC GTCATCCTGT CACCGACCGA CGACTTTGCC GGCGGCGAAC TCGGCGAGAA ACTATCGCTG TGGCAACTGC TCAAACGCTC GCGCGGGATG GTGCCGTTTG TGGCCCAGGT CCTGTGGCTC ACGGCGTTTC TCGAGCTCTT CGCGCTGCTC GGGCCGCTGT TCCTCAAAGA GGTCATCGAC ACCGGCCTGG CCCATCGCAG CTTTGACCTG ATCACGGCCA TCGCCGTCGG CATCGGCGCC ATCGGCCTGT TCCAGGGGCT GCTGTCGTTC TTGCGCGACT ACGTCATTCT GTATTTCGGC ACGTCCTTCA ACCAGCAGAT GATGAATAAC CTATTCCGTC ATCTGCTGCG ATTGCCCATG CACTTCTACG AGAAGCGCAT CACGGGCGAT CTCATCGACC GCTATCAGTC GACCGACGTC ATCCGGCGGG TGTTCACCAG CAACCTGCCC ACCATCCTGC TCGACGGCCT GGTCACCGTG ATCGCGCTCT CGGCCGTGTT CCTCATCTCG CCCATCCTGG CCGCGATCGC GCTCGCCAGC TTCGCCGTGT ACCTGGGCAT GCGCATCTAC TTCTACAGCT CGATGCGCAC GCTGACCGAG AAGGCCGTGC GGGCTCGCTC CGAGGAAAAC GGCCACGTCA TCGACACGCT GCGCGGCATG CAGCCCATCA AGATCTTCGC CAAAGAGCTC GAGCGGCTCA ACATCTGGGG CAACTTCTAC GCCCGTCTGA TCAACGCCGA AAAAGACGTC GGGGTGCTGG CGGCCACGCA GTCGGGGTTC AAGCTGTTCA TCCTGGGCGT GGACACCGCC CTGTGCGTGT ACTTCGGCGC CAACCTGGTG GCGCAGGGCG AGCTGTCGCT CGGCATCCTG CTGGCGTTCT TCTTCTACAA GGCGCATTTC ACGCAAAAGT CGGTCAACTT CGCCGAGCGC CTCATGGACC TGCGCCTCGT CGCCGTGCAC GTGGACCGAC TGTCCGATAT CGCGCTCAGC GAACCCGAGC AACAGGTCCA GGACAAACAG CCGGTCACGC GCGAGGCGTT CGCCGACTTT CGCGTGGCGT TTGCCAATGT CGGCTTTCGC TACGCGCCGC TCGAGCCCGA CGTCGTGCAG GGCGCCTCAT GCGAGCTCCG GCGCGGCGAG TTCGTCGCGC TGGTCGGCCC ATCGGGCGGC GGCAAGACCA CGCTCTTCAA GCTGCTGCTC GGCCTACTGC AGCCGAGCGA GGGCCATATC GAGTTCAACG GCACACCGCT GAGCGAGCTC GACATCCGCC AATATCGCCG CCACTTCGGC GTGGTCATGC AGGAAGATCT GCTGCTGACC GGCACGCTGC TCGACAACAT CGCCTTTTTC GAGGCCAGCC CGGACGAGAA CAAGGCCCGT CGTTGCGCCG AGATCGCGCT CATCCTCGAC GAGATCGAGG CCATGCCCAT GAAGCTCAAC ACGCGCATCG GCGACCTCGG CTCGGCGCTC TCGGGCGGCC AGAAGCAGCG CATCCTGCTC GCGCGCGCCC TCTACGGCGA GCCCGAAGTG ATGTTGCTCG ACGAGGGCAC CGCCAACCTC GATCAGGCCG TCGAGCGCCA GCTCCTCGAC AACCTCACCG CGCTGGGCAT CACGTGCATA TCGATCGCCC ACCGACCGGA GACCATCTAT CGAGCGACCA AGGTGTTGCG GCTGGAGAAT GGTACGCTCA CCGACGTCAC AGATGCCTAC GCCGATGCGC AGACGCCACC ACAGAGAGAG GAACACGAGA TGAAGGTTCG CTACCTGGAG CCGCGCCCCA AAAACCACTC CAGCAACGTG GCCCTGCTGA TGAAGCTCTG GGACACACCG CTCACGGGCG AGCAGCAGGA GCGGCTCGCC CAGACCGCGC CCGTGAAGCA GCAGCGCTCG GAGTTTGGCA ACCTCAACAA CGAGGGCACG CCCTACCCGT CCCAGAGCTG CCTGGTCGCT CGCTTCCACC CCGATTTTGA GTCGGTGATC GAACCCGGGG TCAAGGAGCT GCTGGCGGTG GTGGCCATCG ACCTCGATCT GGTGACGTAC ACGAGCTGTC AGGGGCACCG CTACGAGAAT CCCGACACCC CGACCGACGA GCGCCACGTG GGTATCATCG CCCGCAGCGC CGAGGAGCAT CAGCGCGTGC GTGGGCTGTT CGAGGACGTC GCCCGTGAGC TCAACCCGGG CTTGGCCGAC AGCGCGGTCG AAATCGCCAT CATGGACCAT ACCGTACGCG ACGGCGACAC CATCTACCCG GCGCTGGATC TCTACCTCAG CCAGCGCGAG GGCCACTCTC TCGAGTCCTA TTTCGCAGAA CTCGACCAGG CGTCCGACAC GCTCATCACC GCGCTCCGCA GCAGGGCCGA GGCGTAG
|
Protein sequence | MSLFGKNKLP TLYQTESSEC ALACLAMVAG YHGLDISMLE LRERFPISMK GATLRDVVEV ANQIGFSSRP VRCEPAGLRR IALPALLHWD FEHFVVLERA DKRGYRIHDP AIGVVHLSEN ELSDHFTGVA VILSPTDDFA GGELGEKLSL WQLLKRSRGM VPFVAQVLWL TAFLELFALL GPLFLKEVID TGLAHRSFDL ITAIAVGIGA IGLFQGLLSF LRDYVILYFG TSFNQQMMNN LFRHLLRLPM HFYEKRITGD LIDRYQSTDV IRRVFTSNLP TILLDGLVTV IALSAVFLIS PILAAIALAS FAVYLGMRIY FYSSMRTLTE KAVRARSEEN GHVIDTLRGM QPIKIFAKEL ERLNIWGNFY ARLINAEKDV GVLAATQSGF KLFILGVDTA LCVYFGANLV AQGELSLGIL LAFFFYKAHF TQKSVNFAER LMDLRLVAVH VDRLSDIALS EPEQQVQDKQ PVTREAFADF RVAFANVGFR YAPLEPDVVQ GASCELRRGE FVALVGPSGG GKTTLFKLLL GLLQPSEGHI EFNGTPLSEL DIRQYRRHFG VVMQEDLLLT GTLLDNIAFF EASPDENKAR RCAEIALILD EIEAMPMKLN TRIGDLGSAL SGGQKQRILL ARALYGEPEV MLLDEGTANL DQAVERQLLD NLTALGITCI SIAHRPETIY RATKVLRLEN GTLTDVTDAY ADAQTPPQRE EHEMKVRYLE PRPKNHSSNV ALLMKLWDTP LTGEQQERLA QTAPVKQQRS EFGNLNNEGT PYPSQSCLVA RFHPDFESVI EPGVKELLAV VAIDLDLVTY TSCQGHRYEN PDTPTDERHV GIIARSAEEH QRVRGLFEDV ARELNPGLAD SAVEIAIMDH TVRDGDTIYP ALDLYLSQRE GHSLESYFAE LDQASDTLIT ALRSRAEA
|
| |