Gene Information |
Locus tag | Hoch_5429 |
Symbol | |
ID | 8547841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 7453095 |
End bp | 7456361 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646390101 |
Product | TonB-dependent receptor |
Protein accession | YP_003269805 |
Protein GI | 262198596 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
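The coordinate and length fields above are consistent with one another, which can be confirmed with a short check. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 7453095
END_BP = 7456361
GENE_LENGTH_BP = 3267
PROTEIN_LENGTH_AA = 1088


def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is
    counted in the gene length but not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


span = feature_length(START_BP, END_BP)
print(span == GENE_LENGTH_BP)
print(expected_protein_length(span) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: the span is 3267 bp, which is 1089 codons, or 1088 residues plus one stop codon.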
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0208974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAA TGCTCACAAG GTATGGCGTG CCCAGCGCTG GCCTCACGAT GTTGCTGATG GCCAGCACGC CGGCCCGGGC TCAGGACGCG CAAGACGCGC AAGCCGACCC GGCGCAGCCC GCGACTCCGG CGCAGCCCGC GGCTCCGGCG CAGCCCGCGG CTCCGGCGCA GCCCGCGGCT CCGGCGCAGC CCGCGGCTCC GGCGGCTCCC GCGCAGCCCG CGGTCCCGGC GGCCCCGGCG CCCGCGCCGG CGCCGGCTCC CGCGCCAGCC CCCGCACCGG CTCCGGCCCC CGCGCCGGCC CCGGCCCCGG CCGCGCAGGC GGACGAGGGC GCCGACCAGG ACTTCGATCC CACGGTCGCG CTCATGGGCA TCGGCGAGGA AGAGGGCGTG GAGGAGGCCG AGGGCGAGGT CATCGTGGTC ACGGGTTCTC GTATCCGCAC CGACCCTCTC GACAAGCAGG CGCCGGTCCT GCAGCTCACC CGCGAGGAGC TCGAGCGCAC CGGTCTCACC TCGGTGGGCG ACATCCTGCA GCGGCTGCCG GTCTCGGGCG GCGCGCTCAA CACCAAGTTC AACTCCAGCG GTAACTTCGG CTTCCCGCCC GATGGCGGCG GTATCGGCGC CGGCGCGGCC GAGGCCGATC TGCGCGGTCT CGGCTCCAAG CGCGTGCTCG TGCTGGTCGA CGGCGTCCGC TGGGTCAACG GCTCTTCGGC CTCGGGTGTG GCGGCTTCGA CCGACCTCAA CACCATCCCG CTCGGCATCA TCGAGCGCAT CGAGGTCCTC GAGGACGGCG CCTCGCCGAT CTACGGCTCG GACGCGATCT CGGGCGTCAT CAACATCATC ACCCGCAAGG ACCTCGACGG CGCCATCGCC AACGCGTACC TGGGTGGCTA CAACCAGGGC GACGGCTTCA CCCAGAAGTA CGACGTCTCC TGGGGCAAGT CGGACGAGAA GATGTCCATC GTGGTCAGCG CCTCGTTCGT CGACCAGGGC CTGGTCCGCG CCGAGGATCG CGAGCTGTCC AAGGACCCGG TGCCCAACGT GCCCAACTGC GGCGCCGGCT GCTCCTCGGG TACGCCCCAG GGCCGCTTCT TCTTCACCGA TCCCAACACC GGTGAGGCGC GCGATCTGAC CATCAACAAC GGCGTCGGCG GCATCCCGGT CTACGATCCC ACGGATCCTG ACGGCGGCGC TGGCTTCCAC GCCTTCGAGA CCTCCGACCG CTTCAACTTC GCGCCCTACA ACTTGATGCT CACGCCCTCG CAGCGTACGG GCGCGTTCAG CGCGGTGCGC TACCGCCTGG CCGAGCGCGT GAACTTCAGC GGCAAGGTGT CGTTCACCAA CCGCAAGTCG GTCAACCAGG CCGCGCCCGA GCCGCTGTTC ATCGGCCCGG AGGCCGGTAA CGGCAACCGC CTCGACCGCA TCTCGATCCA CCAGAGCAAC CCCTACAACC CCTTCGGGTT CACGCTCGAC GCCGCCACCA ACCCCTACTT CATCGGTCGC CGTCCGCTCG AGGCCGGCCC GCGCCGCTTC GAGCAGTCGG TCAACACCTG GTACATGTCC GGTGGCCTCA ACGGCGACTT CGACATCGGC GGTCAGCGCT TCTACTGGGA CGCCAACGTG GCCTACGGCG CCAACCGCGC CGACCAGCTC AAGACCGGCG CCTTCAACTC GGCCAAGCTC GAGGACGCGC TGGGTCCGGC ATTTCAGGAC GGCGACGGCG TGTTCCGCTG CGGCACCGCC GAGAACCCGG GCAACGCCAA GTGTGTGCCC TTCAACATCT TCGGCGGCCA GGGCATGAGC 
GGCGACGGCA CGATCACCCA GGAGATGCTC GACTACGTCA CCTTCGTGCA GCACGACATC TCCGAGCAGA CCCTGTTCGA CGCCACCGCC AACGTCTCCG GTACCCTGGT CGAGCTGCCC ACGGGCGCGC TGGCCATGGC CGCCGGTGTC GAGCATCGCC GCCTGGCCGG CTTCTTCGAG CCCGACCCCG TGGTGGTCGC CGGCGACAGC GCGGGCGTGC CCTCGCAGCC GACCTCGGGC GACTACTGGG TCAACGAGGC CTACGCCGAG CTGCGCGCGC CGCTGGTCAC CGACATGCCG GGCGCCGAGC TGATCGATAT CAACGGCGCC GTCCGCGTGT CCGATTACTC GTTCCTGTCG CCGCAATTCA CCGGCAAGCT GGGCGCGCGT TGGAAGCCGA GCGATGACTT CATCCTCCGC GGCAGCTACG GTCAGGGCTT CCGGGCCCCG AGCATCGGCG AGATCTACGG CAGCGAGGCG CGCTTCGACG CCACCCTCAC CGACCCGTGC TCGAACCTCA ACCAGTACGC GGAGAACAGC CCCATCCGCC AGCGCTGCAT CGACCTGGGC GTGCCCGCCG ATGGCAGCTA CGAGCAGTTC AACCCGCAGA TCTCGGTGAC CACCGGTGGC AACCTCGAGC TCGAGCCCGA GACCTCGGAC AGCGTGGTGG TCAGCATGGC CTACAGCCCC TCGTGGCTGG AGGAGAACCT GTGGGTCGAC GCCTTCGACG TCGAGCTGGC CTACTACGAC GTGCGTCTCG ACGGTGCCAT CGCCGCCATC GACGCCGACG TCCAGCTTCA GGGCTGCGTC GTCGGCCAGG ACGACACGCT GTGCGACGGC ATCACGCGTA CCCCGGGCGG CACCATCAAC GGCTTCAGCA ACCGGCTGCA GAACATCGGC GGCATCGAGA CCCGCGGTCT CGACCTCACG CTCACCTACC TGATGCCCGA GACCGGCGCC GGTCGCTTCC GCTTCACCTC GCTGACCAAC TACCTCATCG ACTTCCACGA GCGCATCCCG TCGGCCTCCG GCTACAACGT GATCCGCCGT GAGGGCACCG AGATCGGCGA CCCCGAGCGC GCCTTCCCGC TGTTCAAGTC GTCGTTCATC ATCGACTGGT TCTCGGGTGA CTGGTACGCC TCGCTCACCA CCCGCTACAT CCACAAAGTG CGCGAGTCGT GCGACGCGGT CGACGGCGTG CCCAACGCCG ACGAGCTGTG CTCGGACCCC GACACCAGCG ACGCCGCCTT CGAGAACATC ATGTCGCCCA CCGTCTACAA CGACGTGCAG GTGACCTGGA CGCCCACGGA GATGCAGAAG GCGTTCACCA TGACCCTGGG TATCAACAAC CTGTTCAACG TCGATCCGCC GGCCTGCTAT AGCTGCGCGC TCAACGGCTT CGACGCCACG GTCTACGAGG TCCCCGGTAT CTTCGGGTAT CTCTCCGCCA GCTACCGCAT GTACTAA
|
Protein sequence | MKRMLTRYGV PSAGLTMLLM ASTPARAQDA QDAQADPAQP ATPAQPAAPA QPAAPAQPAA PAQPAAPAAP AQPAVPAAPA PAPAPAPAPA PAPAPAPAPA PAPAAQADEG ADQDFDPTVA LMGIGEEEGV EEAEGEVIVV TGSRIRTDPL DKQAPVLQLT REELERTGLT SVGDILQRLP VSGGALNTKF NSSGNFGFPP DGGGIGAGAA EADLRGLGSK RVLVLVDGVR WVNGSSASGV AASTDLNTIP LGIIERIEVL EDGASPIYGS DAISGVINII TRKDLDGAIA NAYLGGYNQG DGFTQKYDVS WGKSDEKMSI VVSASFVDQG LVRAEDRELS KDPVPNVPNC GAGCSSGTPQ GRFFFTDPNT GEARDLTINN GVGGIPVYDP TDPDGGAGFH AFETSDRFNF APYNLMLTPS QRTGAFSAVR YRLAERVNFS GKVSFTNRKS VNQAAPEPLF IGPEAGNGNR LDRISIHQSN PYNPFGFTLD AATNPYFIGR RPLEAGPRRF EQSVNTWYMS GGLNGDFDIG GQRFYWDANV AYGANRADQL KTGAFNSAKL EDALGPAFQD GDGVFRCGTA ENPGNAKCVP FNIFGGQGMS GDGTITQEML DYVTFVQHDI SEQTLFDATA NVSGTLVELP TGALAMAAGV EHRRLAGFFE PDPVVVAGDS AGVPSQPTSG DYWVNEAYAE LRAPLVTDMP GAELIDINGA VRVSDYSFLS PQFTGKLGAR WKPSDDFILR GSYGQGFRAP SIGEIYGSEA RFDATLTDPC SNLNQYAENS PIRQRCIDLG VPADGSYEQF NPQISVTTGG NLELEPETSD SVVVSMAYSP SWLEENLWVD AFDVELAYYD VRLDGAIAAI DADVQLQGCV VGQDDTLCDG ITRTPGGTIN GFSNRLQNIG GIETRGLDLT LTYLMPETGA GRFRFTSLTN YLIDFHERIP SASGYNVIRR EGTEIGDPER AFPLFKSSFI IDWFSGDWYA SLTTRYIHKV RESCDAVDGV PNADELCSDP DTSDAAFENI MSPTVYNDVQ VTWTPTEMQK AFTMTLGINN LFNVDPPACY SCALNGFDAT VYEVPGIFGY LSASYRMY
|
| |
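The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last blocks of the gene sequence shown above. This is a minimal sketch under stated assumptions. The codon table is built from the standard compact amino acid string, whose codon assignments translation table 11 shares with the standard code (the tables differ in permitted start codons, which this sketch does not model). The `translate` helper is hypothetical and ignores ambiguity codes. Although the record lists the gene on the minus strand, the sequence as displayed begins with ATG and ends with TAA, so it appears to be given in coding orientation already and no reverse complement is applied.

```python
# Compact codon table: bases in T, C, A, G order, one letter per codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Spaces (as used in the 10-base blocks of the record) are ignored.
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks and last 27 bases of the gene sequence above.
first_blocks = "ATGAAACGAA TGCTCACAAG GTATGGCGTG"
last_blocks = "CTCTCCGCCA GCTACCGCAT GTACTAA"

print(translate(first_blocks))
print(translate(last_blocks))
```

The two fragments translate to the first ten residues (MKRMLTRYGV) and the last eight residues (LSASYRMY) of the protein sequence in the record, with the final TAA read as the stop codon.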