Gene Hoch_5429 details


Gene Information

Locus tag: Hoch_5429
Symbol:
ID: 8547841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7453095
End bp: 7456361
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 69%
IMG OID: 646390101
Product: TonB-dependent receptor
Protein accession: YP_003269805
Protein GI: 262198596
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID:
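
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 7453095
end_bp = 7456361

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 3267 bp
print(protein_length, "aa")   # 1088 aa
```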


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0208974
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAA TGCTCACAAG GTATGGCGTG CCCAGCGCTG GCCTCACGAT GTTGCTGATG 
GCCAGCACGC CGGCCCGGGC TCAGGACGCG CAAGACGCGC AAGCCGACCC GGCGCAGCCC
GCGACTCCGG CGCAGCCCGC GGCTCCGGCG CAGCCCGCGG CTCCGGCGCA GCCCGCGGCT
CCGGCGCAGC CCGCGGCTCC GGCGGCTCCC GCGCAGCCCG CGGTCCCGGC GGCCCCGGCG
CCCGCGCCGG CGCCGGCTCC CGCGCCAGCC CCCGCACCGG CTCCGGCCCC CGCGCCGGCC
CCGGCCCCGG CCGCGCAGGC GGACGAGGGC GCCGACCAGG ACTTCGATCC CACGGTCGCG
CTCATGGGCA TCGGCGAGGA AGAGGGCGTG GAGGAGGCCG AGGGCGAGGT CATCGTGGTC
ACGGGTTCTC GTATCCGCAC CGACCCTCTC GACAAGCAGG CGCCGGTCCT GCAGCTCACC
CGCGAGGAGC TCGAGCGCAC CGGTCTCACC TCGGTGGGCG ACATCCTGCA GCGGCTGCCG
GTCTCGGGCG GCGCGCTCAA CACCAAGTTC AACTCCAGCG GTAACTTCGG CTTCCCGCCC
GATGGCGGCG GTATCGGCGC CGGCGCGGCC GAGGCCGATC TGCGCGGTCT CGGCTCCAAG
CGCGTGCTCG TGCTGGTCGA CGGCGTCCGC TGGGTCAACG GCTCTTCGGC CTCGGGTGTG
GCGGCTTCGA CCGACCTCAA CACCATCCCG CTCGGCATCA TCGAGCGCAT CGAGGTCCTC
GAGGACGGCG CCTCGCCGAT CTACGGCTCG GACGCGATCT CGGGCGTCAT CAACATCATC
ACCCGCAAGG ACCTCGACGG CGCCATCGCC AACGCGTACC TGGGTGGCTA CAACCAGGGC
GACGGCTTCA CCCAGAAGTA CGACGTCTCC TGGGGCAAGT CGGACGAGAA GATGTCCATC
GTGGTCAGCG CCTCGTTCGT CGACCAGGGC CTGGTCCGCG CCGAGGATCG CGAGCTGTCC
AAGGACCCGG TGCCCAACGT GCCCAACTGC GGCGCCGGCT GCTCCTCGGG TACGCCCCAG
GGCCGCTTCT TCTTCACCGA TCCCAACACC GGTGAGGCGC GCGATCTGAC CATCAACAAC
GGCGTCGGCG GCATCCCGGT CTACGATCCC ACGGATCCTG ACGGCGGCGC TGGCTTCCAC
GCCTTCGAGA CCTCCGACCG CTTCAACTTC GCGCCCTACA ACTTGATGCT CACGCCCTCG
CAGCGTACGG GCGCGTTCAG CGCGGTGCGC TACCGCCTGG CCGAGCGCGT GAACTTCAGC
GGCAAGGTGT CGTTCACCAA CCGCAAGTCG GTCAACCAGG CCGCGCCCGA GCCGCTGTTC
ATCGGCCCGG AGGCCGGTAA CGGCAACCGC CTCGACCGCA TCTCGATCCA CCAGAGCAAC
CCCTACAACC CCTTCGGGTT CACGCTCGAC GCCGCCACCA ACCCCTACTT CATCGGTCGC
CGTCCGCTCG AGGCCGGCCC GCGCCGCTTC GAGCAGTCGG TCAACACCTG GTACATGTCC
GGTGGCCTCA ACGGCGACTT CGACATCGGC GGTCAGCGCT TCTACTGGGA CGCCAACGTG
GCCTACGGCG CCAACCGCGC CGACCAGCTC AAGACCGGCG CCTTCAACTC GGCCAAGCTC
GAGGACGCGC TGGGTCCGGC ATTTCAGGAC GGCGACGGCG TGTTCCGCTG CGGCACCGCC
GAGAACCCGG GCAACGCCAA GTGTGTGCCC TTCAACATCT TCGGCGGCCA GGGCATGAGC
GGCGACGGCA CGATCACCCA GGAGATGCTC GACTACGTCA CCTTCGTGCA GCACGACATC
TCCGAGCAGA CCCTGTTCGA CGCCACCGCC AACGTCTCCG GTACCCTGGT CGAGCTGCCC
ACGGGCGCGC TGGCCATGGC CGCCGGTGTC GAGCATCGCC GCCTGGCCGG CTTCTTCGAG
CCCGACCCCG TGGTGGTCGC CGGCGACAGC GCGGGCGTGC CCTCGCAGCC GACCTCGGGC
GACTACTGGG TCAACGAGGC CTACGCCGAG CTGCGCGCGC CGCTGGTCAC CGACATGCCG
GGCGCCGAGC TGATCGATAT CAACGGCGCC GTCCGCGTGT CCGATTACTC GTTCCTGTCG
CCGCAATTCA CCGGCAAGCT GGGCGCGCGT TGGAAGCCGA GCGATGACTT CATCCTCCGC
GGCAGCTACG GTCAGGGCTT CCGGGCCCCG AGCATCGGCG AGATCTACGG CAGCGAGGCG
CGCTTCGACG CCACCCTCAC CGACCCGTGC TCGAACCTCA ACCAGTACGC GGAGAACAGC
CCCATCCGCC AGCGCTGCAT CGACCTGGGC GTGCCCGCCG ATGGCAGCTA CGAGCAGTTC
AACCCGCAGA TCTCGGTGAC CACCGGTGGC AACCTCGAGC TCGAGCCCGA GACCTCGGAC
AGCGTGGTGG TCAGCATGGC CTACAGCCCC TCGTGGCTGG AGGAGAACCT GTGGGTCGAC
GCCTTCGACG TCGAGCTGGC CTACTACGAC GTGCGTCTCG ACGGTGCCAT CGCCGCCATC
GACGCCGACG TCCAGCTTCA GGGCTGCGTC GTCGGCCAGG ACGACACGCT GTGCGACGGC
ATCACGCGTA CCCCGGGCGG CACCATCAAC GGCTTCAGCA ACCGGCTGCA GAACATCGGC
GGCATCGAGA CCCGCGGTCT CGACCTCACG CTCACCTACC TGATGCCCGA GACCGGCGCC
GGTCGCTTCC GCTTCACCTC GCTGACCAAC TACCTCATCG ACTTCCACGA GCGCATCCCG
TCGGCCTCCG GCTACAACGT GATCCGCCGT GAGGGCACCG AGATCGGCGA CCCCGAGCGC
GCCTTCCCGC TGTTCAAGTC GTCGTTCATC ATCGACTGGT TCTCGGGTGA CTGGTACGCC
TCGCTCACCA CCCGCTACAT CCACAAAGTG CGCGAGTCGT GCGACGCGGT CGACGGCGTG
CCCAACGCCG ACGAGCTGTG CTCGGACCCC GACACCAGCG ACGCCGCCTT CGAGAACATC
ATGTCGCCCA CCGTCTACAA CGACGTGCAG GTGACCTGGA CGCCCACGGA GATGCAGAAG
GCGTTCACCA TGACCCTGGG TATCAACAAC CTGTTCAACG TCGATCCGCC GGCCTGCTAT
AGCTGCGCGC TCAACGGCTT CGACGCCACG GTCTACGAGG TCCCCGGTAT CTTCGGGTAT
CTCTCCGCCA GCTACCGCAT GTACTAA
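
The GC content reported in the Gene Information fields is presumably the fraction of G and C bases in the gene sequence above. A minimal sketch of that calculation follows; `gc_fraction` is a hypothetical helper name, and the whitespace stripping reflects the 10-base blocks used in this listing. It is demonstrated only on the first line of the sequence, not the full gene.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First line of the gene sequence, copied from the listing above.
first_line = "ATGAAACGAA TGCTCACAAG GTATGGCGTG CCCAGCGCTG GCCTCACGAT GTTGCTGATG"
print(f"{gc_fraction(first_line):.2%}")
```

Pasting the whole gene sequence into `gc_fraction` would give the value to compare against the reported 69%.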
 
Protein sequence
MKRMLTRYGV PSAGLTMLLM ASTPARAQDA QDAQADPAQP ATPAQPAAPA QPAAPAQPAA 
PAQPAAPAAP AQPAVPAAPA PAPAPAPAPA PAPAPAPAPA PAPAAQADEG ADQDFDPTVA
LMGIGEEEGV EEAEGEVIVV TGSRIRTDPL DKQAPVLQLT REELERTGLT SVGDILQRLP
VSGGALNTKF NSSGNFGFPP DGGGIGAGAA EADLRGLGSK RVLVLVDGVR WVNGSSASGV
AASTDLNTIP LGIIERIEVL EDGASPIYGS DAISGVINII TRKDLDGAIA NAYLGGYNQG
DGFTQKYDVS WGKSDEKMSI VVSASFVDQG LVRAEDRELS KDPVPNVPNC GAGCSSGTPQ
GRFFFTDPNT GEARDLTINN GVGGIPVYDP TDPDGGAGFH AFETSDRFNF APYNLMLTPS
QRTGAFSAVR YRLAERVNFS GKVSFTNRKS VNQAAPEPLF IGPEAGNGNR LDRISIHQSN
PYNPFGFTLD AATNPYFIGR RPLEAGPRRF EQSVNTWYMS GGLNGDFDIG GQRFYWDANV
AYGANRADQL KTGAFNSAKL EDALGPAFQD GDGVFRCGTA ENPGNAKCVP FNIFGGQGMS
GDGTITQEML DYVTFVQHDI SEQTLFDATA NVSGTLVELP TGALAMAAGV EHRRLAGFFE
PDPVVVAGDS AGVPSQPTSG DYWVNEAYAE LRAPLVTDMP GAELIDINGA VRVSDYSFLS
PQFTGKLGAR WKPSDDFILR GSYGQGFRAP SIGEIYGSEA RFDATLTDPC SNLNQYAENS
PIRQRCIDLG VPADGSYEQF NPQISVTTGG NLELEPETSD SVVVSMAYSP SWLEENLWVD
AFDVELAYYD VRLDGAIAAI DADVQLQGCV VGQDDTLCDG ITRTPGGTIN GFSNRLQNIG
GIETRGLDLT LTYLMPETGA GRFRFTSLTN YLIDFHERIP SASGYNVIRR EGTEIGDPER
AFPLFKSSFI IDWFSGDWYA SLTTRYIHKV RESCDAVDGV PNADELCSDP DTSDAAFENI
MSPTVYNDVQ VTWTPTEMQK AFTMTLGINN LFNVDPPACY SCALNGFDAT VYEVPGIFGY
LSASYRMY
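
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons, and it stops at the first stop codon. `translate` and `CODON_TABLE` are illustrative names, not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases and last 27 bases of the gene sequence listed above.
print(translate("ATGAAACGAA TGCTCACAAG GTATGGCGTG"))   # MKRMLTRYGV
print(translate("CTCTCCGCCA GCTACCGCAT GTACTAA"))      # LSASYRMY
```

Both fragments match the corresponding ends of the protein sequence; the alternative start codons that table 11 permits do not matter here because this gene begins with ATG.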