Gene Hoch_5509 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5509 
Symbol 
ID8547922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7553321 
End bp7556485 
Gene Length3165 bp 
Protein Length1054 aa 
Translation table11 
GC content70% 
IMG OID646390182 
ProductTonB-dependent receptor plug 
Protein accessionYP_003269885 
Protein GI262198676 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.180232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACCT GTTCCCGGGT GATCATTCAT CCGGCGCGTT CTCGGCGCGC ACTCCACCTC 
TTTGCGGCCG CGCTGACGGC TGCGTGTTTG CTCCCGACCA CCGGCGTGAC CCCGGTCTAT
GCTCAGGTCG CGACCACGGG CAGCTTGCGC GGCGTCATCA GCGACCGGGC CAGCGGCGCG
CCCGTGGCCA ACGCCGTGGT CACGGTGCGC GGCCCCGCGC TCATCGGTTC GGCAGACGCC
ACCACGGATG TCGACGGAGC GTACGAAATC AGCAATCTGC CGCCCGGCCT GTACACCCTG
AGCGTGTACG CGGGCCAGGC GCAGGTGTCG CGCTCGGGCG TGCTCATCCA GCTCGGCAAG
CAGGCGCGAA TCGACGTCCC CGTGAGCGAG CAGAGCATGG CCGGCGAGGT GGTCGAGATC
GAGGGTCGGG TGCCCATCAT CGACCAGGGC TCGACCAAGG CCGGCGTCAC CATTACCAAC
GACTACATGC GCAACGTGCC CACGGGGCGC AACTTCGACG AGGTCCTGGA AGTGGCCCCC
GGCGCCCAGG TCGACCAGTT CGGCACCGGC TTCGGCGGCT CGAGCTCGCC CGAGAACAGC
TACGTCATCG AGGGCCTCAA CGTCACCGGC GTCGGCTTCG GCCAGGCGCG GCTGACCCTG
CCGCACGAGT TCCTCGAGGA GATCGAGGTC GTGAGCGGCG CCTACGGCAC GGAATTCGGA
CGCTCGACCG GCGGCGTCGC CAACGTGCTC ACGCGCGCGG GCGGCGAAGA GCTCCGCGGC
AGCGTGTTCG GCTACGTGAG CCCGGGCGCG CTCTCGGGCA CGCCGCGCAC CGTGGTCGAC
GAGAGCAACG CCCTCACCTA TCGCCGCGAC CTGGTCGCCG ACACCGACTT CGGCGTCGAG
GTCGGCGGTC CGCTGATCCC GGACACGCTG TGGTTCCACG CCGGCTTCAG CCCGTCGATC
CGCCTGCGCA CCGCGGACAC GCTGATCGAG GGCCAGGCCG AGGACACCGA GGAGATCAAC
AGCCAGACCT ATTACTTCAC CGGCAAGCTG TCGTACGCGG CCTCGCCCGA TCACGGCGGC
TTCGTGTCGG TGTTTGGCAA TCCCAGCACA GACGAGCGCC TGTTCGACGA GTTCGCGGTC
GGCAACCCCT ACACGCTCAA CTTCGAGGAC GACAACGGCG TGGTCGCCGG CGTGAGCCGC
TGGACCTCGC GCTTCCCCGC GATCGACGGC GAGCTGACCG CGACCCTGGG CGTGTACCGC
GGCCACGACG TCGAGGGCGC CAAGTACGGC AGCGCGCTCG ATCGCCAGGC CGTGCGCTTT
TTGGCGCCGC GCCCGCTGGC CGACTTCGCC GACTTCCAGG ACGTGCCCGA GCGCTGCGGC
GCCGACGGCT GCCAAGTGTA CAACTACCAG ATCGGCGGCC TCGACGTGTT CACCCAGGCC
AAGACCCTGC GCCTCGGCGG CGGTCTGGTG TATCGTCAGA TCTTCGACGC CCTGGGCCGC
CACCGGGTCA AGGTCGGCGC CGACGTCGAG GTCAACCGCT ACGACTCGCG CGCCGGCTTC
AGCGGCGGCA CGCGCTGGTG GCTCGCGCCC GGCGGGTTCA TCGGGCTGCC CGTGGAGAAT
GCCGCGCTGG GCTGGCGCTT CCTGCGCCTC GACGAGAACG GCGACCAGCC CTGCGGCCTC
GACCTCGACC TCGACGGCGC GCCCGACTCG CGCTGCAGCT ATCAGACCGA CGGCTTCGAC
GCCGACGCCC AGACCCTCAA CAGCGCGCTG TTTTTGCAGG ACTCGTGGAC CGTGCTGCCC
AACCTCACCA TCGAGGCCGG CGTGCGCTAC GAGCGCCAAG ACGTGGGCGC GGCCGACCAG
GTCGTGGGCC AGATCAACCC GGTGAGTGGC GACCCCATCG AGCAGAACGC GTTCGCGCTC
AACAACCTGG CTCCGCGCGT CGGCGCCATC TACGACTGGA CCAACGAGGG CCGTTCGCGG
GTTTTCGGAC ACTGGGGCCG CTACTACGAG TCGGTGCCCA TGGACCTCAA CGCCCGCGCG
TTGTCCGGCG AAGTGGTCGA GATCAGCTTC TACGACGCCG CCGCGTGTAG CGACCCCTTC
CGCCCCAAGA CCTACGACTG CGGCGCCGAG GGTCTGCTCG GCAACCAGCA GCTCGGCAGC
GGCGGCAAGC TGATCGCGCC CGGCCTGCAG GGCCAGTACA TGGACGAGAT CGTCCTGGGC
GTCGAATACG AGCCGCTGGC CAACCTCAAG GTCGGCGCCG TGTACACGCA CCGCGACCTC
GGCCGGGTCA TCGAGGACGT CTCGCCCGAC GGCGGCGTCT CGTTGGTGCT GGCCAACCCC
GGCGAGTACG ACGCCGGCGC GGTCGCCGAT CTGCGCTCGC AGGCCGACGC CGCCCGCGCT
GCCGGCAACA TGGACGAGGC CGCGCGCCTC GATCGCACCG CCACCCTGGC CGAGGGCGTG
GGCGACTTCG ACCCGCCCAA GCGCAGCTAC GACGCGATCG AGCTGAGCGC GGTCAAGCGC
TTCTCCGACG ACCTGATGGG CCGCTTCTCG TACGTGTACT CGCGGCTCGA GGGCAACTTC
GCCGGTCTGT TCTCGCCCGA CACCGGCCAG CTCGACCCGA ACTTCACCTC GCGCTACGAC
GTGCCCGAGC TGATGACCAA CCGCTACGGC CGCCTGGCCT CGGACGCGCC GCACCAGATC
AAGCTCGACG CCTTCTATCA CCTGCCGATT CCGGCCGAGC TCGGCGCGCT GATCGTCGGC
GGACGCTTCC GCGGCCGCTC GGGACGCCCG CACAACTACC TGGCCGATCA TCCCATCTAC
GGCGCCAAGG AGGTCGTGCT GTTGCCGCGC GGCAGCGGCG AGCGCAACGA CTTCGTGACC
TCGGTCGACC TGCAGCTCAC CTACGGCCGG GCGCTGAGCG AGGACATGGG CCTCGAGGTG
TTCGTGTCGG TGTTCAACCT GCTGAACCAG AAGACCGCGC TGCGCCGCGA CGAGCAGTAC
ACCTTCGACT ATCCCAACCC GGTCGGCGGC GGCAGCGTCG AGGACCTCGA GCACGTCAAG
TCGCCCGGCA CCAACCAGGC GCTGTCGACC AACGCGAACT TCGGCAACCC GATCGAGTTC
CAGGACCCGA TCGGCGTGCG CCTGGGCGCT CGCTTCACCT TCTGA
 
Protein sequence
MRTCSRVIIH PARSRRALHL FAAALTAACL LPTTGVTPVY AQVATTGSLR GVISDRASGA 
PVANAVVTVR GPALIGSADA TTDVDGAYEI SNLPPGLYTL SVYAGQAQVS RSGVLIQLGK
QARIDVPVSE QSMAGEVVEI EGRVPIIDQG STKAGVTITN DYMRNVPTGR NFDEVLEVAP
GAQVDQFGTG FGGSSSPENS YVIEGLNVTG VGFGQARLTL PHEFLEEIEV VSGAYGTEFG
RSTGGVANVL TRAGGEELRG SVFGYVSPGA LSGTPRTVVD ESNALTYRRD LVADTDFGVE
VGGPLIPDTL WFHAGFSPSI RLRTADTLIE GQAEDTEEIN SQTYYFTGKL SYAASPDHGG
FVSVFGNPST DERLFDEFAV GNPYTLNFED DNGVVAGVSR WTSRFPAIDG ELTATLGVYR
GHDVEGAKYG SALDRQAVRF LAPRPLADFA DFQDVPERCG ADGCQVYNYQ IGGLDVFTQA
KTLRLGGGLV YRQIFDALGR HRVKVGADVE VNRYDSRAGF SGGTRWWLAP GGFIGLPVEN
AALGWRFLRL DENGDQPCGL DLDLDGAPDS RCSYQTDGFD ADAQTLNSAL FLQDSWTVLP
NLTIEAGVRY ERQDVGAADQ VVGQINPVSG DPIEQNAFAL NNLAPRVGAI YDWTNEGRSR
VFGHWGRYYE SVPMDLNARA LSGEVVEISF YDAAACSDPF RPKTYDCGAE GLLGNQQLGS
GGKLIAPGLQ GQYMDEIVLG VEYEPLANLK VGAVYTHRDL GRVIEDVSPD GGVSLVLANP
GEYDAGAVAD LRSQADAARA AGNMDEAARL DRTATLAEGV GDFDPPKRSY DAIELSAVKR
FSDDLMGRFS YVYSRLEGNF AGLFSPDTGQ LDPNFTSRYD VPELMTNRYG RLASDAPHQI
KLDAFYHLPI PAELGALIVG GRFRGRSGRP HNYLADHPIY GAKEVVLLPR GSGERNDFVT
SVDLQLTYGR ALSEDMGLEV FVSVFNLLNQ KTALRRDEQY TFDYPNPVGG GSVEDLEHVK
SPGTNQALST NANFGNPIEF QDPIGVRLGA RFTF