Gene Hoch_2138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2138 
Symbol 
ID8544524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2965477 
End bp2970942 
Gene Length5466 bp 
Protein Length1821 aa 
Translation table11 
GC content69% 
IMG OID646386845 
Productalpha-2-macroglobulin domain protein 
Protein accessionYP_003266576 
Protein GI262195367 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.977531 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTGG CGGTGAAACG ATTTGGTATT TTCGTAGCAA TTCTCGCGGC TCTGGGCTGC 
GGATGCGGCG CCGATGCGCC CAGCGATCTC AGCGCTAGCG GCACGCCGAG CGGCGAGGTC
GCCGAGCTGC TCGAGGTCGT GGTCTCGTTC TCGCGGCCCA TGGTCGCCGA GTCCGCGCTC
GACGAGCCGC TCGACAAGGC GCCGCTGCGC ATCGCGCCCG AAATCGCCGG CGAGATGCAC
TGGCGCGATG CGAGCACCTT GGTGTTCGTC GCCACCACCA ACCTGCCCGC GTCCACCGAG
TTCACGGCGA CGGTCCCGGC CGGGACCGCG GCCCTCGACG GCAACCAGCT CGCCGAGCCG
TACTCGTTCA CGTTCACGTC CGAGCGCCTG GCCGGCTCGA TGGAGGTGGT GGGCTCGGCC
GAGCGCGCGC GCGCCGACCA GCAGATCAAG CTGACGTTCA ACCAAGAGGT GGCGTTCGAT
CAGGTGCGCG AGCACTGCCA CTACGCAGCC CACGGCAAGG CGATTCCCGT GGTCCTCGCG
CCCGATACGA GCGACGGCCC GGCCAAGTCG TATCTGGTCA CGCCGGCCGA AGAGCTGGCC
CTCGACACCG AGTGGACGCT GCTGTGCAAA GCCGATCTGC GCGGCACGCA GGGCAAGCTG
AGCATGGCGG CCGCGGCCTC GCAGCGCTTC CACACCTACG GTCCGCTGGA GTTTGTGGCC
ATGACGCCCG AGGGTAAGGA CCTCGTCCCC GACGAGGGCC TGCGCCTCGA GCTGGCCTTC
AACAACCCCC TGGCCAAGCC CTACGCCATC AGCTTGAAGC CCGCGGCGCC CGGCTTCCCC
GAGCGCTGCC ACATGGCCGG CGAGGTGCCG CCGGCGCTGA GCTGTCCCGC GATCCTCGAG
CCCAACACCA GCTACACGCT GACCGTGGAC GGCGCGCAGA CCGACATCTT CGGCCAGACC
CTGGGCGAGA GCGTGACGCT GCCGCTGCGC ACCTCCAACG CCAAGCCCAC GGTGTCGATG
GAGGCCGGGT ATTTCGTCGC CGAGCTCAAA CGGCCCGTGT ATCCGCTGTG GCTGCGCAAC
GTGAGCAAAC TCAAGGTCCG CGCCGCCGCG GTCACGCCCG CGAATTTCCA CGAGCTGCGT
CCGCTGCTCG ACTGGTGGAC GAGCGACGCC GCCAAGCTCA AGGGCACCTC GCTGAGGGCG
CAGAGCAAGG ACGTGACGCT CGAGCTCGAG GAGAACAAAT GGCACCAGTA CCCGCTCGAC
CCGAGCGAGT TCTTCGGCGG CCAGCCGGGC CCGGGCATGT ACTACTTCGA GGTCGGCGCG
CCCGAGGTCG AGAGCGGCGG GTTCTGGAAC GGCGGCTACA AGAAGGTGCT CATCAGCTTC
ACCGACATCG GCGTGGTCAC CAAGGTGGGG GGCGCGCGCG GTCTGGTGTG GGCCACGCAG
CTCTCTACCG GCAAGCCGCT GGCCGGCGCC AAGGTGACGG TGCGCGCCGG CGGCAAGGTG
ACCTGGAACG GCACCACCGA CGCCGAGGGT GTAGCCGTGC TGCCGCCGCG CGATAGCTTG
ACGGGCACAG GCGAGCAGGG CGGCTCGCTC AACGTGTACG TGAGCAAGGG CGACGACTGG
ACCATGGTCG ACCCCGAGCG CACGGGCAGC CTGTCGGCGT GGAACTTCAA CGTCTCGCCG
TCCTGGAATC ATTCGTCCGT GCGCATGCGC GGTTTCATGC ACTCGGATCG CGGCCTGTAC
CGACCGGGCG ACACCGTGCA CGTCAAAGGA CTGGCGCGCA CCAGCCAACT CGGCGCGCCG
CTCGCGGTGC CGAGCGAGAA GCGGGCGAAG CTGACGGTGC GCGGCCCGCG CGGCGACGAG
CTGCTCACCA GGGAGGTGCC GATCAGCGAC TTCGGCGGCT TCTGGACCGA CATCGAGATG
CCCAGCGACG CCCGCCTGGG CGACTACAGC GTGCGCGCCG AGATGGAGCA CGGCAGTTTT
TCGACCAGCT TCGCGGTCGA GGAGTACCGG GCCGCGTCCT TCGAGGTCAC GGGCAAGAGC
GAGACCAAGC GGCTGGTGCG CCGCGGCTCG CTCGAGGCCA AGGTCTCGGC CAATTACTTC
TACGGCGCGC CAGTGCGCGA CGCCCAGGCC ACCTTCACGG TGCACAGCCG CCCGCGCTCG
GTGTCCTTTC CCGAACACGA GCAGTTCAGC TTCGGCGACG ACCGTCGCTA TGACAGCTAT
CGCTACTACC CGGACTATTC GCAGACGCTC ATCACCGAGG TGCAGGCGCG GCTCGACAGC
GACGGCAACG GCTCGCTGTC GGTGCCGCTC ACGCCCAGCG ATGTGAGCGG CGACGCCGAT
CTGCTGGTGC GCGCCAGCGT CACCGCGCCC TCGAACGAGG TCATCAACGA CTCGTTCCTG
GTCCCGTACT TCCGCGCGCG CCGCTACCAC GGCATCAAGA GCGAGGGCGG CTACTTCCTC
GAGGTCGGCA AGAAGCGGCG CTTCGAGGTC ATGGCGGTGT CGCCCGCGGG CAAGCCGGTG
GCGGGCGATG TGCAGGTGAC CGTGCAGCGC CGCGACTGGA ACTGCCTGTG GGAGGATTGG
GGCTACCGCG GCTCGTATCG CTGCAACGAG ATCAAGCACG ACGTGCTGTC CACCTCGGTG
AAGATGCAGG AGAACGCGCC CGGCAGCTTC GAGTTCACGC CCGAGGGCGG CGGCGAGTAC
TGGATCATCG TCGAGGGCGA GCGCGACAAC CACGCCTCCG CCGCCATGCG CATGTACGCC
TGGGGCGACG GCGGCGGCTC GTGGCGCAGC GACGACACGA TGAGCTTCGA CCTGCTGTCC
GATAAAAAGG AATACCGGGT CGGTGACACC GCCACGCTGC TGCTGCAGAC CGACCTCAGC
GAGGGCTCGG GCCTGGTCAC CATCGAGCGC GACGGCGTCA TCGAGAGCCG GCCGTTCGAG
CTCTCGCCCA CCAACAAGCA CATCAAGGTG CCGATCCTCG ACAGCTACGC GCCCAACGTC
TACGTCTCGG TGGCGCTGGT GCAGGGGCGC ATGGGCGAGG GGCCGCGCGG CATGCCGCGC
ATGCGCATGG GGCTCACCAA CCTGCGGGTG CGGCCCGAGG GCAACGTCCT CAAGGTCGCA
GTGCAGACCG CGCGTCCCGA TTACCGCCCC GGTGAGCGCG TCGAGGCCAC GGTCACGGTG
ACCGACGCGA GCGGCGCCCC GGTCTCGGCC GAGGTCGCGA TCACGGCCGC GGACGAGGGC
GTGCTCTCGC TCATCGATTA CGAGACGCCC AACCCCACGC CGACCTTCTA CTCGCCCTGG
GGTCTGGCCG TGGAGACCTC CACGCAATAT CAATATCTCA AGGACATCGC CGCGCCCAAC
CTCGAGCGCC CGGCCACCGG TGGCGATGCC GGCGGGCCCG GCTCGCTGCG CGCGCGCTTC
CTGGCCTCGG CGGTGTGGAA GCCGGGCGTG GTCACCGACA GCGCCGGCAA GGCCACGGTG
AGCTTCGACG CGCCCGACAA CCTGACCGCC TTCCGGGTCA TGGCCGTGGC CGCCGATCGC
GGCCAGCGCT TCGGCTCGGG CGACAAGCGC TTCACGGTGT CCAAGCCGCT GCAGCTCCAC
CGCTCGCTGC CGCGCTTCCT CACCCTGGGC GACACGCTGC AGGGCGGCGT GGTGGTGCAC
AACGAGACCG GCAAGGCCGG CCGCGCCACC GTGGAGCTGA AGACCAATGA CGCGCTCGCG
CTGTCGGGCT CGGCGCAGCA GACCGTCGAT GTGCCCGCGG GCGGGCGCGT GCCGGTGCTC
TTTGCCATCG AGGCCATGAA CCCGGGCAAC GCCGAGCTGA CCTTTTCCGT CCGTATGGAC
AAAGAACGCG ATGACGTGCG CTTCGAGCTG CCGGTGCATC ACGCCTCGCC CGAGCGCAAG
CTGTCGGTGG CGCGCGGCGA CACCAAGGGC GAGGAGCGCA TCGCCATCGA GCTGCCCGAC
CATGCCATCG CGTCCACGGC GGTGCTGAGC GTGTCGGTCG ATCCCGACGG CCTGGCCGGC
ATCGAGGAGG GTCTGCAGTC GCTCATCCGC TATCCCTACG GCTGTCTGGA GCAGACCACG
TCCAAGGTGA TTCCCATCAT CGCGGTGCGC GAGCTGGCCG AGGCGCTGCA GCTCGAGGGC
CTGAGCGGCG CCGAGATCGA CGAGTTTGTC ACAGCCGGTG TCGGCAAGAT CGGCCGCCAT
CAGAACCCGG ACGGCGGCTA CGCGCTGTGG CCGGGCAACG ACTCGGAGAC CTACTACACG
GCCTACGCGC TGTGGGGTCT GCACCTGGCC AAGCAAGCCG GCTACGCGGT CGAGGACAGC
CGCATCCGCG AGGGTCTCAG CTACTTGCGC TACAACGCCG AGGGGCAGGA TGACGGGCCG
CACTACAGCG CCGCCGGCGA CTTCGGCTCG CGCGCCTTTG CCCTGTACGT GCGCGCCATG
CTCGGCGACG CGGATCCCCA GGCCGTGACC CGCCTGGCCG AGGAGTCGGC CATGCCCGTG
TACGGGCGCG CGTTTCTGGC GCGTGCCCTG GCCGCGTCGG TGGGCGCCAA GGGCGCGGGC
GTGAGCGCCA TGGTCGCCGA CCTGCGGGCC AAGGCCGAGG CGGCCGCGCA GCGCGGCGAG
CTCATCGAGG AATCCCAAGA CGACGAGCTC GACTGGTACA TGTCCAACTC GGTGCGGACC
ACGGCCATCG TTCTCGAGGC GCTGATCGCG CTCGATCCCA AGGCGCCCGT GATCAAGAAG
CTGGTGGCGA GCCTGATGAA GGCGCGGCGG GCTCGGCCCT ATTTCAGCAC CCAGGACAAC
CTGTATACCC TGCTGGCGCT GTCGTCGTAC GCGCGGTCGA TGAGCGGGGC GCCTCCGAGC
GTGACGGTAA TGCTCGGCGA CGACACCCTG ATCGCCGGCA AGCTCGGCGG CAAGCTGCGC
ATCCGGGTGG CGACGGCGCA GTTGCGCGCG CGCCAGGCGA CGCTGAGCAT CCGCGCCCAG
GGCGAGGTGC ACTACGCCGT GGACCTGCGC TACCGGCAGA AGCCGGAGAC GCTCGACACC
GTGTCCGAAG TGCTGGCGCT CGAGCGCGTC TACCTCGACG ACAGCGGCGC GCCCAAGTCC
TCGTTCCAGG TCGGCGATGT GTTCAAGGTC AAGCTGTCCA CGCCGATCGA CAAGCGCCGC
ACGCACCTGA TGATTTCGGA CCGCCTGCCG GCCGGCTTCG AGGCGCTCAA CGCGCGTCTG
GCCACGGTGG GCAGCGAGGG CCGCGTGGAG CAGCGCAGGA CCTGGGGCAC GCACCGCGAG
CTGCGCGACG AGCGCGCCGA CTTCTCGGCC GAATACGTGT CCGAGGGCGC CTACGTGCGC
GAGTACATGG TCCGCGTGAT CGCGGCCGGC CGCTTCGCCG TGCCGCCGGC CGTGGCCGAG
CTGATGTACG AGCCTGAAGT GCACGCGCAG ACCGCGCTCA CGCACATCGA CATCGCGGCC
AGGTGA
 
Protein sequence
MELAVKRFGI FVAILAALGC GCGADAPSDL SASGTPSGEV AELLEVVVSF SRPMVAESAL 
DEPLDKAPLR IAPEIAGEMH WRDASTLVFV ATTNLPASTE FTATVPAGTA ALDGNQLAEP
YSFTFTSERL AGSMEVVGSA ERARADQQIK LTFNQEVAFD QVREHCHYAA HGKAIPVVLA
PDTSDGPAKS YLVTPAEELA LDTEWTLLCK ADLRGTQGKL SMAAAASQRF HTYGPLEFVA
MTPEGKDLVP DEGLRLELAF NNPLAKPYAI SLKPAAPGFP ERCHMAGEVP PALSCPAILE
PNTSYTLTVD GAQTDIFGQT LGESVTLPLR TSNAKPTVSM EAGYFVAELK RPVYPLWLRN
VSKLKVRAAA VTPANFHELR PLLDWWTSDA AKLKGTSLRA QSKDVTLELE ENKWHQYPLD
PSEFFGGQPG PGMYYFEVGA PEVESGGFWN GGYKKVLISF TDIGVVTKVG GARGLVWATQ
LSTGKPLAGA KVTVRAGGKV TWNGTTDAEG VAVLPPRDSL TGTGEQGGSL NVYVSKGDDW
TMVDPERTGS LSAWNFNVSP SWNHSSVRMR GFMHSDRGLY RPGDTVHVKG LARTSQLGAP
LAVPSEKRAK LTVRGPRGDE LLTREVPISD FGGFWTDIEM PSDARLGDYS VRAEMEHGSF
STSFAVEEYR AASFEVTGKS ETKRLVRRGS LEAKVSANYF YGAPVRDAQA TFTVHSRPRS
VSFPEHEQFS FGDDRRYDSY RYYPDYSQTL ITEVQARLDS DGNGSLSVPL TPSDVSGDAD
LLVRASVTAP SNEVINDSFL VPYFRARRYH GIKSEGGYFL EVGKKRRFEV MAVSPAGKPV
AGDVQVTVQR RDWNCLWEDW GYRGSYRCNE IKHDVLSTSV KMQENAPGSF EFTPEGGGEY
WIIVEGERDN HASAAMRMYA WGDGGGSWRS DDTMSFDLLS DKKEYRVGDT ATLLLQTDLS
EGSGLVTIER DGVIESRPFE LSPTNKHIKV PILDSYAPNV YVSVALVQGR MGEGPRGMPR
MRMGLTNLRV RPEGNVLKVA VQTARPDYRP GERVEATVTV TDASGAPVSA EVAITAADEG
VLSLIDYETP NPTPTFYSPW GLAVETSTQY QYLKDIAAPN LERPATGGDA GGPGSLRARF
LASAVWKPGV VTDSAGKATV SFDAPDNLTA FRVMAVAADR GQRFGSGDKR FTVSKPLQLH
RSLPRFLTLG DTLQGGVVVH NETGKAGRAT VELKTNDALA LSGSAQQTVD VPAGGRVPVL
FAIEAMNPGN AELTFSVRMD KERDDVRFEL PVHHASPERK LSVARGDTKG EERIAIELPD
HAIASTAVLS VSVDPDGLAG IEEGLQSLIR YPYGCLEQTT SKVIPIIAVR ELAEALQLEG
LSGAEIDEFV TAGVGKIGRH QNPDGGYALW PGNDSETYYT AYALWGLHLA KQAGYAVEDS
RIREGLSYLR YNAEGQDDGP HYSAAGDFGS RAFALYVRAM LGDADPQAVT RLAEESAMPV
YGRAFLARAL AASVGAKGAG VSAMVADLRA KAEAAAQRGE LIEESQDDEL DWYMSNSVRT
TAIVLEALIA LDPKAPVIKK LVASLMKARR ARPYFSTQDN LYTLLALSSY ARSMSGAPPS
VTVMLGDDTL IAGKLGGKLR IRVATAQLRA RQATLSIRAQ GEVHYAVDLR YRQKPETLDT
VSEVLALERV YLDDSGAPKS SFQVGDVFKV KLSTPIDKRR THLMISDRLP AGFEALNARL
ATVGSEGRVE QRRTWGTHRE LRDERADFSA EYVSEGAYVR EYMVRVIAAG RFAVPPAVAE
LMYEPEVHAQ TALTHIDIAA R