Gene Hoch_2051 details

Gene Information

Locus tag: Hoch_2051
Symbol:
ID: 8544433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2834087
End bp: 2837323
Gene Length: 3237 bp
Protein Length: 1078 aa
Translation table: 11
GC content: 73%
IMG OID: 646386754
Product: Protein of unknown function DUF2126
Protein accession: YP_003266489
Protein GI: 262195280
COG category: [S] Function unknown
COG ID: [COG4196] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
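
The start and end coordinates, gene length, and protein length listed in this record can be cross-checked against each other. The sketch below is only an illustration. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length, neither of which the record states explicitly. The variable names are hypothetical.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 2834087
end_bp = 2837323

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 3237 bp and protein length of 1078 aa.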


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.86109
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.238647
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAATTT TTGTTCAGCA CCGCACCCGT TACGTCTTCG AACAGCCCGC CTCGCTCGGC 
CCGCACATCG TGCGCCTCCA CCCCGCGGCC CACACCCGCG CCCAGGTGCT GTCGTACAAC
CTCGACGTCG ACGTCGATTG CCAGATGCGC TGGCAGCGCG ACCCCTGGGG CAACCGCGTC
GCGCGCCTCA CCTTCGCCGA GGGCTGCACC ACGCGCACGC TCACGCTCAC GGTCGACGCC
GCCTTCGACA TCCACCCGGT CAACCCCTTC GACTTCTTCG TCGACGATCG CAGCGCCGCG
CTGCCCATGC ACTATCCCGA CGGCCTGGAC GAAGAGCTCG CGCCCTTCCT CAAGCCGCCC
GAGGCGAGCC CGGAGCTGTC CGCGTTCATC GAAGCGCAGC CGGCCACCGG TGGCGTGGTC
GACTACATGG TCGAGCTCAA CCGGCGCGTG GCCGAGCGGG TCTCGTACGT CATCCGCGAC
GAGGCCGGCA TCCAGACCAG CGAGGAGACC CTGACCATCG GCCGCGGTAG CTGCCGCGAC
TCGGCCCGGC TGCTGGTCGA TGTGCTGCGC GCGCGCGGGA TCGCGGCCCG CTTCGTCTCG
GGCTACCTGG TGCAGCTCAA GGACGAGGGC TTCATCCCCG ACCTGCCGCG CGGCGTCGAT
CGCGACGTCG TCGACCTCCA CGCCTGGGCC GAGGCCTACG TGCCCGGCGC CGGCTGGATC
GGGCTCGACG GCACCAGCGG CCTGATGTGC GGCGAGGGCC ACATCCCGCT GGCGAGCACG
GTGATGCCGG CGCTGGCCGC GCCCGTGAGC GGCACCGCCT CGCAGCCGGC GAGCTCGCTC
GAGTTCGAGA TGCACCTGGG CCGCATCGGC CACGAGCCGC GCCCGCGCGT GCCCTACACC
GAGGACACCT GGCAGGAGAT GTGTCGCGCC GGCGACGCCA TCGACGCAGC CATCACCGGC
AGCGGGCTCG CGCTCACCTG CGGCGGCGAG CCGACCTTCA CCTCGCACGA GCACGCGGGC
GAGCCCGAGT GGCAGACCGA GGCGCTGGGG GCGAGCAAGT GGGGCCAGGG TCTGCGCCTG
GCCAATTCGC TGGCGGAGCG TCTGGGCACC GGCACGCTGA CCATGCAGCG CATGGGCAAG
CACTACCCGG GCGAGAGCCT GCCGCGCTGG GTGCTGAACC TGCTGTGGCG CGCCGACGGT
CAGCCGGTGT GGCGCGATGC GGCGCTGCTG GCGCGCGAGC CCGACCCGCA GGCGCCCGCC
GATCTGGCGC TGGCCGAGCG CTTTTTGCGC GATCTGGCCG AGCGCCTGGG CGTCGTCGCG
CCGCAGCTCG AACCCGGCTA CGAGGACCCG TGGTACGCGA TCGAGACCGA GCAGCGTCTG
CCCGACGATG TCGATCCGCT GGCGGCCGAT CTCGATGACT CCGAGACCCG TCGGCGCCTG
TCGCGCATGC TCGGCCACGG TCTCGGACAG CCGGTCGGAT ACGTCCTGCC GCTGGGTCGG
CGCGCGGGCG GCTGGGCCAG CGATCGCTGG AGCTTTCGCC GTGAGCACAT GTTCCTGGTG
CCCGGCGACA GCCCCATGGG TCTGCGCCTG CCGCTCGACT CCCTGGGCGG CAGCGCGGCC
GCCCAGTTTC CGCGCGACGT CACCGCCATC CGCCTCGACG AGCCGCTGCA CTTCCCCCCC
GGCACGCCGT CCGGAGCCGG GCCGGAGCCC GCGTCCGACG GGGACTCGGA CGAGCTGCTG
TACACCGCGC TGTGCGTCGA GCCGCGCGAC GGCCACCTGT GCGTGTTCCT GCCGCCGCTC
GAGACCGCCG ATGACTTCCT CGCGCTCGTC GCCGCGGTCG AGGACGCCGC GGCCGCGCTC
GCGCGCCCGG TGCTGATCGA GGGCTACCCG CCGCCCAGCG ACCCGCGCCT GCGCACCTGC
GTGGTCGCGC CCGATCCCGG CGTACTCGAG GTCAACATGC CGGTGTGCGC CAGCTTCGCC
GAGTACCAGA GCATCATGGC CATGGTCAAC GACGCCGCGC ACCACGCTGG CCTGTCGACC
GAGAAGTATC AGATGGACGG CCGCGAGGTC GGCAGCGGCG GCGGCAACCA CCTCACCCTG
GGCGGGCCGA GCACGGTCGA GAGCCCGTTT TTGCGCCGCC CGGCGCTGCT CGGCGGCCTG
CTGCGCTACC TCAACAATCA CCCCTCGCTG TCGTATCTGT TCACCGGCCT GTTCGTGGGC
CCGACCTCGC AGGCGCCGCG CATCGATGAG GCCCGCCTCG ACTCGCTGTA CGAGCTCGAG
CTGGCGCTGG CCCAGATGCC CGAGGGCGAG ACCGACCAGC CCTGGCTCAC CGACCGGCTG
CTGCGCCACT TGCTCGTCGA CGTGTCCGGT AACGGCCACC GCACCGAGGT GTCGATCGAC
AAGCTGTATC ACCCGCTCGC GCTCGGCGGC CGCCAGGGCA TCGTCGAGTT TCGCGCCTTC
GAGATGCCGC CGCACACGCG CCTGGCCGCG GCCCAGATGC TGCTCGTGCG CGCCCTGGTG
GCGCGCCTGG CCAACGCCCC GTACCGCGAG AAGCTGGTGC GCTGGGGCAG CCGGCTGCAC
GACCGCTTCA TGCTGCCGCA CTTCCTGTGG CACGACTTCG AGGAGGTCGC CGCCGACCTG
GCCGGGCACG GGCTGCCCTT CGAGGCCTCG TGGTTCCGGC CCTTCCTCGA CCACCGCTGC
CCGGTTTTCG GACGCCTGCA GCTCGGCGAT GTCGAGCTCG AGCTGCGCAC CGCGCTCGAG
CCCTGGCCGA CCCTGGGCGA GCAGCCTTCG GGCGCGGTGG TGGCGCGCTA CGTCGACTCC
TCGCTCGAGC GCCTGCAGGT GAGCGCGCGC GGCGTGGTCG AGGACCGTCA CGCCATCGCC
GTCAACGGCG TGGTGCTGCC CATGTGGCCC ACGGGCAACG CCGGCGAGCA GGTGGCCGGC
GTGCGCTTCC GCGCCTGGCA GCCGCCCGAG TGTCTACAGC CGACCATCGG CGTGCACCAC
CCGCTGCGCT TCGACGTGGT CGACACCTGG GCGCAGCGCT CGCTGGGCGG CTGCACCTAT
CACGTCTGGC ACCCCGGGGG CCGGGCCTTC GAGCAGCCGC CGCTCACCGC CTTCGAGGCC
GCGGCCCGGC GCGCGCAGCG TTTCACGACC GACGCCCACG CCGCCTGGCC GGTGTCGCTG
CGGCACCTGC CGCCGCACCC CGAGCAGCCG CTGACCCTCG ACCTGCGCCG CTCCTGA
 
Protein sequence
MRIFVQHRTR YVFEQPASLG PHIVRLHPAA HTRAQVLSYN LDVDVDCQMR WQRDPWGNRV 
ARLTFAEGCT TRTLTLTVDA AFDIHPVNPF DFFVDDRSAA LPMHYPDGLD EELAPFLKPP
EASPELSAFI EAQPATGGVV DYMVELNRRV AERVSYVIRD EAGIQTSEET LTIGRGSCRD
SARLLVDVLR ARGIAARFVS GYLVQLKDEG FIPDLPRGVD RDVVDLHAWA EAYVPGAGWI
GLDGTSGLMC GEGHIPLAST VMPALAAPVS GTASQPASSL EFEMHLGRIG HEPRPRVPYT
EDTWQEMCRA GDAIDAAITG SGLALTCGGE PTFTSHEHAG EPEWQTEALG ASKWGQGLRL
ANSLAERLGT GTLTMQRMGK HYPGESLPRW VLNLLWRADG QPVWRDAALL AREPDPQAPA
DLALAERFLR DLAERLGVVA PQLEPGYEDP WYAIETEQRL PDDVDPLAAD LDDSETRRRL
SRMLGHGLGQ PVGYVLPLGR RAGGWASDRW SFRREHMFLV PGDSPMGLRL PLDSLGGSAA
AQFPRDVTAI RLDEPLHFPP GTPSGAGPEP ASDGDSDELL YTALCVEPRD GHLCVFLPPL
ETADDFLALV AAVEDAAAAL ARPVLIEGYP PPSDPRLRTC VVAPDPGVLE VNMPVCASFA
EYQSIMAMVN DAAHHAGLST EKYQMDGREV GSGGGNHLTL GGPSTVESPF LRRPALLGGL
LRYLNNHPSL SYLFTGLFVG PTSQAPRIDE ARLDSLYELE LALAQMPEGE TDQPWLTDRL
LRHLLVDVSG NGHRTEVSID KLYHPLALGG RQGIVEFRAF EMPPHTRLAA AQMLLVRALV
ARLANAPYRE KLVRWGSRLH DRFMLPHFLW HDFEEVAADL AGHGLPFEAS WFRPFLDHRC
PVFGRLQLGD VELELRTALE PWPTLGEQPS GAVVARYVDS SLERLQVSAR GVVEDRHAIA
VNGVVLPMWP TGNAGEQVAG VRFRAWQPPE CLQPTIGVHH PLRFDVVDTW AQRSLGGCTY
HVWHPGGRAF EQPPLTAFEA AARRAQRFTT DAHAAWPVSL RHLPPHPEQP LTLDLRRS