Gene Hoch_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2052 
Symbol 
ID8544434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2837331 
End bp2839424 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content76% 
IMG OID646386755 
ProductProtein of unknown function DUF2126 
Protein accessionYP_003266490 
Protein GI262195281 
COG category[S] Function unknown 
COG ID[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.598866 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCAGC GAGAGTCCGA GGCAGGAACC CCTGAAGCTG CCCTCGTCGA GGCCCTGCGC 
GCGCACGATG CCGCCATCGA GGCGCGCGGC GGCGGCCTGT GGCTGGGCGG CGAACCGACC
TTCACCGACC GGCACTCGAG CGAGCCGTGC TGGAACGGCG GCGCGCTCGG CGGCGACAAG
CGCGAGCGCG GCCTGCAAGT GGCCGCCGCG CTGGCGGCCG CGCATCCCGG CGCGGTGTTT
TTGCGGACGC TCGGGCGACA ATATCCCGGC GAGTCCGCGC CGCGCTGGAG CTTTGGCGTC
TACCGCCGAC GCGACGGCGC CGCGGTGTGG CGCGGACCGC TCGATCCGGC CTGCGGCGGC
GGCAGCAGCA GCGAGGCGCA GGCCGCGGCG CTACGCGATG CGGTGGCCGC GCGCCTGGGC
GGGCGCAGCT TCCGCTGTGC GCAGAGCTTG CCCGAGCGCC TGGTGCTGGG CGAGGTCGCC
GACGACGATC CCGCGCTGGC GCGCGCGCCG GCCGGCACCG TGGCCCTGGG CGACGCGGGA
CCCACCGACA CCCTGGCCGA GCGTGGCGCG CTGCTGTTCT GCTTCGGCGT CATCGACGGC
GCCGCCCAGG TCGATCTGCC CGCGGCGGAT TCGGTTGAGC GCTTTCTCGG CTGGCTGTCC
GAAATCGCCG CGGCCGCGAG CGAGATCGGA CTCGCCGGGC TGCGCCTGGG CGGGGCGCCG
CCGCCGGTCG ACGACAGCGT GTGGTCGGCG ACGATCACGC CCGACCCCGG CGTGCTCGAG
ATCAACGGCG CGCCCGAGTC CTCGCTGGTC GCCTACCACC AGCAGTTGCG CTGGACCTAC
GGCGCGGCCG CCGCGGTCGG CCTCGAACCC GCGCGCCTGT TCTTCAACGG CGATGTCGCG
CCCTCGGGCG GTGGCGGGCA CGTCACCTTC GGCGGCCCCG CGCCCGAGCG CAGCCCGTTC
TTTCGCGCGC CCATGCTGCT GCCGCGGCTG CTGGCCTACG CCAACCGCCA CCCGGCGCTG
TCGTACTGGT TTGCGCCCGA GTGCGTGGGC TCGTCGAGCC AGTCGCCGCG CCCCGACGAG
GTCGCGCGCG AGTCCTTCGA CGAGCTGGGC GTGGCGCTCG AGCTGCTGAG CCGGCTCGAG
GCGCCCACGC CGGCCGACCT GTGGACCGCG CTGGCGCCGT TTCTGGCCGA CCGCTTCGGC
AACACCCACC GCACCGAGAT CAACGCCGAG AAGCTGTGGA ATCCCTACCT GCCGCTGCGC
GGGCGCCTCG GACTGGTCGA GATGCGCGCG CTGCGCATGC CGCCCGACGC CGAGCGAGCG
GCCGCCATCG CCGCGCTGCT GCGCGCGCTG CTGGCGTACC TGAGCACCAG CGAGGTGTCG
CTCGAGCTGC GCGACTGGGG CGCCGAGCTC CACGATCGCT TCGCGCTGCC GCACTTCCAG
CGCGTCGACC TCGACGCCGT GCTGGGCGAA TTGGCGGCCG CCGGCCTGGG CCTGGGTGAG
CCGCTGCGGC GCGCGCTGCG CGACGACGCA CATCTCCTGG TGGGCGAGCT GGCGCTGCCC
GCGGGCGCCA CCCTGCGCCT GCGCTGGGCG CGCGAGTTCT GGCCGCTGAT GAGCGACGAC
AACCAGCCCG AGCAGAGCTC GCGGCTCATC GACGCGAGCT GCGCGCGGCT CGAGCTCACC
ATCACGTCCG CGGGCGCGAT GGACCCGGAG CGGGCGCCGG CGCTGGTGGT CGGCGGCTAT
CGCGTGCCCT GGGGGCGCGA CGGCGACACC CTGGTGCGCG CGCTGCGCTT TCGCCGCTTC
GTGCCCGGGC GCGGTCTGCA CCCGCACGTG CCTGCGCTCG ACCCGCTGCT CTTCGACGTC
GGCGAGCAGC GTCTGGCCCT GCATGGCTGG CGCCCGGGCG GCGGCTCCTA CGACGGTCTG
CCCGCGGACC TGGCCGAGGC CGCGCGCCGC CGGGCCGAGC GCTTCGTGGT GAGCGCGCGC
CCGGACGCGC CCGCGCCCGT GCCGGTGCCG CGCTCGGCGC TGAGCCCGTG GACCGTGGAC
CTGCGCCGGC TGCCGCCCAC CTCCGGGCGC GCGGCAGAGC CGTCCGAAAG CTGA
 
Protein sequence
MSQRESEAGT PEAALVEALR AHDAAIEARG GGLWLGGEPT FTDRHSSEPC WNGGALGGDK 
RERGLQVAAA LAAAHPGAVF LRTLGRQYPG ESAPRWSFGV YRRRDGAAVW RGPLDPACGG
GSSSEAQAAA LRDAVAARLG GRSFRCAQSL PERLVLGEVA DDDPALARAP AGTVALGDAG
PTDTLAERGA LLFCFGVIDG AAQVDLPAAD SVERFLGWLS EIAAAASEIG LAGLRLGGAP
PPVDDSVWSA TITPDPGVLE INGAPESSLV AYHQQLRWTY GAAAAVGLEP ARLFFNGDVA
PSGGGGHVTF GGPAPERSPF FRAPMLLPRL LAYANRHPAL SYWFAPECVG SSSQSPRPDE
VARESFDELG VALELLSRLE APTPADLWTA LAPFLADRFG NTHRTEINAE KLWNPYLPLR
GRLGLVEMRA LRMPPDAERA AAIAALLRAL LAYLSTSEVS LELRDWGAEL HDRFALPHFQ
RVDLDAVLGE LAAAGLGLGE PLRRALRDDA HLLVGELALP AGATLRLRWA REFWPLMSDD
NQPEQSSRLI DASCARLELT ITSAGAMDPE RAPALVVGGY RVPWGRDGDT LVRALRFRRF
VPGRGLHPHV PALDPLLFDV GEQRLALHGW RPGGGSYDGL PADLAEAARR RAERFVVSAR
PDAPAPVPVP RSALSPWTVD LRRLPPTSGR AAEPSES