Gene Hoch_0205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0205 
Symbol 
ID8542584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp306827 
End bp308362 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content68% 
IMG OID646385001 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_003264739 
Protein GI262193530 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATCG GGACCCAAAC ACCACGCGAT AAATTCGATC AGCTCCTCGC CTTCGTGCGG 
CGCACCATGC GCTACTGGTG GCTGGTCGGC GTTATCACCT TCATCGGCGG GGCCTTGGCA
GTGGTGTTCG CGCTCACGCA GAAGCCCAAG TTCCTCTCGG AAACCAAGAT CTTCTACAAC
GAGCGCATTC AGTCGAGCGT GCTCCAGGGC CGCGACTACG GCGTCAACAC CAAGAACCTC
GGCTACCACT ACGAAGAGAT GTTGATGTCG CGCACCAACA TCCAGTCGAT CATCGAGAAG
CTCGAGCTGT TCCCCAAGGT GCGCGACAAG AAGGGCATCG ACGCCGCGCT CGAGGAGTTC
GACAAGAGCG CCAAGTTCCG CGTGCGCGGC ACCGGCATGT TCAACATCTC GTTCCTCGGC
GAGGACCCTG AAAAGTCGCA GGCCGTGACC GCGATGATGG TCGACATCCT CATGCGCGAG
GACGAGCGGC TGCGGCGCGA GCAGGCCTCG GCGACCCTCA ACTTCCTGCT CGAGGAGAAG
GCCAAGATCA ACAAGGATCT CGACCAGCGC AATCGCGAGC TGGCCAAGTT CCTCACCGAG
CACCCCGAGT TCGCGCTCGA CAACACAGTC GGCGGCGCGC AGACGCCGGG CGCGACCATC
CGCGCCCAGG CCAAGGCCAA GTCCGGCCAG GGTCCGGCGG TCAGTGGCCC CACCAACGTC
GATCCCCGCA TCCTGGCGCT CGAGCGCCAG CGCCGCCGCA TCCGCGACCG CCTGGCCGCG
CCCGACCAGG TCGGCCCGCC GCGCAAGACG CCCGAGCAGA TCGAGGCCGA GCGCCTGGTG
TCCGAGGCCG AGCGCGACCT GCGCAGCGCC CAGCGCGCGC TGCAAGACCG ACTGTCGCGG
TTGCAGCCCG CCCACCCCGA CGTGATCCAG GCGCAGAGCG AAGTGGCGGC CGCGCAGCGA
CGCGTGCGCC AGCTCGAGGC CGCGGTGCCG TCGGCGGCGA TTCCCAACAA GCCCATCGAC
CGCAGCGCGC TCGAGGGCGA GCTGCGTGAG GTCGAGCGCC AGATCGCCGG TGTGCGCAGC
AGCATCCGCG AGGAGAGCGG CGACGACGAG ACGGCAGGGG AAGCCGAGGT GCCGCTCTCC
GAAGAGGATT GGGTGATCAA GCTCGAGACC GAGTACGCGC GCCTCAAGCA GGCGGTGGAG
GAGCAGCAGA AGCGCCTCGA GAGCACCGAC TCCAGCTTGT CGCGGGCCCA GATCACGGCC
AGCCAGCAGA TGGCCGAGCA GGGCGCGGTG CTGTCGATCA TCGACCCCCC GAGCCTGCCG
ACCCTGCCGC AAGGCAAAGG CCGCGCCATC CTGGCCGCCG CTGGCACCGC CGTGTTCATC
ATCCTGGGCA CGCTGCTGGC GCTGGCGCTG GCGCTCATCG ACGACCGCAT CTACAGCGCC
GGCGATCTCG AGCGCCTGGC GATCGCGCCG GTCGCGGTCG TGGTTCCCAA AATGCGCAAG
CCGGGCTTGC TGAGAAGGCT ATTCCGCCGT GGCTGA
 
Protein sequence
MPIGTQTPRD KFDQLLAFVR RTMRYWWLVG VITFIGGALA VVFALTQKPK FLSETKIFYN 
ERIQSSVLQG RDYGVNTKNL GYHYEEMLMS RTNIQSIIEK LELFPKVRDK KGIDAALEEF
DKSAKFRVRG TGMFNISFLG EDPEKSQAVT AMMVDILMRE DERLRREQAS ATLNFLLEEK
AKINKDLDQR NRELAKFLTE HPEFALDNTV GGAQTPGATI RAQAKAKSGQ GPAVSGPTNV
DPRILALERQ RRRIRDRLAA PDQVGPPRKT PEQIEAERLV SEAERDLRSA QRALQDRLSR
LQPAHPDVIQ AQSEVAAAQR RVRQLEAAVP SAAIPNKPID RSALEGELRE VERQIAGVRS
SIREESGDDE TAGEAEVPLS EEDWVIKLET EYARLKQAVE EQQKRLESTD SSLSRAQITA
SQQMAEQGAV LSIIDPPSLP TLPQGKGRAI LAAAGTAVFI ILGTLLALAL ALIDDRIYSA
GDLERLAIAP VAVVVPKMRK PGLLRRLFRR G