Gene Hoch_0081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_0081 
Symbol 
ID8542452 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp126986 
End bp129286 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content71% 
IMG OID646384869 
Productcapsular exopolysaccharide family 
Protein accessionYP_003264615 
Protein GI262193406 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.80701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCGC CCTCGACACA CTCCCCCGCT CGCCAGGAGA CCGCCGCGTC GCCGCCGGGT 
GCGATCCTCG AAGACGAGCG GCCGGCGCTC GATCTCGCCC GCTACTGGCG CGTGGTGCGC
AAGCGCGCGT GGCTGATCGC GGCCGTGGTC GCCGTCGGCG TGACCGCGTC GGTCCTGTAC
ACGCGCAGTC TGCCCAAGAT CTATCAGGCC ACGGCCAGCG TGGTCATCGA CCCGACGCCG
CCCCAGGTCT TCGGCAGCCA GGTGCAGGAG GTCATCCAGC TCGGCGCCCA GAGCTACTGG
TCCAATCAGG AGTACTACAA CACCCAGCTC GAGATCTTCA CCCGCTTCGA CCTGGTCGAG
CGCACGCTGG CCCGCCACCC CGAGCTGATC GATCTGGTCC TGGCCACCGA GATCCCGGGC
GCGCGCTCGG CCGGTGGCGA TCGCGATGGC GGCGATAGCG ACGGGGTCGC CGATGACCCG
CTGAGCCTGG CGACCGAGCG CCTGGCGGCC CTGCTCACGG CCACGCAGAC GCGCGAGAGC
CGCGTGGTCC GCCTGCGCGC GCAGCACACC GAGCCCGCGG TGGCCGAGCA GCTCGCCAAC
TATCACGTCG ATACCTTCCT CGCCTATCAC CGCAACCTGC GCGGCGAGAA CACCGAGCAG
ATCACCGAGT TTCTCGACCA GGAGCTGGTG ACCTCGGCCG ACGACCTGGA AAACGCCGAG
CGCGCGCTGC TGGCGTTCAA GAAGGACAAC GACATCCTCT CGGTGTCGAT CGAGGACCGG
CAGAACATCC TCGCCGCCGA CATCGCGCGC TACAACGCCG CGCTCAGCGA CGCCCGCATC
GAGCGCATCG AGCTGGGCAC CATCCTCGCC CGCGCCCGCG CGCTCGAGGG CGAGGCCATC
ATGGAGTCGC CGATCTTCGC GCTGGCCTCG TCCACCGACA TGGTCGAGGA GCTCAAGGCC
CAGTACATCC GCGAGAAGCA CAAGCTGGCC GAGCTGAGCC AGGAGCTGGG GCCGCGGCAT
CCCGCCTACC AGGCGCAGAA CGAGAAGGTC GAGCAGCAGT ACGCGGCCAT CGAGGCCGAG
GCCCGGCGCG CGCTGCGCGA GCTGAGCGAG CGCCATCAGG CGGCGCTGGC CCGCGAGCGC
CAGCTCGGCG CCGAGCTCGA GCGGCTCAAG GCCGAGGCCT TCGAGCTGGG CGACAAGACC
GCCGTGTACA AGCAGCTCGA GCGCAAGCAG CACAGCGAGG AGGAGAAGTA CAATCTGGTG
CTCAGCCGCC TGGGCGCGAG CCAGCTCAGC GAGCGCAACA CCGCCAGCAA CGTGCACCTG
CACATGCGCG CGCGCTCGGC CGAGCTGGTG TATCCGCGCG TGGCCGTCAA CCTCGGCCTG
GGCGGGGTGC TGTCGCTGCT GCTGGGGCTG GGGCTGGCCT TCTTGCTCGA CTTCTTCGAC
CGCACCATCA AGTCGCCCGA GCAGGTCGAG CAGGCCACGG GCGCGCCGCT GCTGGGCATG
ATCCCGGCCG TCGAGGCCGC GGGCGAGGGC GCGGACGCGG ACCGGGACCG CGCGCGCGAT
CTCTTCGTGT TCGACAAGCC GCGCTCGGCC GTGGCCGAAC ACTGCCGCTC GATCCGCACC
AACATCCTGT TCAGCACCGC CATGCGGCCG ACCAAGGTGC TCACGATCTC GAGCCCGCGG
CAGGGCGAGG GCAAGACCAC GACCGCGATC TACCTGGGCA CGATCATGGC CCAGAGCGGG
CAGCGCGTGC TGCTGGTCGA CACCGACATG CGCCGGCCGC GCCTGCACCA GAGCCTGGGC
ACGGGCACGG CCACGGCGCA CGGGCTCAGC GAGCTGCTGC TGCCCGAGAC CCGGATCGCC
GACAAGCTCG ACCAGGTCAT CGTCGAGACC GCGGTGCCCG GGCTGTTCTT GCTGCCCTGC
GGCGCGGTGC CGCCCAATCC CGCCGAGCTG CTACTCACCG AGCGCTTCGG CGAGGTGCTG
GACGCGCTGC GCGAGCGCTT CGACCGGGTG CTGCTCGATT CGCCGCCGCT GATGCTGATG
AACGACGCGG TGGTGCTGTC GCGGCGCTCC GACGGCGTGG TCATGGTCGC ACGCGCGGGC
CGCACCGCGG TCGAGGACCT GAGCCGCTCG GGGCGCATGG TGCGCGACGT CGACGCGCCG
GTCCTGGGCG TCATCCTCAA CGGTGCCAGC ACCGCGCGCG GGCGCTACGG CGGCTACGAG
CGCTACGGCT ACGCGGGCGA TGACGAGGAC GGCGGCGAGC TCCGCTCCAG CGGCCGCGAT
GGCGCGGAGC GGGCGGCATG A
 
Protein sequence
MSPPSTHSPA RQETAASPPG AILEDERPAL DLARYWRVVR KRAWLIAAVV AVGVTASVLY 
TRSLPKIYQA TASVVIDPTP PQVFGSQVQE VIQLGAQSYW SNQEYYNTQL EIFTRFDLVE
RTLARHPELI DLVLATEIPG ARSAGGDRDG GDSDGVADDP LSLATERLAA LLTATQTRES
RVVRLRAQHT EPAVAEQLAN YHVDTFLAYH RNLRGENTEQ ITEFLDQELV TSADDLENAE
RALLAFKKDN DILSVSIEDR QNILAADIAR YNAALSDARI ERIELGTILA RARALEGEAI
MESPIFALAS STDMVEELKA QYIREKHKLA ELSQELGPRH PAYQAQNEKV EQQYAAIEAE
ARRALRELSE RHQAALARER QLGAELERLK AEAFELGDKT AVYKQLERKQ HSEEEKYNLV
LSRLGASQLS ERNTASNVHL HMRARSAELV YPRVAVNLGL GGVLSLLLGL GLAFLLDFFD
RTIKSPEQVE QATGAPLLGM IPAVEAAGEG ADADRDRARD LFVFDKPRSA VAEHCRSIRT
NILFSTAMRP TKVLTISSPR QGEGKTTTAI YLGTIMAQSG QRVLLVDTDM RRPRLHQSLG
TGTATAHGLS ELLLPETRIA DKLDQVIVET AVPGLFLLPC GAVPPNPAEL LLTERFGEVL
DALRERFDRV LLDSPPLMLM NDAVVLSRRS DGVVMVARAG RTAVEDLSRS GRMVRDVDAP
VLGVILNGAS TARGRYGGYE RYGYAGDDED GGELRSSGRD GAERAA