Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_0081 |
Symbol | |
ID | 8542452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 126986 |
End bp | 129286 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646384869 |
Product | capsular exopolysaccharide family |
Protein accession | YP_003264615 |
Protein GI | 262193406 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.80701 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCCGC CCTCGACACA CTCCCCCGCT CGCCAGGAGA CCGCCGCGTC GCCGCCGGGT GCGATCCTCG AAGACGAGCG GCCGGCGCTC GATCTCGCCC GCTACTGGCG CGTGGTGCGC AAGCGCGCGT GGCTGATCGC GGCCGTGGTC GCCGTCGGCG TGACCGCGTC GGTCCTGTAC ACGCGCAGTC TGCCCAAGAT CTATCAGGCC ACGGCCAGCG TGGTCATCGA CCCGACGCCG CCCCAGGTCT TCGGCAGCCA GGTGCAGGAG GTCATCCAGC TCGGCGCCCA GAGCTACTGG TCCAATCAGG AGTACTACAA CACCCAGCTC GAGATCTTCA CCCGCTTCGA CCTGGTCGAG CGCACGCTGG CCCGCCACCC CGAGCTGATC GATCTGGTCC TGGCCACCGA GATCCCGGGC GCGCGCTCGG CCGGTGGCGA TCGCGATGGC GGCGATAGCG ACGGGGTCGC CGATGACCCG CTGAGCCTGG CGACCGAGCG CCTGGCGGCC CTGCTCACGG CCACGCAGAC GCGCGAGAGC CGCGTGGTCC GCCTGCGCGC GCAGCACACC GAGCCCGCGG TGGCCGAGCA GCTCGCCAAC TATCACGTCG ATACCTTCCT CGCCTATCAC CGCAACCTGC GCGGCGAGAA CACCGAGCAG ATCACCGAGT TTCTCGACCA GGAGCTGGTG ACCTCGGCCG ACGACCTGGA AAACGCCGAG CGCGCGCTGC TGGCGTTCAA GAAGGACAAC GACATCCTCT CGGTGTCGAT CGAGGACCGG CAGAACATCC TCGCCGCCGA CATCGCGCGC TACAACGCCG CGCTCAGCGA CGCCCGCATC GAGCGCATCG AGCTGGGCAC CATCCTCGCC CGCGCCCGCG CGCTCGAGGG CGAGGCCATC ATGGAGTCGC CGATCTTCGC GCTGGCCTCG TCCACCGACA TGGTCGAGGA GCTCAAGGCC CAGTACATCC GCGAGAAGCA CAAGCTGGCC GAGCTGAGCC AGGAGCTGGG GCCGCGGCAT CCCGCCTACC AGGCGCAGAA CGAGAAGGTC GAGCAGCAGT ACGCGGCCAT CGAGGCCGAG GCCCGGCGCG CGCTGCGCGA GCTGAGCGAG CGCCATCAGG CGGCGCTGGC CCGCGAGCGC CAGCTCGGCG CCGAGCTCGA GCGGCTCAAG GCCGAGGCCT TCGAGCTGGG CGACAAGACC GCCGTGTACA AGCAGCTCGA GCGCAAGCAG CACAGCGAGG AGGAGAAGTA CAATCTGGTG CTCAGCCGCC TGGGCGCGAG CCAGCTCAGC GAGCGCAACA CCGCCAGCAA CGTGCACCTG CACATGCGCG CGCGCTCGGC CGAGCTGGTG TATCCGCGCG TGGCCGTCAA CCTCGGCCTG GGCGGGGTGC TGTCGCTGCT GCTGGGGCTG GGGCTGGCCT TCTTGCTCGA CTTCTTCGAC CGCACCATCA AGTCGCCCGA GCAGGTCGAG CAGGCCACGG GCGCGCCGCT GCTGGGCATG ATCCCGGCCG TCGAGGCCGC GGGCGAGGGC GCGGACGCGG ACCGGGACCG CGCGCGCGAT CTCTTCGTGT TCGACAAGCC GCGCTCGGCC GTGGCCGAAC ACTGCCGCTC GATCCGCACC AACATCCTGT TCAGCACCGC CATGCGGCCG ACCAAGGTGC TCACGATCTC GAGCCCGCGG CAGGGCGAGG GCAAGACCAC GACCGCGATC TACCTGGGCA CGATCATGGC CCAGAGCGGG CAGCGCGTGC TGCTGGTCGA CACCGACATG CGCCGGCCGC GCCTGCACCA GAGCCTGGGC ACGGGCACGG CCACGGCGCA CGGGCTCAGC GAGCTGCTGC TGCCCGAGAC CCGGATCGCC GACAAGCTCG ACCAGGTCAT CGTCGAGACC GCGGTGCCCG GGCTGTTCTT GCTGCCCTGC GGCGCGGTGC CGCCCAATCC CGCCGAGCTG CTACTCACCG AGCGCTTCGG CGAGGTGCTG GACGCGCTGC GCGAGCGCTT CGACCGGGTG CTGCTCGATT CGCCGCCGCT GATGCTGATG AACGACGCGG TGGTGCTGTC GCGGCGCTCC GACGGCGTGG TCATGGTCGC ACGCGCGGGC CGCACCGCGG TCGAGGACCT GAGCCGCTCG GGGCGCATGG TGCGCGACGT CGACGCGCCG GTCCTGGGCG TCATCCTCAA CGGTGCCAGC ACCGCGCGCG GGCGCTACGG CGGCTACGAG CGCTACGGCT ACGCGGGCGA TGACGAGGAC GGCGGCGAGC TCCGCTCCAG CGGCCGCGAT GGCGCGGAGC GGGCGGCATG A
|
Protein sequence | MSPPSTHSPA RQETAASPPG AILEDERPAL DLARYWRVVR KRAWLIAAVV AVGVTASVLY TRSLPKIYQA TASVVIDPTP PQVFGSQVQE VIQLGAQSYW SNQEYYNTQL EIFTRFDLVE RTLARHPELI DLVLATEIPG ARSAGGDRDG GDSDGVADDP LSLATERLAA LLTATQTRES RVVRLRAQHT EPAVAEQLAN YHVDTFLAYH RNLRGENTEQ ITEFLDQELV TSADDLENAE RALLAFKKDN DILSVSIEDR QNILAADIAR YNAALSDARI ERIELGTILA RARALEGEAI MESPIFALAS STDMVEELKA QYIREKHKLA ELSQELGPRH PAYQAQNEKV EQQYAAIEAE ARRALRELSE RHQAALARER QLGAELERLK AEAFELGDKT AVYKQLERKQ HSEEEKYNLV LSRLGASQLS ERNTASNVHL HMRARSAELV YPRVAVNLGL GGVLSLLLGL GLAFLLDFFD RTIKSPEQVE QATGAPLLGM IPAVEAAGEG ADADRDRARD LFVFDKPRSA VAEHCRSIRT NILFSTAMRP TKVLTISSPR QGEGKTTTAI YLGTIMAQSG QRVLLVDTDM RRPRLHQSLG TGTATAHGLS ELLLPETRIA DKLDQVIVET AVPGLFLLPC GAVPPNPAEL LLTERFGEVL DALRERFDRV LLDSPPLMLM NDAVVLSRRS DGVVMVARAG RTAVEDLSRS GRMVRDVDAP VLGVILNGAS TARGRYGGYE RYGYAGDDED GGELRSSGRD GAERAA
|
| |