Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_0205 |
Symbol | |
ID | 8542584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 306827 |
End bp | 308362 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646385001 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_003264739 |
Protein GI | 262193530 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCATCG GGACCCAAAC ACCACGCGAT AAATTCGATC AGCTCCTCGC CTTCGTGCGG CGCACCATGC GCTACTGGTG GCTGGTCGGC GTTATCACCT TCATCGGCGG GGCCTTGGCA GTGGTGTTCG CGCTCACGCA GAAGCCCAAG TTCCTCTCGG AAACCAAGAT CTTCTACAAC GAGCGCATTC AGTCGAGCGT GCTCCAGGGC CGCGACTACG GCGTCAACAC CAAGAACCTC GGCTACCACT ACGAAGAGAT GTTGATGTCG CGCACCAACA TCCAGTCGAT CATCGAGAAG CTCGAGCTGT TCCCCAAGGT GCGCGACAAG AAGGGCATCG ACGCCGCGCT CGAGGAGTTC GACAAGAGCG CCAAGTTCCG CGTGCGCGGC ACCGGCATGT TCAACATCTC GTTCCTCGGC GAGGACCCTG AAAAGTCGCA GGCCGTGACC GCGATGATGG TCGACATCCT CATGCGCGAG GACGAGCGGC TGCGGCGCGA GCAGGCCTCG GCGACCCTCA ACTTCCTGCT CGAGGAGAAG GCCAAGATCA ACAAGGATCT CGACCAGCGC AATCGCGAGC TGGCCAAGTT CCTCACCGAG CACCCCGAGT TCGCGCTCGA CAACACAGTC GGCGGCGCGC AGACGCCGGG CGCGACCATC CGCGCCCAGG CCAAGGCCAA GTCCGGCCAG GGTCCGGCGG TCAGTGGCCC CACCAACGTC GATCCCCGCA TCCTGGCGCT CGAGCGCCAG CGCCGCCGCA TCCGCGACCG CCTGGCCGCG CCCGACCAGG TCGGCCCGCC GCGCAAGACG CCCGAGCAGA TCGAGGCCGA GCGCCTGGTG TCCGAGGCCG AGCGCGACCT GCGCAGCGCC CAGCGCGCGC TGCAAGACCG ACTGTCGCGG TTGCAGCCCG CCCACCCCGA CGTGATCCAG GCGCAGAGCG AAGTGGCGGC CGCGCAGCGA CGCGTGCGCC AGCTCGAGGC CGCGGTGCCG TCGGCGGCGA TTCCCAACAA GCCCATCGAC CGCAGCGCGC TCGAGGGCGA GCTGCGTGAG GTCGAGCGCC AGATCGCCGG TGTGCGCAGC AGCATCCGCG AGGAGAGCGG CGACGACGAG ACGGCAGGGG AAGCCGAGGT GCCGCTCTCC GAAGAGGATT GGGTGATCAA GCTCGAGACC GAGTACGCGC GCCTCAAGCA GGCGGTGGAG GAGCAGCAGA AGCGCCTCGA GAGCACCGAC TCCAGCTTGT CGCGGGCCCA GATCACGGCC AGCCAGCAGA TGGCCGAGCA GGGCGCGGTG CTGTCGATCA TCGACCCCCC GAGCCTGCCG ACCCTGCCGC AAGGCAAAGG CCGCGCCATC CTGGCCGCCG CTGGCACCGC CGTGTTCATC ATCCTGGGCA CGCTGCTGGC GCTGGCGCTG GCGCTCATCG ACGACCGCAT CTACAGCGCC GGCGATCTCG AGCGCCTGGC GATCGCGCCG GTCGCGGTCG TGGTTCCCAA AATGCGCAAG CCGGGCTTGC TGAGAAGGCT ATTCCGCCGT GGCTGA
|
Protein sequence | MPIGTQTPRD KFDQLLAFVR RTMRYWWLVG VITFIGGALA VVFALTQKPK FLSETKIFYN ERIQSSVLQG RDYGVNTKNL GYHYEEMLMS RTNIQSIIEK LELFPKVRDK KGIDAALEEF DKSAKFRVRG TGMFNISFLG EDPEKSQAVT AMMVDILMRE DERLRREQAS ATLNFLLEEK AKINKDLDQR NRELAKFLTE HPEFALDNTV GGAQTPGATI RAQAKAKSGQ GPAVSGPTNV DPRILALERQ RRRIRDRLAA PDQVGPPRKT PEQIEAERLV SEAERDLRSA QRALQDRLSR LQPAHPDVIQ AQSEVAAAQR RVRQLEAAVP SAAIPNKPID RSALEGELRE VERQIAGVRS SIREESGDDE TAGEAEVPLS EEDWVIKLET EYARLKQAVE EQQKRLESTD SSLSRAQITA SQQMAEQGAV LSIIDPPSLP TLPQGKGRAI LAAAGTAVFI ILGTLLALAL ALIDDRIYSA GDLERLAIAP VAVVVPKMRK PGLLRRLFRR G
|
| |