Gene Information |
Locus tag | Hoch_4038 |
Symbol | |
ID | 8546439 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5542483 |
End bp | 5543643 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646388715 |
Product | ApbE family lipoprotein |
Protein accession | YP_003268430 |
Protein GI | 262197221 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
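The coordinate and length fields above are internally consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not; both assumptions match the numbers in this record but are not stated by it. The function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a gene span that ends
    with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = feature_lengths(5542483, 5543643)
print(gene_len, protein_len)  # 1161 386
```

The result agrees with the Gene Length (1161 bp) and Protein Length (386 aa) fields.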
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.618341 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.564525 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGATCC CGCCGCCGCC GATCCCGCCG CGCCGCTCGC GGATCGCGCG CCGCCCCCGG CTGCGCAGCC CCGGCCTGGC CGCCGCCGGC CTGGCCGCCG CCCTGCTCGC AGCGCCGCTC GCGCGCCCGG CCCGGGCCGC GCCGCCGGCC CAACAACAAC AACAACAACA ACAGGCGCAG GCGGCCCAGG CGCAGGAAGC CGCCAAGCAC ATGTTCGTGC GCAGCGCCCA GACCATGGGC ACGCGCGTCA CCCTCAGCCT GTGGAGTCAC GACGAGGCCG AGGCCGCGCG CGCCGCCGCC GCCGTGTTCG AGGAGTTTCA GCGCGTCGAC GCGCTGATGA CCTCGTGGGG TGAGGACAGC GACGTGGCCC GCATCAACGC CGCCGCCGGC GGCCCCTTGG TGCGCGTCGA CGCCGAGGTC ATGGGCGTGC TGCAGCGCGC GCAGCAGATC GCGCGCCGCT CCGCGGGCGC CTTCGACATC ACCGTCGGCG CCTTCCGCGG GCTGTGGAAG TTCGACCAGG ACAAAGACGG CTCCATCCCG GCCGCCGACG AGGTCGCCGC GCGCCTGCCC CTGGTCGGCT ACCGCGGCGT GCGCCTGGCA CCGGGCAGCA AGCGCGCCGG CCTGCGCCGC GCCGGCATGC GCATCACGCT CGGCGGCATC GCCAAGGGCT ACGCGGTCGA TCGCGCCGTG GCCCTGTTGC GCGCCCGCGG CTACCGCGAC TTCCTCATCC AGGCCGGCGG CGATCTCTTC GTCGCCGGGC GCCGCGGCGA CCGCCCCTGG CGCGTCGGCA TCCGCGATCC CCGCGGCGAC GCCGCCAGCC CCTTCGCGGT CGCCGAGATC GAGAACCAGA CCTTCTCGAC CTCGGGCGAC TACGAGCGCT CGGTGGTCCG CGACGGCGTG CGCTACCACC ACATCCTCGA CCCCAGCAGC GGACGCCCGG CCGATAAAAG CCGCTCGGTC ACCGTCATGG CCGCCGACGC GCTCACGGCC GACGCCTGGT CCACGGCCCT GTTCGTCATG GGCGCCGAGC GCGGCCTGCC GCTGGTCGAG AAGCTGCCCG GCGTCGAGGC CGTGTTCGTC GACGCCGACA ACCGCGTGCA CGTGTCCTCG GGCTTGCAGG ACAAGCTGCA CGTGCTGCGG CCGCCGACCC CGGGCATCTG A
Protein sequence | MTIPPPPIPP RRSRIARRPR LRSPGLAAAG LAAALLAAPL ARPARAAPPA QQQQQQQQAQ AAQAQEAAKH MFVRSAQTMG TRVTLSLWSH DEAEAARAAA AVFEEFQRVD ALMTSWGEDS DVARINAAAG GPLVRVDAEV MGVLQRAQQI ARRSAGAFDI TVGAFRGLWK FDQDKDGSIP AADEVAARLP LVGYRGVRLA PGSKRAGLRR AGMRITLGGI AKGYAVDRAV ALLRARGYRD FLIQAGGDLF VAGRRGDRPW RVGIRDPRGD AASPFAVAEI ENQTFSTSGD YERSVVRDGV RYHHILDPSS GRPADKSRSV TVMAADALTA DAWSTALFVM GAERGLPLVE KLPGVEAVFV DADNRVHVSS GLQDKLHVLR PPTPGI
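Both sequences are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch of how the gene sequence could be stripped of whitespace, translated, and checked for GC content. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons). The helper names `clean_sequence`, `translate`, and `gc_fraction` are hypothetical, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Compact codon table in NCBI order (base order T, C, A, G).
# Amino acid assignments are identical for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(seq: str) -> str:
    """Remove the display spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean_sequence(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(seq)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence in this record.
print(translate("ATGACGATCC CGCCGCCGCC GATCCCGCCG"))  # MTIPPPPIPP
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.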