Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3651 |
Symbol | |
ID | 8546041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 5025794 |
End bp | 5028037 |
Gene Length | 2244 bp |
Protein Length | 747 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646388320 |
Product | Rhomboid family protein |
Protein accession | YP_003268046 |
Protein GI | 262196837 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.015996 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGTCG ATTTCCTCCT GCTCTCGGTG GTCGCGGTCA CCGCCTACCT GGTGCCCGTG GTCTGGCGAC GCCGGCCGCC GGGACGCCGG GGCTTTGGCT GGCTGCTGCT GGCCGATGGC GGCGCCGCCC TGCTGGCGCT GCTCGAGCCG CCGGGGATCG ACGCCGAGCT GATCGGTTTC ATCGCCATCG CCGCGGCCGT GTTCCTGCTG GTGATTCCGC CGCTGCTGCG CGATCTCGTG CACCGCGCGC TGCGCGCCGA CAAACCCCGC CTGGCCCTGC ATCTGCTGCA GCTCTGGGAG CTTCTGCAGC CCGGCATGCT GTCGGGCGAG GAGCGCGAGA TGATCGCCAT CCGCGTGGCC GCGCGCGACG GCAAGATCGA CGCCACCGCG GGCGAGATGC GCGCCCTGCG CGACCGCATC CCCGATCCCC ACGGCCGTGT GCACATGGAT CACCTGATCG TGTTCCTGTA CCTGAGCGCG CGGCGCTGGA GCGACGCGGT CCAGGCCTAC GAGCGCCGGC TCGAGGACGA GCCGCTGTCG CCGCAGACCT GCACCGAGCT GATCTGGGCG TACTGCGAGC TCGGCGACCT GCGCGCGGCC GCGGCCCTGG TCGAGCGCAT CGCGCAGGCG GCCGAGGCCG GCGAGCCCAT GTGGGGCTAT CTGGCGCAGC GGGCGCGGCT GATGTTTCTG GCCTTTGCCG GCCAGCGCGA GGCGCTCGAC GAGCTGTTCG CCGAGGACGG CGCGCTGCGC GCCCTGCCCC GCGCCAGCCA GGCCTTCTGG TCGGGCGTGG CGCATCTGCG CGCCGGCGAG ACCGAGGCCG CGCGCCAGCT CCTCAAGCGC GCCTCGCGCT GGACCCGCGG CGACGTGCGC GCCCGCGAGC TGGCGCGGCA GATGCTCGAT CGGCTCGAGT CGGGCGACGC CGAGCCGGCC GCGATCGCGG CCGTGGAGCT CGACGACGAG CTGCGCACCG CGGCCGAGGA TTTCCGCCGC GCGGCGCTGG CCGCGCCCGC CCTGCCCCAG CGCACGCCCC AGGTCGGACG CGTGTCCCTG CGCCAGGTGC CCGTGAGCGC GGCGCTGGTG CTGGCCAATC TGGCCGCCTT TGCCGCGGTC TACTGGATCT TCGACAGCAC CAGCGACACC GGCGCCCTGG TGCGTGCGGG CGCCAACGTC AAAGCCTGGG TCACCGAGCG CGGCCAGCTC TGGCGCCTGC CGACCTCGAT GTTCTTGCAC GTCGGCCTGC TTCACCTGCT GCTCAACGTC TACGGCCTGT GGATGCTGGG CAAGCTGGTC GAGCAGACCC TGGGCTCGGT GCGCAGCTTT GGCCTGTACA TGCTGTCCGG CCTGGTCGGC GCCTGGGCCA GCGCGCGCTT TGGCGCCGGC GGCATCTCGG CCGGGGCCTC GGGCGCTGTG CTCGGCCTGC TCGGCGCGCT GATCGCCGAG CTGGTCGTCC ACCACCGCGC CTACCCGCGC CACTTCCGCT CGGCCCTGCT CACCCCGCTG GTGTTCGTGG CCGCGGCCCA GGTCGGCATC GGCTTCTTCT ATCCGGTCAT CGATCAATGG GCGCACGTGG CCGGCCTGGC CACCGGCGCG TTTGCGGCCA TGGTGCTGTC GCCGCAGTCG CGGACGCCGC GCGGCGTCTC GCTGGCGCTC GGCGTCCTGC TGAGCGCGCT CGGCGTGGGC AGCATCGCGC TCGGCGCCCA CGGGGTCGCG ACCACGCGCT ACCGCAGCTT CTTCAACAGC GCCTGGGAGA CGCAGACCCT GGGCGGCCTG GCGTTTCGCG CGCCGGCCGG GCTGCGACGC GACTCGGGCG TGCTCATCGA CGGCGTCATG CTGTGGCTGG AAGCGACCAC CTGTCCGAAC TCGCCGAGCG GCCCGGTGCA CAGCTTGTGC GCGCCGGAGG GCGGCGATGT CGGCGCCCAG CTCGACCACG CCGAGGCCGT GCTGCGCCGC TCGTGGGGCG AGGCGACGCT GCGCGAGGTC GAGGCCGAGA GCGGGCCGGC GCCATGGCAG CGGCGCGCCC TGCACGCCAC CGACATCGGC ATCGATGGCG CGGATCGCTA CCGCATCATG CTGAGCGCGC GCGCCACGCC GAGCGCGACC TGGCTGTTCA TCATCCAGAC CCCGAGCGCG CTGGCCGACG CGCTCGAGCC GAGCGTGGTC GAGATGCTCG ACTCGGTGCG TGCGGCAGAT GCGCCCGCGC CCGCGCCCGC GCCCGCGCCC GCTGCCGATA CCGCCGGCGA CTGA
|
Protein sequence | MYVDFLLLSV VAVTAYLVPV VWRRRPPGRR GFGWLLLADG GAALLALLEP PGIDAELIGF IAIAAAVFLL VIPPLLRDLV HRALRADKPR LALHLLQLWE LLQPGMLSGE EREMIAIRVA ARDGKIDATA GEMRALRDRI PDPHGRVHMD HLIVFLYLSA RRWSDAVQAY ERRLEDEPLS PQTCTELIWA YCELGDLRAA AALVERIAQA AEAGEPMWGY LAQRARLMFL AFAGQREALD ELFAEDGALR ALPRASQAFW SGVAHLRAGE TEAARQLLKR ASRWTRGDVR ARELARQMLD RLESGDAEPA AIAAVELDDE LRTAAEDFRR AALAAPALPQ RTPQVGRVSL RQVPVSAALV LANLAAFAAV YWIFDSTSDT GALVRAGANV KAWVTERGQL WRLPTSMFLH VGLLHLLLNV YGLWMLGKLV EQTLGSVRSF GLYMLSGLVG AWASARFGAG GISAGASGAV LGLLGALIAE LVVHHRAYPR HFRSALLTPL VFVAAAQVGI GFFYPVIDQW AHVAGLATGA FAAMVLSPQS RTPRGVSLAL GVLLSALGVG SIALGAHGVA TTRYRSFFNS AWETQTLGGL AFRAPAGLRR DSGVLIDGVM LWLEATTCPN SPSGPVHSLC APEGGDVGAQ LDHAEAVLRR SWGEATLREV EAESGPAPWQ RRALHATDIG IDGADRYRIM LSARATPSAT WLFIIQTPSA LADALEPSVV EMLDSVRAAD APAPAPAPAP AADTAGD
|
| |