Gene Information |
Locus tag | Hoch_6638 |
Symbol | |
ID | 8549055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 9098416 |
End bp | 9100350 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646391298 |
Product | glycoside hydrolase family 35 |
Protein accession | YP_003270997 |
Protein GI | 262199788 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
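The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon. The record does not state the coordinate convention explicitly, although the listed lengths agree with it. The function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length_bp(9098416, 9100350)
print(length)                     # 1935
print(protein_length_aa(length))  # 644
```

Both results match the Gene Length (1935 bp) and Protein Length (644 aa) fields.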
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.119053 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000205681 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGAAGCAC ACCGCCGCAG CACCCTCGGC GCAGCCGGAA TAATCCTCGA TGACGCAGAC GGCGGCGCCG GCGCGCGCGA GTTGCCGCTG TTCTCGGGCG TGCTGCACTA CTGGCGCGTC GACCGCGCCG ACTGGCGCGC GTGTCTCGCC GCCATGCGCG AGATCGGCTT CGCGATGGTC CACGTGCCGG TGCCGTGGAG CGTGCACGAG CGCGGCCCGG GCAGCTACGT TTGGCGCCAG GAGCGCGACC TCGGTGGGTT TCTCGACCTG CTGGCCGAAA TGGGCATGCA CGCGGTGCTC GAACCCGGGC CCGTGATCGG CGCCGAGCTG CCGGGTGGCG GCGTGCCCGC GCGCATCCTC GACGACGCCG CGCTGCTGGC GCGGACCGCG CGCGGTGCGC CGGCGTGGGT GACGGCGGTG CCGCAGATGA TGACCGTGCC CTGCCTGGCG GCCTCGGCGC TGCGCGAGCA GGTCCTGGCC TGGTTCGCTG CGGTAGCCGA AATCGCGGCG CCGCGCCTGG TGCCGAACGG GCCGGTCGCG GCCGTGCACC TGGGTCGTCA CGCCGATCTC GTCGCCCAGC TCGGCCCCTT CGAGTACGGC TATCAACCCG ATGCGCTGCG CCTGTGGCAG GAGCTCGAGG GCGGCGAGCC GCCGCGGCAG TGGTCGGCCG AAGACGGCGC GCGCATGGCG GCGTGGATGC AATGCCGCGA GGTGGGCATG CGCCGCGATC TGGTCTGGCT GTCGGGCGCG CTCGACCAGG TCGGACTGCG CGAGGTGGTG CGCCTCGTCG ATCTACCGTG GTCGCTGCCC GGCGCCTTCG ACATCGCCGC GCTCGAACGC GCGCTGGGCG GCGAGGTCGC TGTCGGCATG CGCGTCGGCC ACCGTCCAGC CGTACCCGCG GCCCTGCACC GGCGCGCGCT ATACCTCACC GGCTCGGCGC GCCTGCCTGT ATTCGGACAG GTGTCCGTCG GTGGCCCGAC GCTGGGGGTG CCGGTCCCTG TGGACGCCGC TCAGCGCGCC GTGCTCGGCG TCTTGGCTGC GGGCGCGCGC GGCGTCGGTC TGCATCTGCT GGCCGAGTGC AGCGGCTGGT ACGGGGCGCC CGTGAGCGAA CTCGGCGAGG CCCAGCAGGC CGCCGAGTGG CTCACGCGGG TGCTCGCGGC GCTGCGTGAA GTATCGTGGA CCACGCTGCG CAGGCACACG CCGGTGGCCG TGATGGTCGG CCGCGCGGAG CAGCGCTTTG CCGCCGCGTC CTCGGCTGCG GGCGCGCTGG GGTCGCTGCT CGAGGGCTTG TTGCCGCCGG GGACCAACGA CCGGGCCAGC CTGGCGCGCG ACCTCGACGC CGCTGCCAGT CGCCGCTGGA CCGAGGCCGC CATCGATGCA CTCGAACTGG CACAGATTCC CTACCGCGTG GTCGACGAAG GCTGCGCGCC CGAGGCCTTT GCCGGGGTCC GCGCCGTGAT CGCGCCGACC CTGCGCCGTG TCGATCGCGG CGCCTGGCAG CGGCTGCACG AGCTCGCCCG CGGCGGCGCG GTGGTGATCG CCGGTCCCGA GCGTCCGCGC TGCGATGAGC GCGGCCGGGA GCTGGGCGAC GACGCCGCGC TGCCGGCGCG CGCGGGGCTC ATGCGGGCGG CCTCGCTCGA GGACCCCGAG GGCCTGGCCG ATGACCTCGC CGAGGTCGCC GGGGAGCTGT CCGAGCTGTG GCTCACGGCC GAGCAGGGCG AGGTGGACTG CTCGCTGTTC AGCGACCCGA GCGGCGCGCC GCGGGTGCTG TTCGTGAGCA ACCGGCGCGC CGCGGCGGTG 
GTCGCCGATG TCCTCGTGCC CGCCGGGGTC GCGCTCGAGG ACGCGATCAC GGGTGAGACC CTGCGGCCCG GGCGCGACGG CGTCGTCGAT GTGCGCCTCG AGCCGCTGCA GATCGCCATG CTGCTGGTGC GCTGA
|
Protein sequence | MEAHRRSTLG AAGIILDDAD GGAGARELPL FSGVLHYWRV DRADWRACLA AMREIGFAMV HVPVPWSVHE RGPGSYVWRQ ERDLGGFLDL LAEMGMHAVL EPGPVIGAEL PGGGVPARIL DDAALLARTA RGAPAWVTAV PQMMTVPCLA ASALREQVLA WFAAVAEIAA PRLVPNGPVA AVHLGRHADL VAQLGPFEYG YQPDALRLWQ ELEGGEPPRQ WSAEDGARMA AWMQCREVGM RRDLVWLSGA LDQVGLREVV RLVDLPWSLP GAFDIAALER ALGGEVAVGM RVGHRPAVPA ALHRRALYLT GSARLPVFGQ VSVGGPTLGV PVPVDAAQRA VLGVLAAGAR GVGLHLLAEC SGWYGAPVSE LGEAQQAAEW LTRVLAALRE VSWTTLRRHT PVAVMVGRAE QRFAAASSAA GALGSLLEGL LPPGTNDRAS LARDLDAAAS RRWTEAAIDA LELAQIPYRV VDEGCAPEAF AGVRAVIAPT LRRVDRGAWQ RLHELARGGA VVIAGPERPR CDERGRELGD DAALPARAGL MRAASLEDPE GLADDLAEVA GELSELWLTA EQGEVDCSLF SDPSGAPRVL FVSNRRAAAV VADVLVPAGV ALEDAITGET LRPGRDGVVD VRLEPLQIAM LLVR
|
| |
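The sequences above are displayed in space-separated blocks of ten characters, so the grouping has to be stripped before they are passed to other tools. The following is a minimal sketch, not part of the original record, that removes the grouping and computes a GC fraction. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its GC fraction is not expected to equal the whole-gene value in the GC content field.

```python
def strip_grouping(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = strip_grouping(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "GTGGAAGCAC ACCGCCGCAG"
print(strip_grouping(first_blocks))  # GTGGAAGCACACCGCCGCAG
print(gc_fraction(first_blocks))     # 14 of 20 bases are G or C
```

Note that the gene begins with GTG while the protein begins with M: under translation table 11, GTG is an accepted alternative start codon and is translated as methionine when it initiates a CDS.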