Gene Information |
Locus tag | Sked_14320 |
Symbol | |
ID | 8633068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sanguibacter keddieii DSM 10542 |
Kingdom | Bacteria |
Replicon accession | NC_013521 |
Strand | + |
Start bp | 1611715 |
End bp | 1614693 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | beta-galactosidase/beta-glucuronidase |
Protein accession | YP_003314202 |
Protein GI | 269794747 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
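The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function name `cds_lengths` is hypothetical and not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon,
    which is not translated into an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Sked_14320.
gene_len, protein_len = cds_lengths(1611715, 1614693)
print(gene_len, protein_len)
```

With the start and end positions listed in the record, this reproduces the stated gene length of 2979 bp and protein length of 992 aa.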
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.99262 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCCGTC ACGACACCGT CAGCCTGCCG ACCGGGACCG TTCCCGCGCG GTTCCGCCCG CCGACAGGCG CCTCCGACGC CGCGTCGGTG AGCCTCGACG GCCCGTGGCG CTTCCGGCTG TTCCCGTCGG CCGACACCGG TGCCGACCCT GCAGACCGCG GCGACGAGTG GGACACCATC GAGGTCCCCG GGCACTGGCA GCTCGCCGGT GCACCCGACA CCTGGCCCTA CGGCACCCCG GCCTACAGCA ACGTGCTCTA CCCCTTCCCG GTCGAGCCGC CCTTCGTGCC CGACGCGAAC CCGACGGGGG AGTACCGCCG GACCTTCGAG GTCCCGGCCG GGTGGGCCGC TGACGGTCGC GCCTACCTGC GCTTCGAGGG CGTGGACTCG TGGTTCGAGG TGTCGGTCAA CGGCGCCGTC GTCGCCACCT CGCACGGCTC CCGCCTGGCG ACAGAGATCG ACGTCACCGA CGTCGTCGTC CCGGGCGAGA ACCTGCTCTC GGTCCGCGTG ACCCAGTGGT CGGCGCTCAG CTACGTCGAG GACCAGGACC AGTGGTGGCT GTCGGGCATC TTCCGCAGCG TGACCCTCGA GCACCGGCCG GAGGCGAGCG TCGCGCACGC GACCGTCGTC GCGGACTACG ACCACACCAC GGGGACCGGC TCGCTCACCG TCGAGGTCGA GGGATCTGCC GAGGCTCGCG TCACCGTCCC CGAGCTCGGC ATCGACGTCG CGGCCGGCGA GACCGCGACT GCGCAGGTCG AGCCGTGGAG CGCCGAGCGG CCGCGTCTCT ACGACGTGAC CGTGACGACG CCGGGGGAGA CCGTCACCCT CCGTGCCGGG TTCCGCACCG TCTCGATCGA GGACGGGGTC TTCCTCGTCA ACGGCGCCCC CGTGAAGCTG CGTGGGGTCA ACCGCCACGA GTTCGACCCG CTTCGTGGCC GCTCGGTCAC CCCGGAGCGC ATGCTCGAGG ACGTCCTCCT CATGAAGCGT CACCACGTCA ACGCCGTGCG CACCAGCCAC TACCCGCCGC ACCCGCACTT CCTCGACCTG TGCGACGAGC ACGGCCTGTA CGTCGTCGAC GAGAACGACC TCGAGACGCA CGGGTTCATC GACGGGGGAT GGGTCGGCAA CCCCACCGAC GACCCGGCGT GGGAGGACGT CCTCGTCGAC CGTGTCACCC GGACCGTCCG CCGCGACGCC CACCACCCGA GCATCGTCCT GTGGTCGCTC GGCAACGAGG CCGGCAGCGG GTGCAACGTC GCCGCCATGA CCGCGGCCGT CCGCGCCCTC GACCCGACCC GCCCGGTGCA CTACGAGGGC GACTGGTCCT CCGACGACGT CGACGTGTAC TCCCGCATGT ACGCGACGAG CGAGGAGACC GAGCTCATCG GCCAGGGCGT CGAGGCGCCG CTCGCCCGCG CCGAGGCCGA CGCCCGCCGC CGTCAGATGC CGTTCCTGCA GTGCGAGTAC GCGCACGCGA TGGGCAACGG TCCCGGCGGG CTGAGCGACT ACGACGCGAT CTTCGACCGG TACCCGCGAC TCATGGGCGG GTTCATCTGG GAGTGGATCG ACCACGGCCT CACGCGCCGC ACCGACGACG GGACCGAGTT CGCGGCCTAC GGCGGCGACT TCGGCGAGGA CTTCCACGAC GGCACCTTCA TCGCCGACGG CCTCGTCCTG CCCGACCGCA CGCCGGCCCC GTCGCTCGTC GAGATGGCCG CGGTCTTCGC ACCCGTGCGC CTCGACCCGA CGGCCGACGG GACCACCCTG CTGGTCCGCA ACCGGTACGC CTTCCGCGAC 
ACCTCGCACC TGACCTTCGA GTGGACCCTC CACCACGGGG ACGACGTCCT CGCCACGGAC ACCCTCGACG ACCTCGTGCT CGCACCCGGT GAGGAGCGGA GCGTCGCCCC GCCCGCGGGC CTCGACCTGC CGTCGCCCGA CCACACCCCG ACCTGGTGGA CGCTGCGTGC GGTCCAGCAC GAGGCCGACC ACCTCGACGC ACCGTGGTTC GACGACGCGG CCGGGTCGTT CGTCGTGTCC AGCGGCCAGC TCGCCCTGAC CCCGGCCGTC TCCCTGCCGG AGGCGACCGG CCGCGCGCAG CAGACCTCGG ACGGCTTCAC CGTCGGGCAC GCGCGCTTCG ACGCGCGCGG CACCCTCCTC GAGCTGCACG GGCGCCCGGT GGCCGAGCTG CGCGTGGACG CCTGGCGCGC ACCGACCGAC AACGACCGTC GCGAGGGCAG CTGGGTCGAC CTCTCGGACG AGAGCGTCTG GAAGGAGAGC GGCCTGCACC TGCTCGCCGA GCGCAAGGTC TCGGCCGAGA TCACGGACGG CGCGCTGGTC GTGACCGCCC GCACCGCAGG CCCACGCACC CGCAACGGGT TCCGCACCGT GTACACGTGG CGCCCTGTCG CCGACGGCTC AGACGACGTC GACCTGCACG TCGCCATCAC CCCCGAGGGG CGCTGGGGCC AGAGCATCGC GCGCCTCGGT GTGCTGCTCG TGCTCGACGA GCCCGGAGCC GAGGACGTCG CCGTGCGCTG GCACGGGCTC GGCCCGGAGG AGTCGTACTC CGACTCCCGG AAGGCTGCCG TCGGCGGTGC GTACGAGCAC ACCGTCCGCA GCCTGCAGAC GCGCTACACG CACCCGCAGG AGAACGGCGC CCGACGCGGC GTGACCCACG CGGAGCTGAC CTTCGCCGAC GGGTCGGAGC TCACGCTCGA CGCCGGACCG AGCAGTGTCG GCGGCCGCAG CACGGCAGGC CTCGAGCTGT CGCTGCGCCC GTGGTCCGAC ACGGCACTCG ACCAGGCCGC GCACCCGCAC GACCTCGTGC CCGACGGGAA GCTCTGGCTG CACCTCGACG CCCGCCAGCA CGGCTTGGGC AGCGCGGCGT GCGGACCGGG CGTGCTCTCC AACGCCCGCC TCACGGCGGC GCCGGCCGAG GTGTCGGTGC GGCTCGCGTC GCAGCCGGTG GGACGCTGA
|
Protein sequence | MPRHDTVSLP TGTVPARFRP PTGASDAASV SLDGPWRFRL FPSADTGADP ADRGDEWDTI EVPGHWQLAG APDTWPYGTP AYSNVLYPFP VEPPFVPDAN PTGEYRRTFE VPAGWAADGR AYLRFEGVDS WFEVSVNGAV VATSHGSRLA TEIDVTDVVV PGENLLSVRV TQWSALSYVE DQDQWWLSGI FRSVTLEHRP EASVAHATVV ADYDHTTGTG SLTVEVEGSA EARVTVPELG IDVAAGETAT AQVEPWSAER PRLYDVTVTT PGETVTLRAG FRTVSIEDGV FLVNGAPVKL RGVNRHEFDP LRGRSVTPER MLEDVLLMKR HHVNAVRTSH YPPHPHFLDL CDEHGLYVVD ENDLETHGFI DGGWVGNPTD DPAWEDVLVD RVTRTVRRDA HHPSIVLWSL GNEAGSGCNV AAMTAAVRAL DPTRPVHYEG DWSSDDVDVY SRMYATSEET ELIGQGVEAP LARAEADARR RQMPFLQCEY AHAMGNGPGG LSDYDAIFDR YPRLMGGFIW EWIDHGLTRR TDDGTEFAAY GGDFGEDFHD GTFIADGLVL PDRTPAPSLV EMAAVFAPVR LDPTADGTTL LVRNRYAFRD TSHLTFEWTL HHGDDVLATD TLDDLVLAPG EERSVAPPAG LDLPSPDHTP TWWTLRAVQH EADHLDAPWF DDAAGSFVVS SGQLALTPAV SLPEATGRAQ QTSDGFTVGH ARFDARGTLL ELHGRPVAEL RVDAWRAPTD NDRREGSWVD LSDESVWKES GLHLLAERKV SAEITDGALV VTARTAGPRT RNGFRTVYTW RPVADGSDDV DLHVAITPEG RWGQSIARLG VLLVLDEPGA EDVAVRWHGL GPEESYSDSR KAAVGGAYEH TVRSLQTRYT HPQENGARRG VTHAELTFAD GSELTLDAGP SSVGGRSTAG LELSLRPWSD TALDQAAHPH DLVPDGKLWL HLDARQHGLG SAACGPGVLS NARLTAAPAE VSVRLASQPV GR
|
| |
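The gene sequence begins with GTG while the protein sequence begins with M. This is expected under translation table 11, where GTG is one of several alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation step, not the database's own pipeline; the helper names `translate_table11` and `gc_fraction` are hypothetical, and only short fragments copied from the sequence above are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 assigns the same
# amino acids as the standard code; it differs in its set of start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(dna.split()).upper()


def translate_table11(dna: str) -> str:
    """Translate a CDS fragment, reading an initial start codon as M."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS_11:
            aa = "M"  # alternative start codons initiate with methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and last 9 bases of the gene sequence in the record.
head = "GTGCCCCGTC ACGACACCGT CAGCCTGCCG"
tail = "GGACGCTGA"
print(translate_table11(head))
print(translate_table11(tail))
print(round(gc_fraction(head), 2))
```

The first fragment translates to MPRHDTVSLP and the last to GR, matching the start and end of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence would be the way to check the reported 73% GC content.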