Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sked_20120 |
Symbol | |
ID | 8633647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sanguibacter keddieii DSM 10542 |
Kingdom | Bacteria |
Replicon accession | NC_013521 |
Strand | - |
Start bp | 2239055 |
End bp | 2242003 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Excinuclease ABC subunit A |
Protein accession | YP_003314770 |
Protein GI | 269795315 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.578838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.503443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGATC GCCTCGTCGT CCAAGGTGCC CGTGAGCACA ACCTCCGCAA CGTCGACCTC GACCTGCCCC GCGACTCCCT GATCGTCTTC ACCGGACTGT CAGGCTCGGG GAAGTCCTCG TTGGCCTTCG ACACCATCTT CGCCGAAGGG CAGCGCCGGT ACGTCGAGTC GCTGTCCGCC TACGCGCGCC AGTTCCTCGG CCAGATGGAC AAGCCTGACG TCGACTTCAT CGAGGGCCTC TCGCCCGCGG TGTCGATCGA CCAGAAGTCG ACCAACCGGA ACCCCCGGTC GACCGTGGGC ACCATCACCG AGGTGTACGA CTACCTCCGC CTGCTCTTCG CGCGCGCTGG CACGCAGCAC TGCCCGGTGT GCGGCGAGCG CGTGACCGCG CAGACGCCCC AGCAGATCGT CGACAGGCTC CTCGAGCTCC CCGAGGGCAC CCGCTACCAG GTGCTCGCAC CGGTCGTGCG TGGCCGCAAG GGCGAGTACA CGGACCTGTT CAAGGAGCTG CAGTCCAAGG GCTTCGCCCG CGCCCGGGTC GACGGCGAGG TCGTCCAGCT CACCGAGCCC CCCGCGCTCG AGAAGAAGCT CAAGCACACG ATCGAGGTGG TGGTGGACCG CCTCGTCTCG CGCGAGGGCG TCCAGCGTCG CCTCACCGAC TCGGTCGAGA CCGCGCTCGG CCTCGCCGGC GGCCTGCTGG TCGTCGAGAT GGTCGACGCC GAGGTCGGCG ACCCCGACCG TGAGCGCCGG TTCTCGGAGC ACCGCGCGTG CCCCAACGAC CACGTGCTCA CCCTCGACGA GATCGAGCCG CGGACCTTCT CCTTCAACGC GCCCTACGGT GCGTGCCCCG AGTGCACCGG CATCGGTGTC CGGCTGGAGA TCGACCCCGA GCTCGTCATC CCCGACGAGG ACCGGTCGCT GGCCGACGGC GCCGTGGCGC CGTGGTCGCA GACCTCCTCC GAGTACTACC TCCGGGTGCT CGGCGCGCTC GCCAAGGACC TCGGCTTCTC CATGGACGTG CCGTGGCGCG CCCTGCCCGA GCGCGCGCGC AACGCGGTGC TGCACGGCCA GAACCACGAG GTGCAGGTCA AGTACCGCAA CCGGTGGGGA CGCGAACGCC AGTACTCGAC CGGCTTCGAG GGCGTCATCT CCTTCCTCGA GCGCCGTCAC TCCGAGACCG AGTCCGAGTG GTCGAAGGAG AAGTACGAGG CCTACATGCG CGAGGTGCCG TGCCCCGTGT GCGACGGCAC CCGCCTCAAG CCCGAGGTGC TCGCCGTGCT CGTCGACGGC AAGTCCATCG CCGAGGTGTG CGCCCTGCCG CTGCGCGAGG CCCGCGACTT CCTCGACAAC CTGCAGCTCG GCGTGCGCGA GCGCGCCATC GCGACCCAGG TGCTCAAGGA GATCCAGGAC CGCCTCGGCT TCCTCCTCGA CGTCGGCCTC GACTACCTCT CGCTCATGCG CCCTGCCGGC ACCCTCTCGG GTGGCGAGGC GCAGCGCATC CGGCTCGCGA CCCAGATCGG GTCCGGCCTC GTCGGCGTGC TCTACGTCCT CGACGAGCCG AGCATCGGCC TGCACCAGCG CGACAACCGC CGCCTCATCG ACACGCTGAC CCGCCTGCGC GACCTCGGCA ACACGCTCAT CGTCGTCGAG CACGACGAGG ACACGATCCG CACCGCGGAC TGGATCGTCG ACATCGGCCC CGGTGCCGGC GAGCACGGTG GTCGCGTGGT GCACTCCGGC GACTACGCGG GCCTCCTCGA GGCCCCCGAG TCCGTCACCG GCGCGTACCT CGCCGGCCGC CGGTCCATCC CCATGCCGAC GCTGCGCCGT CCCGTCGACC CTAAGCGTCA GCTGACGGTC ATCGGTGCGC GCGAGCACAA CCTCACGGGC ATCGACGTGA GCTTCCCGCT CGGGGTGCTC ACTGCGGTCA CCGGTGTGTC GGGCTCGGGC AAGTCGACGC TGGTCAACTC GATCCTCTAC ACGGTGCTCG CCAACGAGCT CAACGGGGCC CGCCAGGTCG CCGGCCGTCA CAAGCGCGTC ACCGGGCTCG ACCAGCTCGA CAAGGTCGTG CACGTGGACC AGGGCCCCAT CGGCCGGACG CCGCGCTCCA ACCCGGCCAC CTACACGGGT GTGTGGGACC ACGTGCGCCG GCTCTTCGCC GACACCAGCG AGGCGAAGGT CCGCGGCTAC ACGCCCGGAC GCTTCTCCTT CAACGTCAAG GGCGGGCGCT GCGAGGCGTG CTCGGGCGAC GGCACGCTCA AGATCGAGAT GAACTTCCTC CCGGACGTCT ACGTCCCCTG CGAGGTGTGC CACGGCGCGC GGTACAACCG CGAGACCCTC GAGGTGCACT TCAAGGGCAA GACGGTCGCG GACGTGCTCG ACATGCCGAT CGAGGAGGCC AACGAGTTCT TCGCGGCGGT CCCCGCGATC TCGCGGCACC TCAAGACGCT CGTCGACGTG GGCCTCGGCT ACGTGCGGCT GGGCCAGTCG GCCCCGACGC TCTCGGGCGG TGAGGCGCAG CGCGTCAAGC TCGCGAGCGA GCTGCAGAAG CGCTCGACCG GGCGGACCAT CTACGTGCTC GACGAGCCGA CCACCGGTCT GCACTTCGAG GACATCCGCA AGCTGCTCGC GGTGCTCCAG TCGCTGGTCG ACAAGGGCAA CTCGGTGCTG GTCATCGAGC ACAACCTCGA CGTCATCAAG AACGCCGACT GGGTCATCGA CATGGGTCCT GAGGGTGGTT CCGGCGGTGG CCTCGTGGTC GCCGAGGGCA CCCCCGAGCT CGTGGCGTCG GTCGAGGCCA GCCACACCGG CCGGTTCCTC GCCGAGGTGC TGCAGAACCA CGAGCCGGCC GGCGTGGCCC CGCTGCCCAA GAAGACCGCT GCCGCGGCCG TCACCAAGAA GACGACCAAG AAGCCGGCCA GCAGGACCAA GGCCAAGCTC AAGGAGTGA
|
Protein sequence | MSDRLVVQGA REHNLRNVDL DLPRDSLIVF TGLSGSGKSS LAFDTIFAEG QRRYVESLSA YARQFLGQMD KPDVDFIEGL SPAVSIDQKS TNRNPRSTVG TITEVYDYLR LLFARAGTQH CPVCGERVTA QTPQQIVDRL LELPEGTRYQ VLAPVVRGRK GEYTDLFKEL QSKGFARARV DGEVVQLTEP PALEKKLKHT IEVVVDRLVS REGVQRRLTD SVETALGLAG GLLVVEMVDA EVGDPDRERR FSEHRACPND HVLTLDEIEP RTFSFNAPYG ACPECTGIGV RLEIDPELVI PDEDRSLADG AVAPWSQTSS EYYLRVLGAL AKDLGFSMDV PWRALPERAR NAVLHGQNHE VQVKYRNRWG RERQYSTGFE GVISFLERRH SETESEWSKE KYEAYMREVP CPVCDGTRLK PEVLAVLVDG KSIAEVCALP LREARDFLDN LQLGVRERAI ATQVLKEIQD RLGFLLDVGL DYLSLMRPAG TLSGGEAQRI RLATQIGSGL VGVLYVLDEP SIGLHQRDNR RLIDTLTRLR DLGNTLIVVE HDEDTIRTAD WIVDIGPGAG EHGGRVVHSG DYAGLLEAPE SVTGAYLAGR RSIPMPTLRR PVDPKRQLTV IGAREHNLTG IDVSFPLGVL TAVTGVSGSG KSTLVNSILY TVLANELNGA RQVAGRHKRV TGLDQLDKVV HVDQGPIGRT PRSNPATYTG VWDHVRRLFA DTSEAKVRGY TPGRFSFNVK GGRCEACSGD GTLKIEMNFL PDVYVPCEVC HGARYNRETL EVHFKGKTVA DVLDMPIEEA NEFFAAVPAI SRHLKTLVDV GLGYVRLGQS APTLSGGEAQ RVKLASELQK RSTGRTIYVL DEPTTGLHFE DIRKLLAVLQ SLVDKGNSVL VIEHNLDVIK NADWVIDMGP EGGSGGGLVV AEGTPELVAS VEASHTGRFL AEVLQNHEPA GVAPLPKKTA AAAVTKKTTK KPASRTKAKL KE
|
| |