Gene Information |
Locus tag | Sked_29510 |
Symbol | |
ID | 8634585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sanguibacter keddieii DSM 10542 |
Kingdom | Bacteria |
Replicon accession | NC_013521 |
Strand | - |
Start bp | 3285423 |
End bp | 3288401 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | ATP-dependent transcriptional regulator |
Protein accession | YP_003315687 |
Protein GI | 269796232 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
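The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 3285423
END_BP = 3288401
REPORTED_GENE_LENGTH = 2979     # bp
REPORTED_PROTEIN_LENGTH = 992   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    # Both comparisons are expected to hold for this record.
    print(length == REPORTED_GENE_LENGTH)
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under those assumptions, 3288401 - 3285423 + 1 gives 2979 bp, and 2979 / 3 - 1 gives 992 aa, which agrees with the reported values.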
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.611914 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGGAG ACTCGGAGAG GCACGCCGGA GCGCGCGACG GCGCGCGCCC CGCCCGGCTG CCCCGCGCGC CCTCGGTCCT CGTCGCCAAG CCGACGCTCA CCGCCCTGCT CGACCGCCAC GAGCCGCTCA CGGTGCTGCG GGCCCCCAAG GGGTACGGGA AGACCGCGCT CGTCAGCGGG TGGGCCAGGG GCCGCGCCGC GCACGTCGAC CTGGTGTGGG TGTCGTGCCG TCTCCCGTCG GTGCTCTCCG ACGGCATGCC CGCTCCGTTC TGGGCACGGC TGGCCGACGA GATGTGGCGT TCCGGTCTCG GCGTGGTCGA GCCGGGGCTC GCCCCTGGCC GCAGCGACGT GGTCGCCGCG CTGGCCGGGC GCTCCGACCC GCTGTGCGTG GTGGTCGACG ACTTCGACGA CAACACCGAC CCGCACGTGG CCGGCGAGCT GCTCGACCTC GTGAGCCGTT TCGACACCCT CGACGTCGTC GTCCTGGCAC GGTCGCCCCG CACGGCCGAC GCGCTCCTGG CGTCGACGGT GGACTCGGTC GAGCTGGGAC CCCAGGACCT CCTGCTGAGC GTGGACGACG CCGCCGGGCT CGTCTCCGCG CTGGGGCACG AGCTCACACC CGACCAGGTG CACCGGCTGG TCCGGGACAC CGGTGGCTGG CCGGCGCTGC TGCGGGCCGT GCTGGCGGGC TCTGTCTCTG CGTCGCGCAC CGAGGGACGG GTGAGGTCCG CCGCGCTCGG GGCAGCGGCG GACGACCTGC AGGCTGCGGT GACCGGCGGG TTCTCGGGGC GCCTACCCCC GGCTGCTCCC GGCGCCGCAG CGCTCCCTGG TCTGGTCTTC CCCGCACAGA CGACAGGCGG CCCTCCCGCC CGGCGCCCGT CGTCACGCAG CCCGTCCTCC CGCAGCCCGT CGTCGAGAGA CCTGCCCCCC GGTGGGCCGT CGCCCGCCCG CACCTCGCCT GCCGGTCCTG GAGCGCGCGT CGACCGGGCG GTGCTGCACT CGGTGGTCTC TCCCGTGGCG GAGGTCACCG TCGACCTCAC CGCAGGCTTC GCGTACCTCC GGGCGGTCTG GGACGAGCTC GACGACCAGG GGATGCGGCG GTTCGTCCAG CACACCGGGC TGCTGGGCGC GTTCACGGTG GACGACGCCG TGGCGAGCAG CGGCCTGCGC GACTCGGCCC GGCACGTGGG CGCGCTCCTC GACCTCGGGC TCCTCGTCCA GGACACGGAC GAGGACGACG CCCGGTTCCG CTACGTCGAG GCGGTGCGGC TCGCGGCCGC AGAGATCTTC CGGTCTGAGG AGCCGCGGCG GTTCCGGGTC GCGCAGCGTC GTCTCGCCCG CCTGCGGACG TCCCAGGGGC GGGTCGACGA GGCGGTGGGC CACCTCCTGG CCGCACGGAC GTGGGAGGCC TGCGCCGAGC TCGTCGACCT GCACTGGGCG ACACTCGTGG CCGAGCGCCC CGAGGTGGTC GCCGACGTGG TGGAGACCGT CCCCGAGGAG GTGCTGGCCC GGCACCCGCG CCTCGGCGTG GCGCGCGCCC ACCTCCTGCC GGTCCTGAGG CCGCAACGGC GCCGGACGTC CGGCTCGGGC GCGCCGGTCG ACGGTGCTCT CCACGACGTC GGTCCGCGCG CCACCCACCC CGAGGAGGGC TTGGCCCAGG TGCTCGGCTC GATGGAGGAG GGGCTGGGGC AGATCGTCAC GCTGCGCCTG ACGGGTGACT TCGGCGCGGC GACCACCCTG GCCGAGCGTG CGCAGGACGG GCTCGGGCCG CTGCTGGACA CAGCAGACAC CGCAGCGAAG 
GGCGAGCTGG CGCCGCTGCT CTACGAGTGG GCCGTGACGC GCCTGCTCGG CGCCGACGTC GAGGGCGCGC TCGCCGCCTT CCGCCAGGGG GCCGAGCTGG CGCGCAGCGT CGACCGGCCG GTCGTGCTCC GTGAGGCAGC GGTCGGTGCC GCCCTGTGCT GCGCGCTCCT GGGGTACCTC GAGCAGGCGG AGGAGTGGTT GCTCGAGGTG CCGCCCGGTG TACCCGTCGA GGACGCCCCG ATCCAGGAGG GGCGCGGGGT GCGCACGGTG CGGTCCATCG TGGCGCTCGA GCGGTTCGAG CCGGCCGGTC CGGAGCAGAC GGTGCTGCTG GGCGACCTGG GCGCGGCGTC GGGGCTGGGG GTGGTGGTCG TCGCGGTCCG GGCGCAGGCC GCCCTGCGTG ACGGTACGCA CTACGAGGTG CTCGCCGAGG TCGAGGAGGC GCGGGAGCAC CTCCAGGGCA CCACGGCGAC CAAGAGCCTC CGGGAGGCTC TCCTGGCGGG CGCCGCGGTC GACCTGTACC TGGGCCTCGG GTACGTGTCG AAGGCGCGGG CCGCCGTCGC GGGCATCGAC CACGGCTACG AGGTGGTCGG GCTGGCCTGC GCCCGGACGG ACCTCGTGGC GGGGGACCTG GAGGACGCCG TGACGCGGGC CTCTGCGCTG CTGGCGACGC GCGTGGCCGG TCCGCGCCTG CGGCTGGACC TCCTGCTGGT CCTGGCGGCG GCGCAGGCGG GGCTCGGGCA CCGGCACGAG GCGGCGGTCG CGCTCGAGGA GGCCACGTCG ATCTGCCGGG CGACCGGTCT GCTGCGGAGC TTTCTCAAGG TCCCGAGGGA GGTGCTCGAG GACCTCGCGC CGCTGGTACC GGGGGTCCGG GCCGTGCTGG AGGCCCCGGG GTTCGCCGAG GCCGTCGGGG TGTTCCCGAG CGAGCCGCCG CGCGCGCGGC TGAGCAGCCG TGAGCTGCAG GTGCTGCGGA GCCTGGCCGA GAGCCCCTCG CTCGCGAGCG TCGCACGGTC GCTCTTCCTG TCGTCCAACA CCGTCAAGAC GCACCTGCGC AGCATCTACC AGAAGCTCGG GACCCACTCG AGCGTCGAGA CCGTCGAGAA GGCGCGCGAC CTCGGCCTGC TCGACGACGA CGTCGCCGTC GAGGGAGAGG GCGACGAGCG CGACGGCGGC GAGCGCTGA
|
Protein sequence | MSGDSERHAG ARDGARPARL PRAPSVLVAK PTLTALLDRH EPLTVLRAPK GYGKTALVSG WARGRAAHVD LVWVSCRLPS VLSDGMPAPF WARLADEMWR SGLGVVEPGL APGRSDVVAA LAGRSDPLCV VVDDFDDNTD PHVAGELLDL VSRFDTLDVV VLARSPRTAD ALLASTVDSV ELGPQDLLLS VDDAAGLVSA LGHELTPDQV HRLVRDTGGW PALLRAVLAG SVSASRTEGR VRSAALGAAA DDLQAAVTGG FSGRLPPAAP GAAALPGLVF PAQTTGGPPA RRPSSRSPSS RSPSSRDLPP GGPSPARTSP AGPGARVDRA VLHSVVSPVA EVTVDLTAGF AYLRAVWDEL DDQGMRRFVQ HTGLLGAFTV DDAVASSGLR DSARHVGALL DLGLLVQDTD EDDARFRYVE AVRLAAAEIF RSEEPRRFRV AQRRLARLRT SQGRVDEAVG HLLAARTWEA CAELVDLHWA TLVAERPEVV ADVVETVPEE VLARHPRLGV ARAHLLPVLR PQRRRTSGSG APVDGALHDV GPRATHPEEG LAQVLGSMEE GLGQIVTLRL TGDFGAATTL AERAQDGLGP LLDTADTAAK GELAPLLYEW AVTRLLGADV EGALAAFRQG AELARSVDRP VVLREAAVGA ALCCALLGYL EQAEEWLLEV PPGVPVEDAP IQEGRGVRTV RSIVALERFE PAGPEQTVLL GDLGAASGLG VVVVAVRAQA ALRDGTHYEV LAEVEEAREH LQGTTATKSL REALLAGAAV DLYLGLGYVS KARAAVAGID HGYEVVGLAC ARTDLVAGDL EDAVTRASAL LATRVAGPRL RLDLLLVLAA AQAGLGHRHE AAVALEEATS ICRATGLLRS FLKVPREVLE DLAPLVPGVR AVLEAPGFAE AVGVFPSEPP RARLSSRELQ VLRSLAESPS LASVARSLFL SSNTVKTHLR SIYQKLGTHS SVETVEKARD LGLLDDDVAV EGEGDERDGG ER
|
| |
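The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that grouping and compute a GC fraction. It is an illustration only: the helper names are hypothetical, and the example input is just the first three blocks of the gene sequence, so its GC fraction is not expected to equal the 76% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the Sequence table.
    prefix = clean_sequence("ATGTCGGGAG ACTCGGAGAG GCACGCCGGA")
    print(len(prefix))              # number of bases after removing the grouping
    print(prefix.startswith("ATG")) # the record begins with an ATG start codon
    print(round(gc_fraction(prefix), 3))
```

Applying `clean_sequence` to the full Gene sequence field and taking `len()` of the result is a simple way to confirm it matches the reported Gene Length.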