Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_2572 |
Symbol | |
ID | 8754243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 2674458 |
End bp | 2677331 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | transcriptional regulator, winged helix family |
Protein accession | YP_003409599 |
Protein GI | 284991045 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.526288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAGTTCC GGATCCTCGG ACCGCTCGAG GTCGCCGGGG ACGACGGTGT GCCCGTGGCC GTCGGCGGCC CGAAGCCGCG CGCCCTGCTC GCCGAGTTGT TGCTCCACCC CGGAGGCGTG GTGCCGACCG AGCGCCTGGT CGATGCTGTC TGGGGAGCAA AGCCGCCGCC GAACGCCGGC GTCGCCCTGC GCGCCTACGT TTCTCGGCTG CGTGCGGCCC TCCCCGCCGC CGAGGACGGG CCGCGGCTGC GCTACCGCGC ACCCGGCTAC CGGCTCGTGC TGACCGACGA CGAGCTCGAT GCCAGCGCGT TCACGCGGCT GGTCGCGGAG GCCCGCGAGT GCGCCACCGC GGGTGACCAC GGCCGCGCGC TGAGCCTGCT CGACAGCGCG CTCGGATTGT GGCGCGGGGA CGTGCTGGCG GAGTTCGACC CTCCAGCTCT CGGTGCCGAG GCGGATGTCG CGCGGCTCAC AGAGCTGCGA CTGCTCGCCG TAGAAGAGCG GGCCGAGACG ATGCTGCACC TGGGACGGAC ACCAGAGGTG ATCCCCGAGC TGGAGCCGCT GGTACGGCGC CATCCGGTTC GGGAACGGCT CAGCGTGCTT CTGATGCGGG CCCTCTATCT CAGCGGACGG CAGAGCGAGG CGCTGGAGGT GTTCCGGCGG CTGCGTCGGG TACTCGTCGA CGAGCTGGGC GTGGAGCCGT CGGAACCGAC CCGTGAAGTG CATCGTCAGC TGCTCGCGCA CGATCCCGTG CTGATCCCGC CAGCCGCCGT GCGGCCGACC AACCTGCCGC GACGCGGCAC CGGTTTCGTC GGCCGCACGG AGGAGCTCGC GCAGGTCACC GCGGCGCTGC GGTCGGCCCC GCTGGTGACG CTCACCGGTG CTGGCGGGGT CGGCAAGACC CGCCTGGCGC TCGAGGTGGC CGAGGCCGAG CGGGCTCGGT TCGCCAACGG GGTCTGGCTC TGCGAGCTGG CCCCACTGCC CGATGAAGGG CCGGTGAGCC ACGCGGTGGC CGCCGCCCTA CGCGTGCAGC AACGGCACGG GCTGACGATC GAGCAGACGG TGATCGAGTA CCTCCGGCCA CGGCAGTTGC TGCTGGTCCT GGACAACTGC GAGCACGTGC TCGACGCCGC GGCCCGGCTG GTACAGCAGG TCGTCGCTCA GTGCCCGGCG GTCGGTGTGC TGGCCACCAG TCGGGAGGCG CTCGGCGTGG ACGGCGAGCA GGCGTGGCCG GTGCCGCCGC TGTCCGAGCA CGACGCAGCC GCGCTCTTCG TGCAACGAGC GCGAGCGACC AGCCCGGGAT TCCACCCGGA CGGCGCCATC GACGGCTCGG TGGCCGACAT CTGCCGACGC CTCGACGGCC TGCCGTTGGC CATCGAGCTG GTCGCGGCGC AGATGCGCGT GATGACCCCG GCGGAGATGG CCCGGCGGCT CGACGACGAA CAGCTGCGCG TTCCGGGACC GCGGACGGCG CAGCTGCGCC ACCGGAGCCT GGCTGCGGCG ATCGACTGGT CCTACCGGCT GCTCTGCGAG CGCGAACGGC AGTTGTTCGC CCGGCTGTCG GTGTTCCGCG GTGGTGCCGA CTTCCCCGCC GTGCACGCGG TGTGCGCGGA GCCCGACGAC AGCGAGGACG ACACTCTCGA CCTGCTGACC GCGCTGATCG ACAAGTCGAT GGTCAAGGTC GGCCACGCTG CCGGAGACAG CAGCTACCGC GTCCTCGAGC CGCTGCGGGC GTACGGCCGG GATCGCCTCC CCGGAGACGC CGCGCTGCCC CGCCGACACG CCGCGTACTT CACAGCGCTG GCCGAGCAGG CGGCGCGCGG CATGCGCGGC CCGGACGAAG GGGCCTGGGT CGAGCGGACG AGTCCTGCGG TGGACAACCT GCGCGCGGCC TTCGAACAGG TCATGGCCGA CGGAGACACC GAGCTGGCCC TGCGGCTGGT GACCGCACTG CCGGAGGTCC TCCACATCCG CGTCGGCTAC GAGGCCGCCG GGTGGGCCGA ACGGGCCCTC GCCCTCGCCA CGGCCGAGCA CCCGCTGTAC GTCCCCTGCG TGGGCGCAGC CGCCCGCGGC GCATGGAACG TGGGCGACTT CCCGCGGGCC CGCCGGCTGG CCGAGCGAGC CGGCGGCCGC AACCCCCCAC CCGGCACCGC CCGGGTGGCC TACCCCGGCG ACGTGCTGGC CGACGTCGCG CTGTACGAAG GCGACGCCGC CTCGGCCCTG TGCCACTACG AGACGGAGGT GCTCCGCGCG CGGCGCGACG GCGACCCCAT CCGCCTGGTC TGGACCCTGT ACTACGTCGC CGTGTGTCAT GCGGTCCTGC GCACGCCCGC CCGGGGCCTC CCCGCGGCCG AGGAGAGCCT GCAGGTGGCC GAGGCGACCG CCAACCCCAC CGCCCGGTCG ATGGCCCGGT ACGCCCTCGG CCTCGCGCTC AAGAAGTCCG ACCCCGACCG CGCGCTGGCG CTCTTCGACG AGGCGGAGGC TCTGGCCGCG TCGGTCGGCA ACTCCTGGTG GCGGGGTGTC GCTCTGATGG AGGCGGCGGC CACCCGCGCC GTGCACCGCG ACCCGGCTGC CGCCGCACGG GCACTCGCCG ACGTCCTCGA CCACTGGGAG CGGGTCGGCG ACTGGACCCA GCAGTGGCTC AACCTCCGGT ACATCATCCG GTTGCTGGTC CGGCTCGGCC ACGACGAAGA CGCCGTCGTC CTGCACCACT GTCTGCTCAC GGCCGCGAAG CCCTCCCCCC TCGACACGGC GCGGCTGGCG GGGCTGCGCG ACCGCCTCGA CCGCGGGCGG TACGCCGCCG CCGCAACCCG GGGCGCCGGC CTGTCGGCCA CCGAGGCCGT CCTCCACGCG CGCGCGGCGC TACGTGCCGC TTGA
|
Protein sequence | MQFRILGPLE VAGDDGVPVA VGGPKPRALL AELLLHPGGV VPTERLVDAV WGAKPPPNAG VALRAYVSRL RAALPAAEDG PRLRYRAPGY RLVLTDDELD ASAFTRLVAE ARECATAGDH GRALSLLDSA LGLWRGDVLA EFDPPALGAE ADVARLTELR LLAVEERAET MLHLGRTPEV IPELEPLVRR HPVRERLSVL LMRALYLSGR QSEALEVFRR LRRVLVDELG VEPSEPTREV HRQLLAHDPV LIPPAAVRPT NLPRRGTGFV GRTEELAQVT AALRSAPLVT LTGAGGVGKT RLALEVAEAE RARFANGVWL CELAPLPDEG PVSHAVAAAL RVQQRHGLTI EQTVIEYLRP RQLLLVLDNC EHVLDAAARL VQQVVAQCPA VGVLATSREA LGVDGEQAWP VPPLSEHDAA ALFVQRARAT SPGFHPDGAI DGSVADICRR LDGLPLAIEL VAAQMRVMTP AEMARRLDDE QLRVPGPRTA QLRHRSLAAA IDWSYRLLCE RERQLFARLS VFRGGADFPA VHAVCAEPDD SEDDTLDLLT ALIDKSMVKV GHAAGDSSYR VLEPLRAYGR DRLPGDAALP RRHAAYFTAL AEQAARGMRG PDEGAWVERT SPAVDNLRAA FEQVMADGDT ELALRLVTAL PEVLHIRVGY EAAGWAERAL ALATAEHPLY VPCVGAAARG AWNVGDFPRA RRLAERAGGR NPPPGTARVA YPGDVLADVA LYEGDAASAL CHYETEVLRA RRDGDPIRLV WTLYYVAVCH AVLRTPARGL PAAEESLQVA EATANPTARS MARYALGLAL KKSDPDRALA LFDEAEALAA SVGNSWWRGV ALMEAAATRA VHRDPAAAAR ALADVLDHWE RVGDWTQQWL NLRYIIRLLV RLGHDEDAVV LHHCLLTAAK PSPLDTARLA GLRDRLDRGR YAAAATRGAG LSATEAVLHA RAALRAA
|
| |