Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_1222 |
Symbol | |
ID | 8752884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 1282070 |
End bp | 1285126 |
Gene Length | 3057 bp |
Protein Length | 1018 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | transcriptional activator domain protein |
Protein accession | YP_003408340 |
Protein GI | 284989786 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.983461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGGG CGCTCAGGTA CACACCGCCG CTCGCCAGCC GGGACCTGAT CGTGCGGCCC CGGCTCCTGG ACGCGCTGCG GAGCCGCTTC GAGCGCCCCC TCACCGCGGT GGTGGCACCC GCCGGCTTCG GCAAGACCAC GCTGCTGGGT CAGGCCGTGA GCGAGAACGC GCTGTCCCCC GCAGGGGAGG ACCGCTGGTT GACCTGTCAG CGGGACGACA CGACGCTCTC GGTCCTGGCC TCCGGTGCCT TCGCCGCGGT CGGTCTCTCG GCCCCGGTGC CGGAGGACCC GCGGCAGGCC GCGGTCGCGG TCGCCGAGGC GCTCTGGAGC GCGGCCCCGC GCCACGTGGC CCTGGTCCTC GACGACGTCC ACCTGGTGAC CCGGGGATCG CCGGCCGGCG AGTTCCTCGG CCACCTGGTC GAGGAGCTGC CGAGGAACGG GCACCTGGTC CTCGCCTCCC GTCCGCCGCT GCCGCTGTCG ATCTCGCGGC TGGTGGCCAC CGGGCACGCC GTGGTGCTGC GGGAGGGCGA CCTGCAGTTC CACGAGGACG AGGTCGCCGC GTTCGCCGAG TCCCGCGGTG TCCCACCGGA GCTGCTCAGC GACGTCGGGG GCTGGCCGGC CCTGGCCGAG CTGACGGCCA CGGCGGGCCC GTTCGCCGTC AGCGGGTACG TCTGGGAGGA GCTGCTCACG CAGCTGTCGC CGGAGCGCCG CCACGCACTC GCGCTGCTCG TCGCCGTGGG CGGGGCGGAC GACGAGATCG CCGCCGCGCT GCTCGGCCGG GACGTGGACC TCCGGTCCCT GCTGGACGGG CTGCCCCTGG TCGTGCGCTC CCGCTCCGGG TGGTGGTCGC TGCACGGGCT GTGGGCCGCG GCCCTCCAGC ACCACCTCGA CTCCGGGCAG GTGGCGGAGG CCCGGCGGAC CGCGGCCACC GTCCTGCGCG GCCGCGGGCA GTACCGGGAG GCGATGGGGC TGCTGCTGGA CGCCCAGGCG TGGGAGGACG TCCGCCGGCT GGTCGTGGAG GTCTGCGAGT GCTGCACCGC GCTGGTGCCG ACCGACGTCC TCGAGTCGTG GCTGCGCCGG TTCCCGCCCG AGGTGCAGCA GAGCCCCGAG GGGCTGCTGC TGGCCGCCAT GGCGGCGGAG CCGACCAGCC CGGGGGCGGC CGAGGAACTC CTGGAGCAGG CGCTCGCCGG CGCGCACGGG GACCCCGACC TGCGCTACGC GTGCCTCAAC GCGCTCGTCC AACTGGCGTT CTGGGGCAAG GACCGGCAGC GGATGCAGTT CTTGCTGCAG GAGCTGGAGC AGCTGGCGGC GGAGGGGCAC CCCGGAGCCC CGGCGTTCGT CGCGCTGGTC CGCGGCGGGG AGGCGCGCTC CACCGACGAG GTGCGATCCG CGCTGGCGGA CCCGGGTCTG ATGTCGCGGA CGGCGCTCAA CTCCGTACAG CACCCCGTCC AGCAGTGGCT GCACGCCCAC CTCGTCCTGC TGAGGCTGGG CGACGCCTCG ACCGGCGAGG TGCTCGCCCG GCGGGCGCTG TCCCACCCGG CGACCACGAT GCACGGCGTC TCGCGGAGCC TCCTCCTGGA GTCCTTCCGG CTCCGGGGAC GCCTGGAGGA GGCGGAGCGC CTGCTGCCGT ACCTGCTCGG CGACATGCGG CCGGACAAGG TCCTCACGTC CCCCGACCTG GTGCCCTGTG CGGTCTCCGT GCTCGACGTC CTGGGGCGGC ACGGGGAGGC GGGGGAGCTC CTCGGGAGGT ACCGGCCGGT CCTCGGCGCC TCCCCGGTCG CCTGGGCACC CCACGCCCGG ACGCTGGCCG AGGCCTTCCA CGCCCTCTCG GGGGGCGCCG AGCACCAGGC GGCGACCACA CTGCGGTCGA TCGTCCCGTC GAACGGCGGG CGCCTCTCCG GTGCGCTGCG GGTCTCCCCG GTCGCGCTGC CGCTGCTCTA CGTGCTCCTG CCGGAGCTCC GGGACCGCTG GGACGCCGAC CCGCCGCCCG GCGCCCTCGC CGACCTGCAC GCCGGGGCGC GCGCCCTGGT CCAGCTGCGC GAGCACGGGT CGACGGCGGC GGCCGGCGCC CTCCCCTCCA GCGTCTGGCC GGTCCTGCGC GCCCTCCTGC CGGTGCCGTG GGTGGCCGAG CTGGCCCTGG GGATGGTGGC AGCCGGTCAG GACGGTGCCC GTGCGCTGAT CGAGGACCTC GGCCCCGCCG CCCGCGCGAC CCTGCGCGCC CAGGCCGCGA CGGCACCCCC GACCCTCGCG GCCACCGCCC GCTCGCTGCT GCGCGCGATC CCGGCGGTGC CGACCGCACG GGTGCACCTG CGCGTCCTCG GCCCGCTCGA GCTCCGCCGC GACGGTGTGG TCGTCGCCGC GCCGGAGCTG CGCAGGGAGC GGGTGCGTCA GCTCCTCGGC TACCTGCTCC TCCACGACCG GCCGACCCGG ACCGCCATCA CCTCCGAGCT GTGGCCCGAC CTCGACGACG CGGCGGCCGG CCGGAACCTG CGCGTCACCC TCACCTACCT GCAGAACCTC CTCGAGCCCG ACCGCGGCGA GCTCGACCCG CCGTACTTCC TGCGGAGTGC CGGCCCGGTG CTGCACCTGG TGACCGACGG CGCGCTGGAG ATCGACGTGC TGCAGTTCGA GCGGACGCTG GACGAGGCCG CCCGGCTGGA GCGCCAGGGC GCGCCCTCGG CTGCACTCTC GGCCTACCTG CGGGCAGCAG AGCTGTGGAG TGGGAACCTC CTCGCCGACG TGACCGGTGC GAGCTGGCTG GAGTGGGAGC GCGACCGCCT GCGGAGCCGG TTCGTCTCCA GCGCCGTCCG GGCCGGCAAC CTGCTGCTGG CGCGGGGGGA CACCACGACG GCCCGGACCC TCGCCGAGCG TGCCCTGCGG GCCGACGACT GCTCGGAGGA CGCCCACCAG CTGCTCATCG CCGTCCACCT CGCGGACGGC GACCTCGGCG ACGCCCACCG CGCCCTGCGC CGCTGCCAGC AGATGCTGCG CGAGCTGGGT GTGCCGCCCC AGCCGCGCAC GCGCGCGCTC GCGCAGCGGC TGATCCCCCG CGGCTGA
|
Protein sequence | MSRALRYTPP LASRDLIVRP RLLDALRSRF ERPLTAVVAP AGFGKTTLLG QAVSENALSP AGEDRWLTCQ RDDTTLSVLA SGAFAAVGLS APVPEDPRQA AVAVAEALWS AAPRHVALVL DDVHLVTRGS PAGEFLGHLV EELPRNGHLV LASRPPLPLS ISRLVATGHA VVLREGDLQF HEDEVAAFAE SRGVPPELLS DVGGWPALAE LTATAGPFAV SGYVWEELLT QLSPERRHAL ALLVAVGGAD DEIAAALLGR DVDLRSLLDG LPLVVRSRSG WWSLHGLWAA ALQHHLDSGQ VAEARRTAAT VLRGRGQYRE AMGLLLDAQA WEDVRRLVVE VCECCTALVP TDVLESWLRR FPPEVQQSPE GLLLAAMAAE PTSPGAAEEL LEQALAGAHG DPDLRYACLN ALVQLAFWGK DRQRMQFLLQ ELEQLAAEGH PGAPAFVALV RGGEARSTDE VRSALADPGL MSRTALNSVQ HPVQQWLHAH LVLLRLGDAS TGEVLARRAL SHPATTMHGV SRSLLLESFR LRGRLEEAER LLPYLLGDMR PDKVLTSPDL VPCAVSVLDV LGRHGEAGEL LGRYRPVLGA SPVAWAPHAR TLAEAFHALS GGAEHQAATT LRSIVPSNGG RLSGALRVSP VALPLLYVLL PELRDRWDAD PPPGALADLH AGARALVQLR EHGSTAAAGA LPSSVWPVLR ALLPVPWVAE LALGMVAAGQ DGARALIEDL GPAARATLRA QAATAPPTLA ATARSLLRAI PAVPTARVHL RVLGPLELRR DGVVVAAPEL RRERVRQLLG YLLLHDRPTR TAITSELWPD LDDAAAGRNL RVTLTYLQNL LEPDRGELDP PYFLRSAGPV LHLVTDGALE IDVLQFERTL DEAARLERQG APSAALSAYL RAAELWSGNL LADVTGASWL EWERDRLRSR FVSSAVRAGN LLLARGDTTT ARTLAERALR ADDCSEDAHQ LLIAVHLADG DLGDAHRALR RCQQMLRELG VPPQPRTRAL AQRLIPRG
|
| |