Gene Gobs_1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_1222 
Symbol 
ID8752884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp1282070 
End bp1285126 
Gene Length3057 bp 
Protein Length1018 aa 
Translation table11 
GC content76% 
IMG OID 
Producttranscriptional activator domain protein 
Protein accessionYP_003408340 
Protein GI284989786 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.983461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGGG CGCTCAGGTA CACACCGCCG CTCGCCAGCC GGGACCTGAT CGTGCGGCCC 
CGGCTCCTGG ACGCGCTGCG GAGCCGCTTC GAGCGCCCCC TCACCGCGGT GGTGGCACCC
GCCGGCTTCG GCAAGACCAC GCTGCTGGGT CAGGCCGTGA GCGAGAACGC GCTGTCCCCC
GCAGGGGAGG ACCGCTGGTT GACCTGTCAG CGGGACGACA CGACGCTCTC GGTCCTGGCC
TCCGGTGCCT TCGCCGCGGT CGGTCTCTCG GCCCCGGTGC CGGAGGACCC GCGGCAGGCC
GCGGTCGCGG TCGCCGAGGC GCTCTGGAGC GCGGCCCCGC GCCACGTGGC CCTGGTCCTC
GACGACGTCC ACCTGGTGAC CCGGGGATCG CCGGCCGGCG AGTTCCTCGG CCACCTGGTC
GAGGAGCTGC CGAGGAACGG GCACCTGGTC CTCGCCTCCC GTCCGCCGCT GCCGCTGTCG
ATCTCGCGGC TGGTGGCCAC CGGGCACGCC GTGGTGCTGC GGGAGGGCGA CCTGCAGTTC
CACGAGGACG AGGTCGCCGC GTTCGCCGAG TCCCGCGGTG TCCCACCGGA GCTGCTCAGC
GACGTCGGGG GCTGGCCGGC CCTGGCCGAG CTGACGGCCA CGGCGGGCCC GTTCGCCGTC
AGCGGGTACG TCTGGGAGGA GCTGCTCACG CAGCTGTCGC CGGAGCGCCG CCACGCACTC
GCGCTGCTCG TCGCCGTGGG CGGGGCGGAC GACGAGATCG CCGCCGCGCT GCTCGGCCGG
GACGTGGACC TCCGGTCCCT GCTGGACGGG CTGCCCCTGG TCGTGCGCTC CCGCTCCGGG
TGGTGGTCGC TGCACGGGCT GTGGGCCGCG GCCCTCCAGC ACCACCTCGA CTCCGGGCAG
GTGGCGGAGG CCCGGCGGAC CGCGGCCACC GTCCTGCGCG GCCGCGGGCA GTACCGGGAG
GCGATGGGGC TGCTGCTGGA CGCCCAGGCG TGGGAGGACG TCCGCCGGCT GGTCGTGGAG
GTCTGCGAGT GCTGCACCGC GCTGGTGCCG ACCGACGTCC TCGAGTCGTG GCTGCGCCGG
TTCCCGCCCG AGGTGCAGCA GAGCCCCGAG GGGCTGCTGC TGGCCGCCAT GGCGGCGGAG
CCGACCAGCC CGGGGGCGGC CGAGGAACTC CTGGAGCAGG CGCTCGCCGG CGCGCACGGG
GACCCCGACC TGCGCTACGC GTGCCTCAAC GCGCTCGTCC AACTGGCGTT CTGGGGCAAG
GACCGGCAGC GGATGCAGTT CTTGCTGCAG GAGCTGGAGC AGCTGGCGGC GGAGGGGCAC
CCCGGAGCCC CGGCGTTCGT CGCGCTGGTC CGCGGCGGGG AGGCGCGCTC CACCGACGAG
GTGCGATCCG CGCTGGCGGA CCCGGGTCTG ATGTCGCGGA CGGCGCTCAA CTCCGTACAG
CACCCCGTCC AGCAGTGGCT GCACGCCCAC CTCGTCCTGC TGAGGCTGGG CGACGCCTCG
ACCGGCGAGG TGCTCGCCCG GCGGGCGCTG TCCCACCCGG CGACCACGAT GCACGGCGTC
TCGCGGAGCC TCCTCCTGGA GTCCTTCCGG CTCCGGGGAC GCCTGGAGGA GGCGGAGCGC
CTGCTGCCGT ACCTGCTCGG CGACATGCGG CCGGACAAGG TCCTCACGTC CCCCGACCTG
GTGCCCTGTG CGGTCTCCGT GCTCGACGTC CTGGGGCGGC ACGGGGAGGC GGGGGAGCTC
CTCGGGAGGT ACCGGCCGGT CCTCGGCGCC TCCCCGGTCG CCTGGGCACC CCACGCCCGG
ACGCTGGCCG AGGCCTTCCA CGCCCTCTCG GGGGGCGCCG AGCACCAGGC GGCGACCACA
CTGCGGTCGA TCGTCCCGTC GAACGGCGGG CGCCTCTCCG GTGCGCTGCG GGTCTCCCCG
GTCGCGCTGC CGCTGCTCTA CGTGCTCCTG CCGGAGCTCC GGGACCGCTG GGACGCCGAC
CCGCCGCCCG GCGCCCTCGC CGACCTGCAC GCCGGGGCGC GCGCCCTGGT CCAGCTGCGC
GAGCACGGGT CGACGGCGGC GGCCGGCGCC CTCCCCTCCA GCGTCTGGCC GGTCCTGCGC
GCCCTCCTGC CGGTGCCGTG GGTGGCCGAG CTGGCCCTGG GGATGGTGGC AGCCGGTCAG
GACGGTGCCC GTGCGCTGAT CGAGGACCTC GGCCCCGCCG CCCGCGCGAC CCTGCGCGCC
CAGGCCGCGA CGGCACCCCC GACCCTCGCG GCCACCGCCC GCTCGCTGCT GCGCGCGATC
CCGGCGGTGC CGACCGCACG GGTGCACCTG CGCGTCCTCG GCCCGCTCGA GCTCCGCCGC
GACGGTGTGG TCGTCGCCGC GCCGGAGCTG CGCAGGGAGC GGGTGCGTCA GCTCCTCGGC
TACCTGCTCC TCCACGACCG GCCGACCCGG ACCGCCATCA CCTCCGAGCT GTGGCCCGAC
CTCGACGACG CGGCGGCCGG CCGGAACCTG CGCGTCACCC TCACCTACCT GCAGAACCTC
CTCGAGCCCG ACCGCGGCGA GCTCGACCCG CCGTACTTCC TGCGGAGTGC CGGCCCGGTG
CTGCACCTGG TGACCGACGG CGCGCTGGAG ATCGACGTGC TGCAGTTCGA GCGGACGCTG
GACGAGGCCG CCCGGCTGGA GCGCCAGGGC GCGCCCTCGG CTGCACTCTC GGCCTACCTG
CGGGCAGCAG AGCTGTGGAG TGGGAACCTC CTCGCCGACG TGACCGGTGC GAGCTGGCTG
GAGTGGGAGC GCGACCGCCT GCGGAGCCGG TTCGTCTCCA GCGCCGTCCG GGCCGGCAAC
CTGCTGCTGG CGCGGGGGGA CACCACGACG GCCCGGACCC TCGCCGAGCG TGCCCTGCGG
GCCGACGACT GCTCGGAGGA CGCCCACCAG CTGCTCATCG CCGTCCACCT CGCGGACGGC
GACCTCGGCG ACGCCCACCG CGCCCTGCGC CGCTGCCAGC AGATGCTGCG CGAGCTGGGT
GTGCCGCCCC AGCCGCGCAC GCGCGCGCTC GCGCAGCGGC TGATCCCCCG CGGCTGA
 
Protein sequence
MSRALRYTPP LASRDLIVRP RLLDALRSRF ERPLTAVVAP AGFGKTTLLG QAVSENALSP 
AGEDRWLTCQ RDDTTLSVLA SGAFAAVGLS APVPEDPRQA AVAVAEALWS AAPRHVALVL
DDVHLVTRGS PAGEFLGHLV EELPRNGHLV LASRPPLPLS ISRLVATGHA VVLREGDLQF
HEDEVAAFAE SRGVPPELLS DVGGWPALAE LTATAGPFAV SGYVWEELLT QLSPERRHAL
ALLVAVGGAD DEIAAALLGR DVDLRSLLDG LPLVVRSRSG WWSLHGLWAA ALQHHLDSGQ
VAEARRTAAT VLRGRGQYRE AMGLLLDAQA WEDVRRLVVE VCECCTALVP TDVLESWLRR
FPPEVQQSPE GLLLAAMAAE PTSPGAAEEL LEQALAGAHG DPDLRYACLN ALVQLAFWGK
DRQRMQFLLQ ELEQLAAEGH PGAPAFVALV RGGEARSTDE VRSALADPGL MSRTALNSVQ
HPVQQWLHAH LVLLRLGDAS TGEVLARRAL SHPATTMHGV SRSLLLESFR LRGRLEEAER
LLPYLLGDMR PDKVLTSPDL VPCAVSVLDV LGRHGEAGEL LGRYRPVLGA SPVAWAPHAR
TLAEAFHALS GGAEHQAATT LRSIVPSNGG RLSGALRVSP VALPLLYVLL PELRDRWDAD
PPPGALADLH AGARALVQLR EHGSTAAAGA LPSSVWPVLR ALLPVPWVAE LALGMVAAGQ
DGARALIEDL GPAARATLRA QAATAPPTLA ATARSLLRAI PAVPTARVHL RVLGPLELRR
DGVVVAAPEL RRERVRQLLG YLLLHDRPTR TAITSELWPD LDDAAAGRNL RVTLTYLQNL
LEPDRGELDP PYFLRSAGPV LHLVTDGALE IDVLQFERTL DEAARLERQG APSAALSAYL
RAAELWSGNL LADVTGASWL EWERDRLRSR FVSSAVRAGN LLLARGDTTT ARTLAERALR
ADDCSEDAHQ LLIAVHLADG DLGDAHRALR RCQQMLRELG VPPQPRTRAL AQRLIPRG