Gene Gobs_1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_1996 
Symbol 
ID8753667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp2071605 
End bp2075030 
Gene Length3426 bp 
Protein Length1141 aa 
Translation table11 
GC content74% 
IMG OID 
Producttranscriptional regulator, winged helix family 
Protein accessionYP_003409060 
Protein GI284990506 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.942344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTGGTGG AGGTCCGGCT CCTCGGCGGT GTGACGGCCA TCGACGGCGA GGGGACCCCG 
CTGGACGTGG GGTCGCCGAA GTGCCGTGTC CTGCTGGCCG CGCTGGCGCT GTCGGCGGGC
GAACCGGTGC CGGCCTCACG CCTGATCGAC GTGGTGTGGC CCGACGACCC GCCCCGCACC
GCGGACAAGA CACTGCAGGG GTACGTCGCG CAGCTGCGCC GCGGGCTTGG GGCGGCGGCG
ATCACCCGGA GCGGCGGGAC CTATCGACTC GCGCTGGACC CCGACCAGGT CGACGTGGCC
CGGTTCCGTC GGCTGCTGAG CGCCGGGGAC GTCGACGGCG CGCTGGACGC CTGGGGCGGC
ACCCCGCTGG CCGGCCTGGA GGCACCCGGA CTCCAGGCCG CGGCAGACGC ACTGGTGGAC
CAGTGGATGG ACGCGACGGA GGACCGGCTG CGCCGGCGGC TGTCCACCGA CCCGGCGGCC
GTCATCGCTG CCCTGACCGA GCAGACCGCG GCTCACCCGT TCCGCGAGGA GCTGTGGGCC
CTGCTGATGA TTGCGCTGTA CCGAGCGGGA CGACAGGCCG ATGCGCTGTC CGCGTTCCAG
CGCGCCCGGG AGCACCTGAT CGACGAGCTC GGCGTCGAAC CGGGCCCACG CCTGCGGGGG
GTCGAGGCGC AGATTCTCGC CCACGACCAG CAGCTCGAGG GCAGCGCCGC GGCGGCGGGC
CCACCGGGGC CAGCGCCCGG GCCCGGACCG ACCGGGACCG TGACGTTCGG CTTCGCCGAG
GTCGCCGATG CCACGCTGCT GTGGGCCCAG CAGGGTCGCA AGATGGCTCA GGCGATCACC
CGACTGGACA CCGTCGTCCG CGACGTCACC GAGCGGCACG GCGGTACCAG CGTCGTCGCC
GCCGGCGAGT CGATCGGCGT GGCGTTCCAC CGGGCCGACG ATGCGGCCAC GTGGGCGACC
GACCTGCAGC TCGCGGTCGA CTCGGAGTCC TGGCCCGGCG GGATCGAGCT GCGCCTCCAG
GTGGCGCTGC ACACCGGCGA GACCGCGGAA CACGGCGGCG GCTACTTCGG GCTGGCGGTG
CACACCGCCT TGCGGCTGGC GGCCGCCGCC CACGGTGGGC AGGTCCTCCT GTCAGCGGTG
ACGGCCGCGC TGCTGGAACG CGACGACCTC CGAGACCTCG GCGCCTACCG GCTCGACGGC
ATGGCCGGGG AACGCACCGT GCTCCAGCTG GATGACGGCG ACCACCCGAT GCCGCGGACC
GCCTCTACGG GGCGGGGCGA CCTCCCCGGC CGTACCCCTC GGCTCCTCGG CCGCGAGCGA
GACCTCGCGG CCATCGCCGA CGCACTGGAG GCATCGACGG TCGTCACCCT GGTCGGCCCC
GGTGGGGTCG GCAAGACCAC GCTCGCACTG GCAGCCGCTC GGCACGCTCG GACCCAGGGC
CCCTGGCGGG TCTGGCTCGC CGAGCTGGGC GAGATCTCGA ACGACGCCGA CGTGCCCGCC
ACGATCGCGG AGACCCTCGG CGTCATCGGC GGCGCCGGCC GGACCCTGAG CGATTCGATC
GTGGCGACGC TGCGCACGCG TCCGACTCTG CTCGTCCTGG ACAACTGCGA ACACGTCGTG
GCCGGCGCGG CAGCGCTGGC CCGGGCGATC GCGGCCGCCG GCGGGGACAC GCTGGTCCTG
GCCACCTCCC GCGAACCGCT GGCCATCCCC GGCGAACAGC TGCTGCGGGT GGAGCCGCTG
GACGTGACCG GCCCCGCGGT GGAGCTCTTC GCAGAGCGGG CCCGCGCCGT GGACGCGACG
TTCGACCTGC ACGACGTGCG GACGGAGGTC GAGGAGATCT GCCGTCGCCT GGACGGTCTG
CCGCTGGCGA TCGAGCTCGC CGCCGCACGC ACGGTCCACC TGACACCGGC CCAGCTGCTC
GATCGGCTCG ACGACCGCTT CCGGCTCCTG GCCGGCAACA GCCGGGGCAG CGCGGAGCGT
CACCGGACCC TGCAGGCCAC CGTGCAGTGG TCCTACGACC TGCTGTCACA CCCCCAGCAG
CTGCTGTTCG AACGGCTGGC GGTGTTCGCC GGTCCCTTCG ACCTGTCCGC TGCGGAGACG
GTCGGAGGGC GCGAGGAGCT GGACGCCGTG GAGACCGACC GGCTGCTCGG CGACCTCGTG
CAGCGCTCGA TCGTCACTGT CGGCCCTGGC CCCTTCGGCC GGCAATTCCG CCTGCTGGAG
ACCCTGCGGG AGTTCGCGCT CGACCGACTC ACCGCACATG GCGACCGGGA GACGGTCGCC
GCGCGCCACG CCGAGTGGTG CCGCGAACGG ACGGCGGACA TCGGCCGGCT GCTGACCGGG
CAGGACGAGG TCGAGGGCGT CGCCCGGCTC GCCGAGCTGT GGCCCGACCT GCGCGCGGCG
TTCGGCTGGG CCGCCACCAC CGGGGACCTC GGCCTGGCCG ACGCCCTGGT GCGCCCCGTC
GCCCCCGAGG TCAGCCTCCG GCGGCGGGTC CAGATCGGCG ACTGGGCCGA ACGGATCCTC
GAGCTGACGC CACCCGACGA CGACTTCCGC CGGGTCTACT GGCTGCTGTG GGCCGGGCAC
CGGCATGCGC AGGCCGGAGA CCGCGAGGCA CTCGACGGAC TCGTGCGACG CCATGGCCAC
CGGGATCATC CCGTGATCCG GTTCAACCAC ACCTACCTGT CGGACGTGGA CGTGGACGCC
CACGCCGCCT CGACGGACGC GGTCGCGTGG TTGCGCGAGC ACGGCGAGCA CCGCACGGCG
GACATCCTGG ACGTGTCCGG TGTCGCGGCG TCCCTCATCG TGTTGCAGCG CTTCGATGAA
CTCGACGCCG TGGCCGCGGG CATGGCGGAG CGACACCGCC TCCACGGCCC GGCCACCCTC
CGGTACTTTG CCCTCGGCCT GCGCGGCTAC GCCGCCCAGT ACCAGGGCCG GCACGACGAC
GCGGCCCGGT TGTTCTCGCA GGCCGAGCAG CTGGAGCTCC CGGCCGGGAC CTACCGGATC
CTGCAGACGG CGCAGGCGCG GCTCGCCTTC GCCGCCGGTG ACCACCCGCG GGCGTACCGG
CTCCTGCGCG ACAACATCCA CACGCTCCTC GACAGCGACC ACACGGACGT GACGCGCATG
ATCGCCGTCG AGTTCATCAC CATGATGGCG GCCACGGACC GCCTCGCCGA TGCGGCCCAC
GTGCTGCCCT ACCTCGACAC GACCGGCAGG TTCGCCCTCC TCGCGCGGGA GAGCCTCATC
GCTGACGCAG TGCGCCGGAT GGAAGCCGAC CCGGCCCTGG TCGACCACCT CCGCGGAGAC
CCGGACGCCC GCGGGGCCCT GACGTTTATG CGGGACGTGC TCGACGAGCT CCTCGGGAGC
ATGGACGCGG AACGGTGGGC TGCCACGGGT CCCGGCGCGG GTACGGATCA GTCCGCGATC
AGCTGA
 
Protein sequence
MVVEVRLLGG VTAIDGEGTP LDVGSPKCRV LLAALALSAG EPVPASRLID VVWPDDPPRT 
ADKTLQGYVA QLRRGLGAAA ITRSGGTYRL ALDPDQVDVA RFRRLLSAGD VDGALDAWGG
TPLAGLEAPG LQAAADALVD QWMDATEDRL RRRLSTDPAA VIAALTEQTA AHPFREELWA
LLMIALYRAG RQADALSAFQ RAREHLIDEL GVEPGPRLRG VEAQILAHDQ QLEGSAAAAG
PPGPAPGPGP TGTVTFGFAE VADATLLWAQ QGRKMAQAIT RLDTVVRDVT ERHGGTSVVA
AGESIGVAFH RADDAATWAT DLQLAVDSES WPGGIELRLQ VALHTGETAE HGGGYFGLAV
HTALRLAAAA HGGQVLLSAV TAALLERDDL RDLGAYRLDG MAGERTVLQL DDGDHPMPRT
ASTGRGDLPG RTPRLLGRER DLAAIADALE ASTVVTLVGP GGVGKTTLAL AAARHARTQG
PWRVWLAELG EISNDADVPA TIAETLGVIG GAGRTLSDSI VATLRTRPTL LVLDNCEHVV
AGAAALARAI AAAGGDTLVL ATSREPLAIP GEQLLRVEPL DVTGPAVELF AERARAVDAT
FDLHDVRTEV EEICRRLDGL PLAIELAAAR TVHLTPAQLL DRLDDRFRLL AGNSRGSAER
HRTLQATVQW SYDLLSHPQQ LLFERLAVFA GPFDLSAAET VGGREELDAV ETDRLLGDLV
QRSIVTVGPG PFGRQFRLLE TLREFALDRL TAHGDRETVA ARHAEWCRER TADIGRLLTG
QDEVEGVARL AELWPDLRAA FGWAATTGDL GLADALVRPV APEVSLRRRV QIGDWAERIL
ELTPPDDDFR RVYWLLWAGH RHAQAGDREA LDGLVRRHGH RDHPVIRFNH TYLSDVDVDA
HAASTDAVAW LREHGEHRTA DILDVSGVAA SLIVLQRFDE LDAVAAGMAE RHRLHGPATL
RYFALGLRGY AAQYQGRHDD AARLFSQAEQ LELPAGTYRI LQTAQARLAF AAGDHPRAYR
LLRDNIHTLL DSDHTDVTRM IAVEFITMMA ATDRLADAAH VLPYLDTTGR FALLARESLI
ADAVRRMEAD PALVDHLRGD PDARGALTFM RDVLDELLGS MDAERWAATG PGAGTDQSAI
S