Gene Gobs_0857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGobs_0857 
Symbol 
ID8752514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeodermatophilus obscurus DSM 43160 
KingdomBacteria 
Replicon accessionNC_013757 
Strand
Start bp910187 
End bp913189 
Gene Length3003 bp 
Protein Length1000 aa 
Translation table11 
GC content73% 
IMG OID 
Producttranscriptional regulator, SARP family 
Protein accessionYP_003407992 
Protein GI284989438 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.198633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCTTC GGATACTCGG CCCGCTCGAG GTTCGGGGCA CAGGCGGGCC GCTTGTCCTT 
GGGGGTGTCC AGCAGCGCGG GGTGCTGGCC ATGCTGGCGC TCCACCTCAA CGAGGTGGTC
ACCACCGACT TCCTTATCGA CGGTTTGTGG GGGGAGGAGG CGCCGGCGAG CGCTACCAAC
ATCGTGCAGG GCTATGTCTC GCGGCTACGG AAGGCTCTGC AGGCGGAGAA CGTGCCGGAT
CGAGTGGGTG CCGCGGTCTT GCTGCGCCGG GGTCCCGGCT ATCTGCTCGA ACTTGATCCC
GAGCAGGTCG ACCTGTATCG CTTCCAGCGG CTACTCCGCG AGGGCGTGCG GGCTCTGCGC
CCGGCGCCGG TCCGGGCTGC AAGCACGCTG CGCGAGGCGC TGGGCTTGTG GCGTGGGCAG
CCGCTGGCGG AGTTCGCCGA CGTGCCCTTC GCCCGAGTCG AGATACCGCG GTTGGAGCAG
CAGCGGCTCG GTGCCCTCGA GGCACGCCTG AAGGCGGATC TTGCACTCGG GCGACACGCC
GAGGTCATCG GCGAGCTGAA GGCCCTCGTC GCTCGGCATC CCCTGCATGA GGGCCTACAC
CAGCTGCTCA TGCTCAGCCT GTACCGCTCG GGCCGGCAGG CCGAGGCACT GGAGGCCTAC
CGGCAGGCGC GCGAGACCCT GGCCGAGGAG CTGGGCATCG ACCCCGGTCG GGCGCTGCAG
GAGCTCGAGG CCGCCATCCT CACCCACGAC CCGGACCTGG ACTGGACCCC GTCGCCCGAC
GGCCCCACTT CTCAGGGGTC AGCCGACACG CGAGGCTCTC TCGCCGTCCC GCGGCCGCAG
GCGTGGAACG TGCCGGCCCG CAATCCGCAC TTCACCGGCC GGGACGACCT GCTCACCGAG
CTGCGCCGGC GGCTGCACGG GGAGGAACCG ACGCTGGTGG TGCAGGCCTT GTACGGCCTG
GGTGGGGTGG GCAAGACCCA GCTGGCGATC GAGTACGCCC ACCGGTTCGC CGCCGACTAC
GACCTGGTGT GGTGGATCGA CGCCGAACAG CCGGTGCTCA TCGGCGACCA GCTGGCTGGC
CTGGCCGCCC GGCTGGACTT GCCGGCCGGG CGCACGGTGG CCGACACCGT CGACCGACTG
CTGGCCGAGC TGCGCGGGCG GGACCGGTGG CTGCTGGTCT TCGACAACGC CAAACACCCT
CAGGACATCG CCGACTATCG GCCCTGTGGA GCCGGGCACG TGCTGATCAC CTCCCGCTCC
CCGGGCTGGG GAGCACTCGG CGGGCGGCTG GAGGTCGACG TGCTGGCCCG GGCGGAGACG
ATCGCCTTGC TGCGCGCCCG AATCCCGAGC ATGAGCGAGG AGCTGGCCGA CAAGCTCGCC
GCCGAGCTCG GGGATCTGCC GCTGGCCGCC GCGCAGGCCG CCGGCTATCT GGAGCAGACC
GACCTGCCGG CGGCGGACTA TCTGCGCCGC TTCCGCACCC GCCGAGCCGA CCTGCTCACC
CGGGGCGACG TGGTGGGCTA TCACGGCCGG GTCGACACCG CCTGGGCGCT GTCCCTGGAC
CGGCTGCGCG GCGAGGAACC GGCCGCGGTT CAGCTGCTGG AGCTGGCCGC CTTCCTCGCT
CCCGAACCCA TTCCGCTGTC CCTGGTCGGC GGGCACGCCG AGCTGCTGGA GGAGCCGCTG
CGCGGCATCG CCGCCGACCC CGACGCCCTC GCCGACACCG TCGGCTCTCT CGTCGGATAC
TCCCTGGCCC GCCGCCACCC CGAGGGCTTC CAGGTCCACC GGCTGGTGCA GGCGGTCATC
CGCCACCAGC TTCCCTCCGA CCGGCAGCAG GACACCGCCC AGCGGGTGGT AGCGCTGCTG
GCCGCCGCAT CCCCCGGCGA CCCGGACGAT CCGGTCAGCT GGGCCGCCTA CGCCCGGCTT
GCGCCGCACG TGCTCGCCAC CGCCCCGCTG GGGGACTCCT CCTCCGCCAG CCGGCAGCTG
GTCCTGGACA CCATCCGCTA CCTGCAGGCC CACGGGGACA GCTCCGGCAG CCGGGCCGTC
TGCACACCAC TGCTCGACCG CTGGCGCGAG GTCCTCGGCC CCAACCATCC CGACACCCTG
ACCGCCGCCA ACAGCCTCAC CCTTGCCCTC TTCGCAGTGG GCGAGGGCAA GTCAGCCCAC
GCGCTGAGTG AGGACACTCT GCAGCGCTGC CGCCGGGTGC TCGGCCCCGA CCACGCCACC
ACTCTGTTGG CGGCGACCGC GCTGACCGTT GCCCTGAACC ACAGGGGGGC GGCCGAGCCG
GCCCGCGCCC TGGGCCAGGA CACCCTGCAG CGCTGCCGCC GGGTGCTCGG CCCCGACCAC
ATCACCACCC TGTGGGCGGC GGCCGCCCTC GCCGTCGCCC GGGCCGTGCT GGGGGAGGTG
GAGCCGGCCC GCTCCCTGGG CCAGGACACC CTGCAGCGCT GCCGCCGGGT GCTCGGCCCG
GACCACGTGA TCACCCTCTT GGCGGCGGGC GCCCTCGCCG TCGCCCTGGT CGTGCTGGGG
GAGGTGGAGC CGGCCCGCTC CCTGGGCCAG GACACCCTGC AGCGCTGCCG CCGGGTGCTC
GGCCCGGACC ACGTGATCAC CCTGTGGGCG GCGGGCGCCC TGACCCACGC CCTCGTTCAG
CTGGGCGAGG CCGAGCCGGC CCGCACCCTG GGCCAGGACA TCCTGCAGCG CCGGGTGTTC
GGCCCACACC ACGTGATCAC CCTCTTGGCG GCGGGCGCCC TGACCCACGC CCTCGTTCAG
CTGGGCGAGG CCGAGCCGGC CCGCACCCTG GGCCAGGACA CCCTGCAGCG CTGCCGCCGG
GTGCTCGGCC CGGACCACGT AATCACCCTG TGGGCGGCGG GCGCCCTGAC CCTTGCCCTG
ATTCAGCTGG GCGAGGTCGA GCCGGCCCGC ACCCTGGGCC AGGACACCCT GCAGCGCTGC
CGCCGGGTGC TCGGCTCCGA CCACCCGATC ACGCTGTACC TGACAGCCGC CAACATCACC
TGA
 
Protein sequence
MDLRILGPLE VRGTGGPLVL GGVQQRGVLA MLALHLNEVV TTDFLIDGLW GEEAPASATN 
IVQGYVSRLR KALQAENVPD RVGAAVLLRR GPGYLLELDP EQVDLYRFQR LLREGVRALR
PAPVRAASTL REALGLWRGQ PLAEFADVPF ARVEIPRLEQ QRLGALEARL KADLALGRHA
EVIGELKALV ARHPLHEGLH QLLMLSLYRS GRQAEALEAY RQARETLAEE LGIDPGRALQ
ELEAAILTHD PDLDWTPSPD GPTSQGSADT RGSLAVPRPQ AWNVPARNPH FTGRDDLLTE
LRRRLHGEEP TLVVQALYGL GGVGKTQLAI EYAHRFAADY DLVWWIDAEQ PVLIGDQLAG
LAARLDLPAG RTVADTVDRL LAELRGRDRW LLVFDNAKHP QDIADYRPCG AGHVLITSRS
PGWGALGGRL EVDVLARAET IALLRARIPS MSEELADKLA AELGDLPLAA AQAAGYLEQT
DLPAADYLRR FRTRRADLLT RGDVVGYHGR VDTAWALSLD RLRGEEPAAV QLLELAAFLA
PEPIPLSLVG GHAELLEEPL RGIAADPDAL ADTVGSLVGY SLARRHPEGF QVHRLVQAVI
RHQLPSDRQQ DTAQRVVALL AAASPGDPDD PVSWAAYARL APHVLATAPL GDSSSASRQL
VLDTIRYLQA HGDSSGSRAV CTPLLDRWRE VLGPNHPDTL TAANSLTLAL FAVGEGKSAH
ALSEDTLQRC RRVLGPDHAT TLLAATALTV ALNHRGAAEP ARALGQDTLQ RCRRVLGPDH
ITTLWAAAAL AVARAVLGEV EPARSLGQDT LQRCRRVLGP DHVITLLAAG ALAVALVVLG
EVEPARSLGQ DTLQRCRRVL GPDHVITLWA AGALTHALVQ LGEAEPARTL GQDILQRRVF
GPHHVITLLA AGALTHALVQ LGEAEPARTL GQDTLQRCRR VLGPDHVITL WAAGALTLAL
IQLGEVEPAR TLGQDTLQRC RRVLGSDHPI TLYLTAANIT