Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_0857 |
Symbol | |
ID | 8752514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 910187 |
End bp | 913189 |
Gene Length | 3003 bp |
Protein Length | 1000 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003407992 |
Protein GI | 284989438 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.198633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCTTC GGATACTCGG CCCGCTCGAG GTTCGGGGCA CAGGCGGGCC GCTTGTCCTT GGGGGTGTCC AGCAGCGCGG GGTGCTGGCC ATGCTGGCGC TCCACCTCAA CGAGGTGGTC ACCACCGACT TCCTTATCGA CGGTTTGTGG GGGGAGGAGG CGCCGGCGAG CGCTACCAAC ATCGTGCAGG GCTATGTCTC GCGGCTACGG AAGGCTCTGC AGGCGGAGAA CGTGCCGGAT CGAGTGGGTG CCGCGGTCTT GCTGCGCCGG GGTCCCGGCT ATCTGCTCGA ACTTGATCCC GAGCAGGTCG ACCTGTATCG CTTCCAGCGG CTACTCCGCG AGGGCGTGCG GGCTCTGCGC CCGGCGCCGG TCCGGGCTGC AAGCACGCTG CGCGAGGCGC TGGGCTTGTG GCGTGGGCAG CCGCTGGCGG AGTTCGCCGA CGTGCCCTTC GCCCGAGTCG AGATACCGCG GTTGGAGCAG CAGCGGCTCG GTGCCCTCGA GGCACGCCTG AAGGCGGATC TTGCACTCGG GCGACACGCC GAGGTCATCG GCGAGCTGAA GGCCCTCGTC GCTCGGCATC CCCTGCATGA GGGCCTACAC CAGCTGCTCA TGCTCAGCCT GTACCGCTCG GGCCGGCAGG CCGAGGCACT GGAGGCCTAC CGGCAGGCGC GCGAGACCCT GGCCGAGGAG CTGGGCATCG ACCCCGGTCG GGCGCTGCAG GAGCTCGAGG CCGCCATCCT CACCCACGAC CCGGACCTGG ACTGGACCCC GTCGCCCGAC GGCCCCACTT CTCAGGGGTC AGCCGACACG CGAGGCTCTC TCGCCGTCCC GCGGCCGCAG GCGTGGAACG TGCCGGCCCG CAATCCGCAC TTCACCGGCC GGGACGACCT GCTCACCGAG CTGCGCCGGC GGCTGCACGG GGAGGAACCG ACGCTGGTGG TGCAGGCCTT GTACGGCCTG GGTGGGGTGG GCAAGACCCA GCTGGCGATC GAGTACGCCC ACCGGTTCGC CGCCGACTAC GACCTGGTGT GGTGGATCGA CGCCGAACAG CCGGTGCTCA TCGGCGACCA GCTGGCTGGC CTGGCCGCCC GGCTGGACTT GCCGGCCGGG CGCACGGTGG CCGACACCGT CGACCGACTG CTGGCCGAGC TGCGCGGGCG GGACCGGTGG CTGCTGGTCT TCGACAACGC CAAACACCCT CAGGACATCG CCGACTATCG GCCCTGTGGA GCCGGGCACG TGCTGATCAC CTCCCGCTCC CCGGGCTGGG GAGCACTCGG CGGGCGGCTG GAGGTCGACG TGCTGGCCCG GGCGGAGACG ATCGCCTTGC TGCGCGCCCG AATCCCGAGC ATGAGCGAGG AGCTGGCCGA CAAGCTCGCC GCCGAGCTCG GGGATCTGCC GCTGGCCGCC GCGCAGGCCG CCGGCTATCT GGAGCAGACC GACCTGCCGG CGGCGGACTA TCTGCGCCGC TTCCGCACCC GCCGAGCCGA CCTGCTCACC CGGGGCGACG TGGTGGGCTA TCACGGCCGG GTCGACACCG CCTGGGCGCT GTCCCTGGAC CGGCTGCGCG GCGAGGAACC GGCCGCGGTT CAGCTGCTGG AGCTGGCCGC CTTCCTCGCT CCCGAACCCA TTCCGCTGTC CCTGGTCGGC GGGCACGCCG AGCTGCTGGA GGAGCCGCTG CGCGGCATCG CCGCCGACCC CGACGCCCTC GCCGACACCG TCGGCTCTCT CGTCGGATAC TCCCTGGCCC GCCGCCACCC CGAGGGCTTC CAGGTCCACC GGCTGGTGCA GGCGGTCATC CGCCACCAGC TTCCCTCCGA CCGGCAGCAG GACACCGCCC AGCGGGTGGT AGCGCTGCTG GCCGCCGCAT CCCCCGGCGA CCCGGACGAT CCGGTCAGCT GGGCCGCCTA CGCCCGGCTT GCGCCGCACG TGCTCGCCAC CGCCCCGCTG GGGGACTCCT CCTCCGCCAG CCGGCAGCTG GTCCTGGACA CCATCCGCTA CCTGCAGGCC CACGGGGACA GCTCCGGCAG CCGGGCCGTC TGCACACCAC TGCTCGACCG CTGGCGCGAG GTCCTCGGCC CCAACCATCC CGACACCCTG ACCGCCGCCA ACAGCCTCAC CCTTGCCCTC TTCGCAGTGG GCGAGGGCAA GTCAGCCCAC GCGCTGAGTG AGGACACTCT GCAGCGCTGC CGCCGGGTGC TCGGCCCCGA CCACGCCACC ACTCTGTTGG CGGCGACCGC GCTGACCGTT GCCCTGAACC ACAGGGGGGC GGCCGAGCCG GCCCGCGCCC TGGGCCAGGA CACCCTGCAG CGCTGCCGCC GGGTGCTCGG CCCCGACCAC ATCACCACCC TGTGGGCGGC GGCCGCCCTC GCCGTCGCCC GGGCCGTGCT GGGGGAGGTG GAGCCGGCCC GCTCCCTGGG CCAGGACACC CTGCAGCGCT GCCGCCGGGT GCTCGGCCCG GACCACGTGA TCACCCTCTT GGCGGCGGGC GCCCTCGCCG TCGCCCTGGT CGTGCTGGGG GAGGTGGAGC CGGCCCGCTC CCTGGGCCAG GACACCCTGC AGCGCTGCCG CCGGGTGCTC GGCCCGGACC ACGTGATCAC CCTGTGGGCG GCGGGCGCCC TGACCCACGC CCTCGTTCAG CTGGGCGAGG CCGAGCCGGC CCGCACCCTG GGCCAGGACA TCCTGCAGCG CCGGGTGTTC GGCCCACACC ACGTGATCAC CCTCTTGGCG GCGGGCGCCC TGACCCACGC CCTCGTTCAG CTGGGCGAGG CCGAGCCGGC CCGCACCCTG GGCCAGGACA CCCTGCAGCG CTGCCGCCGG GTGCTCGGCC CGGACCACGT AATCACCCTG TGGGCGGCGG GCGCCCTGAC CCTTGCCCTG ATTCAGCTGG GCGAGGTCGA GCCGGCCCGC ACCCTGGGCC AGGACACCCT GCAGCGCTGC CGCCGGGTGC TCGGCTCCGA CCACCCGATC ACGCTGTACC TGACAGCCGC CAACATCACC TGA
|
Protein sequence | MDLRILGPLE VRGTGGPLVL GGVQQRGVLA MLALHLNEVV TTDFLIDGLW GEEAPASATN IVQGYVSRLR KALQAENVPD RVGAAVLLRR GPGYLLELDP EQVDLYRFQR LLREGVRALR PAPVRAASTL REALGLWRGQ PLAEFADVPF ARVEIPRLEQ QRLGALEARL KADLALGRHA EVIGELKALV ARHPLHEGLH QLLMLSLYRS GRQAEALEAY RQARETLAEE LGIDPGRALQ ELEAAILTHD PDLDWTPSPD GPTSQGSADT RGSLAVPRPQ AWNVPARNPH FTGRDDLLTE LRRRLHGEEP TLVVQALYGL GGVGKTQLAI EYAHRFAADY DLVWWIDAEQ PVLIGDQLAG LAARLDLPAG RTVADTVDRL LAELRGRDRW LLVFDNAKHP QDIADYRPCG AGHVLITSRS PGWGALGGRL EVDVLARAET IALLRARIPS MSEELADKLA AELGDLPLAA AQAAGYLEQT DLPAADYLRR FRTRRADLLT RGDVVGYHGR VDTAWALSLD RLRGEEPAAV QLLELAAFLA PEPIPLSLVG GHAELLEEPL RGIAADPDAL ADTVGSLVGY SLARRHPEGF QVHRLVQAVI RHQLPSDRQQ DTAQRVVALL AAASPGDPDD PVSWAAYARL APHVLATAPL GDSSSASRQL VLDTIRYLQA HGDSSGSRAV CTPLLDRWRE VLGPNHPDTL TAANSLTLAL FAVGEGKSAH ALSEDTLQRC RRVLGPDHAT TLLAATALTV ALNHRGAAEP ARALGQDTLQ RCRRVLGPDH ITTLWAAAAL AVARAVLGEV EPARSLGQDT LQRCRRVLGP DHVITLLAAG ALAVALVVLG EVEPARSLGQ DTLQRCRRVL GPDHVITLWA AGALTHALVQ LGEAEPARTL GQDILQRRVF GPHHVITLLA AGALTHALVQ LGEAEPARTL GQDTLQRCRR VLGPDHVITL WAAGALTLAL IQLGEVEPAR TLGQDTLQRC RRVLGSDHPI TLYLTAANIT
|
| |