Gene Information |
Locus tag | Gobs_2405 |
Symbol | |
ID | 8754076 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 2498398 |
End bp | 2501475 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | transcriptional activator domain protein |
Protein accession | YP_003409449 |
Protein GI | 284990895 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
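The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2498398
end_bp = 2501475

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the last codon
# being a stop codon that yields no residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the reported Gene Length (3078 bp) and Protein Length (1025 aa).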
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.375799 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGGG CGCTCCGGTA CGCACCACCG CTCGCCGACC GCGGGCTCAT CGTCCGGCCC CGGCTGCTGC ACCGGCTGCA CTCCCGGTTC GAGCGGCGCC TGACCGCCGT CGTGGCGCCG GCCGGGTTCG GCAAGACGAC GCTGCTGGCC CAGGCGGTCC AGGAGAACAC GCTGTCCCCG CTGGGGGAGG ACCGCTGGCT GACCTGCCAG CGGGACGACA CGTCGCTGTC GTTCCTGGCC GCGGGCGCGT TCGCCGCCGT GGGCCTCACC GCGCCGGTCC CGCAGGACCC GCGGGAGGCC GCGGTCACCG TCGCCGAGGC CATCTGGAGC GCCGCGCCGC GGCACCTCGC GCTGATCCTC GACGACGTCC ACCTGGTGCG ACCGGGCTCG CCCGGCGGCC ACTTCGTGGC CGAGCTGGTC GAGGAGCTGC CGCGCAACGG TCACGTCGTC CTGGCGTCCC GCCCGCCGCT GCCGCTGCGC GCCTCGCGGC TGCTCGCCAG CGGCGAGGCG GTCGTCCTCG GCGAGACCGA GCTCCACTTC GCGCGCGAGG AGATGGCGGC GTTCGCGGCG TCCCGGCAGG TGCCGCCAGA CCTGCTGCGG GACGTCGGCG GCTGGCCGGC ACTGGCCGAG CTGACCGCCA CCGCCGGCCC GGACGCCGTC AGCGGCTACG TGTGGGAGGA GCTGCTCAGC CGGCTGTCGC CGGAGCGCCG GCGGGCGCTC AGCGTGCTGG TCGCCGTCGG TGGCGCCGAC GACGAGCTGG CCGCGGCCCT GCTCGGGCCG GACGTGCGGC TGGAGGAGCT GCTGGAGGGC CTGCCGCTGG TGGTCCGCGG TCCCTCGGGC TGGTGGTCGC TGCACGGGCT GTGGTCGGCG ATCCTGGCGC ACCGCCTGGA CCCCGGGCAG CTGGCGCAGG CCCGCCGCAC CGCCGGCCTG GTCCTGGCCC GTCGCGGGCG CTACCACGAC GCCATGGAGC TGCTGGCCGA CGCCGGGGCG TGGGACGACG TCCGCCGTCT CGTCGTCGAG GTGTGCGAGG TCGGCACGCC GCTGGTCCCG CCGGACGTGC TCGAGGTGTG GCTGCACCGG CTGCCCCCGG ACGTGCAGGA GGGGCCCGAG GGGCTGCTGC TGGCCGCGAA CGTCGCCGAG CCGACCAGCC TGGCCAGCGC CGAGGCGTTG CTGGAGCGCG CCTTCGCGCT GGCTCCCGAC GTGGCACCGG TGCGCTTCGC CTGCCTCAAC GCCCTGGTCG AGGTCGGGCT GCGCCGGTCC GACCGCCGGG AGATGGAGCT GCACATCGAG CGGCTGACCG GGCTGGCGGC CCGCGGCCAC GAGCGCGCCG CCGGGTGGAT CGCGCTGTTC CGGGGTCTGC TGGCACGCAC GCCCGCGGAG GTGCGCGCGC AGCTGGCCAC GCCGGCGCTG GTGGCGGGGA CCGGGCTCAG CCCGGTGCAG CAGTGGCTGC GCGCCCACCT GATGCTACTC AAGCTGGGCG ACGCCGAGGG CGCCGAGCGC GTCGTCCGCC GGGCCCTCGC CTCGGCCGGC CCCAACATGG CGGTGCTCTT CCGGAGCCAG CTGGTCGAGT CGCTGCGGAT GCGCGGGCGG CTCGACGAGG CCGAGGCGCT GCTGCCCGAC CTGCTCGCCG GGATCGACCC CGCCAAGGTC CTGACCTCCC CGGAGACCGT CACCTGCGCC GTCGTCCTGC TCGGTCTCCT CGGCCGCGAC GACCAGGCCG GGGAGCTGCT GGGGGCGCAC CGGCAGGCCG TCGCCGATTC GTCGGTCGCC TGGGCGCCGG TCGCCGGGGC CGTGGCGGAG 
GCGGCCCACC GGGTGTCGCT CGGTCAGGAG GCCGCGGCCG CCGAGGCCCT GCGCGCGGTG CTGCGGCTGG ACGTGGCCCG CAGCCGGGCG GTGCGTCAGG TGTCCGCCGC GGCGCTGCCG CTGCAGTACG TCCTGCTCCC CGAGGTGCGG CCGCGGTGGG ACGCCGCGGC CCCGCCGGGG TGCTTCCGGC CGGCGCTGCA GCTGGCCCGG GCGTTGGTCG CGATCCGGGA GGAGGGGTCG TTGGAGCAGG TGGCCGGGCT GCCGGCCGAG GCCCGCGCGG TCATGCGTGC CGTCCTGCCG GTGCCGTGGA CGGCGGAGCT CGCTCTCGGC ATGGTCGCCT CGGGCCTCCA GGAGGGGCAC GCCCTGGTCG AGGAGCTCGG GCCGCGCGCC CGGCCCACCC TGCGGGCGCT CAGCAGCGGA GGCCCCGCGC CGGTCGCCGC CACCGCCCGC CGGCTGCTGC GTGAGCTGCC GGCCGCGCCG CCCGCCCGGC TGGAGCTGCG GGTCCTGGGG CCGATGCAGC TGCGCCGGAA CGGGGTCGTC GTGGTCGCCC GCGAGCTGCG CCGCGAGCGG GTGCGCCAGC TGCTCGGGTA CCTGCTCGCC CACGACGGTC CCTCCCGGGC GGCGGTCACG GCCGACCTGT GGCCCGACCT CGACGAGGCG GCCGCGGCTC GCAACCTGCG GGTCACCCTC GCCTACCTGC AGGACCTGCT CGAACCCGAC CGCGGCGAGT CCGACGCGCC GTACTTCGTC CGCAGCGGGG GGCCGGTCCT GCACCTCCTC GTCGGTGAGG CGCTGCAGGT GGACGCCGTG GACTTCGAGC GTTCCCTCGA CGAGGCGGCG CGGCTCGAGC GCCAGGGTGC GCCGTCGGCC GCCCTGGCCG CCTACGAGCA AGCGCTGCGG CTCTGGGACA CCGACTACCT GCCCGACGTG AGCGGCGGCG ACTGGCTGGA GTGGGAGCGC GACCGGATGC GGGGGCGGTT CGTGGCGGCC GCCGTCCGGG CCGGGGAACT CCTGCTGGCC CGGGGGGACG CCGGCGCCGC CCGGACGCTG GGCGAACGGG CGCTGCGGGT CGACGCCTGG TCCGAGGAGG CCCACCAGCT GCTGGTCGCC GCGCTGCTGG AGGCCGGGGA CGCAGCCGAC GCCCGCCGGG CGCTGCGGCG CTGCCTGCAG GCGCTCGACG ACCTCGGCGT CCCGGCGCAG CCGCGCACCC TCGCCCTGGC CCGGCGTCTC GAGGTGCACG CCCCCGCGCG GCGGGACCGG CCGCGGGCTC GCGGCTGA
|
Protein sequence | MSRALRYAPP LADRGLIVRP RLLHRLHSRF ERRLTAVVAP AGFGKTTLLA QAVQENTLSP LGEDRWLTCQ RDDTSLSFLA AGAFAAVGLT APVPQDPREA AVTVAEAIWS AAPRHLALIL DDVHLVRPGS PGGHFVAELV EELPRNGHVV LASRPPLPLR ASRLLASGEA VVLGETELHF AREEMAAFAA SRQVPPDLLR DVGGWPALAE LTATAGPDAV SGYVWEELLS RLSPERRRAL SVLVAVGGAD DELAAALLGP DVRLEELLEG LPLVVRGPSG WWSLHGLWSA ILAHRLDPGQ LAQARRTAGL VLARRGRYHD AMELLADAGA WDDVRRLVVE VCEVGTPLVP PDVLEVWLHR LPPDVQEGPE GLLLAANVAE PTSLASAEAL LERAFALAPD VAPVRFACLN ALVEVGLRRS DRREMELHIE RLTGLAARGH ERAAGWIALF RGLLARTPAE VRAQLATPAL VAGTGLSPVQ QWLRAHLMLL KLGDAEGAER VVRRALASAG PNMAVLFRSQ LVESLRMRGR LDEAEALLPD LLAGIDPAKV LTSPETVTCA VVLLGLLGRD DQAGELLGAH RQAVADSSVA WAPVAGAVAE AAHRVSLGQE AAAAEALRAV LRLDVARSRA VRQVSAAALP LQYVLLPEVR PRWDAAAPPG CFRPALQLAR ALVAIREEGS LEQVAGLPAE ARAVMRAVLP VPWTAELALG MVASGLQEGH ALVEELGPRA RPTLRALSSG GPAPVAATAR RLLRELPAAP PARLELRVLG PMQLRRNGVV VVARELRRER VRQLLGYLLA HDGPSRAAVT ADLWPDLDEA AAARNLRVTL AYLQDLLEPD RGESDAPYFV RSGGPVLHLL VGEALQVDAV DFERSLDEAA RLERQGAPSA ALAAYEQALR LWDTDYLPDV SGGDWLEWER DRMRGRFVAA AVRAGELLLA RGDAGAARTL GERALRVDAW SEEAHQLLVA ALLEAGDAAD ARRALRRCLQ ALDDLGVPAQ PRTLALARRL EVHAPARRDR PRARG
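The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, checked for GC content, and translated is given below. The function names (clean_sequence, gc_fraction, translate) are hypothetical helpers, not part of any database tooling. The codon assignments are those of the standard genetic code, which translation table 11 shares; the sketch does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG.

```python
# Standard codon assignments (shared by translation table 11),
# listed in TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
gene_start = clean_sequence("ATGAGCCGGG CGCTCCGGTA CGCACCACCG")
print(translate(gene_start))               # compare with the first protein block
print(round(100 * gc_fraction(gene_start)), "% GC in this fragment")
```

Passing the full Gene sequence field through clean_sequence and translate should reproduce the Protein sequence field once its block spacing is removed, and gc_fraction on the full gene can be compared with the reported GC content.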