Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3650 |
Symbol | |
ID | 8755335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 3829485 |
End bp | 3830516 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003410606 |
Protein GI | 284992052 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0546711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCGGAG GAACAAGACT GGGGCGGAGC ACCGCCGCGG CGCTCACCGC CCTGGTGGCG GCCGGGGCGC TCGCCGGTTG TGGCGGGGGC GACGCGGAGG CCGCGGAGAC GCTCACGGTC TACAGCGCCC AGCACGAGAG CCTGGTGCGC ACGATGCTCG AGGGCTTCAC CGAGGAGACC GGCATCGCGC TGGAGTTCCG CGACGCCAAC GACGCGGAAC TGGCCAACCA GATCGTGCAG GAGGGCGAGG CCAGCCCGGC CGACGTCTTC CTCACCGAGA ACAGCCCCTC GATCGACGTC CTCGACCGCG AGGGCCTGCT CGCCCCGCTG GACCAGTCGA CGCTGGACCA GGTCGGCGCG CAGTACCGGC CCTCCTCGGG CAACTGGACC GGCTTCGCCG CCCGCTCCAC CGTGCTGGTG CACAACCCCG CGCAGCTGCC CCAGGACCAG CTGCCGGCGT CGATCCTCGA CCTCGCGAAC CCGGAGTGGC AGGGCCGCAT CGGTATCGCG GCGGGCGGTG CGGACTTCCA GGCGATCGTG GCCGGGGTCC TGGCGCTGCG CGGGGAGGAG GCGACCCGGG CCTGGCTGGA GGGGCTGGAG CGCAACGCCA ACGTGTACCC CAGCAACAGC GCGGTGATGG TCGCCGCCGA CGAGGGCGAG ATCGACGCCG GCGTCATGTA CCACTACTAC TTCTACCGGG ACCGCGCCGA GAACGGCCTG AAGAGCGACG ACGCCGAGCT GCACTTCTTC CGCAACTCCG ACCCGGGCGC CTTCCTCAGC ATCTCCGGCG CCGGCGTCCT GGCCTCGTCC GACCAGCCCG AGCAGGCCCA GCGACTGGTC GCCTACCTGA CCTCGCCGCC GGCGCAGCAG CGGCTGGCCG AGAGCACGGC ACTGGAGTAC GCCGTCGGCA ACGACGTCCC CTCCGCGGAG GCGCTGCCGC CGCTGGCGGA GCTGCAGGCG CCGCAGGTCG ACCCGGGCTC CCTCGACCAG CAGCGGGTCA CCGAGCTGAT GCAGGACGTG GGGCTGCTCT GA
|
Protein sequence | MLGGTRLGRS TAAALTALVA AGALAGCGGG DAEAAETLTV YSAQHESLVR TMLEGFTEET GIALEFRDAN DAELANQIVQ EGEASPADVF LTENSPSIDV LDREGLLAPL DQSTLDQVGA QYRPSSGNWT GFAARSTVLV HNPAQLPQDQ LPASILDLAN PEWQGRIGIA AGGADFQAIV AGVLALRGEE ATRAWLEGLE RNANVYPSNS AVMVAADEGE IDAGVMYHYY FYRDRAENGL KSDDAELHFF RNSDPGAFLS ISGAGVLASS DQPEQAQRLV AYLTSPPAQQ RLAESTALEY AVGNDVPSAE ALPPLAELQA PQVDPGSLDQ QRVTELMQDV GLL
|
| |