Gene Information |
Locus tag | Gobs_0480 |
Symbol | |
ID | 8752134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 524598 |
End bp | 526307 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003407633 |
Protein GI | 284989079 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.105843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGTC GTCCCCAACG CGCGCTCGCC GCCTCGGCGG CCGCGCTCAC CGCAGTCACC CTGGCTGCCT GTGCCTCGAG CGACCGGGAC ACCGGTGGTG GCGGGGAGGG CGGTGGTGAG GCCCGGTCCG GCGGCACCAT GGTCTTCGGC GCCACCGGCG ACCCCGCGAT GCTCGACCCG GCGTTCGGCA GCGACGGCGA GACCTTCCGC GTCTCCCGCC AGGTCTTCGA GGGCCTGCTC GGCAACGAGC TCGGTGGCAC CGACCCCGTC CCCGAGCTGG CCGAGGACTG GGAGGTCAGC GAGGACGGGC TGGAGTACAC CTTCAACCTG CAGCAGGGCG TCCAGTTCCA CGACGGCACA AACTTCAACG CCGAGGCGGT CTGCTTCAAC TTCGACCGCT GGTACAACTT CGAGGGCCTG GCCACCAGCC CGAGCGCCTC GTACTACTAC CAGGCGGTCT TCGGCGGCTT CGCCAGCACG CCGGACACCC CGAGCATCTA CGAGAGCTGC GAGGCGACCG ACGAGAACAC CGCGGTCATC CGGCTCACCC AGGTCACCTC GAAGTTCCCC GCCGCGCTGG CGCTGCCGGC GTTCTCCATC CAGAGCCCGA CGGCCCTGCA GGAGTACGAC GCCGACAACC TGTCCGGTGC CGAGGACGCG CTGGCCTACC CCGAGTACGC CCTGGAGCAC CCGACCGGCA CCGGGCCCTT CCGGTTCGAG AGCTGGGACC GCGGCAACGG TCAGGTCACC CTGGTCCGCA ACGAGGACTA CTGGGGCGAG CCGGCGCTGC TCGACGAGCT CATCATCCGC ACCATCCCGG ACGGCAACAC CCGCCGCCAG GAGCTGCAGG CCGGCTCGAT CGACGGCTAC GACTTCGTGG CGCCGGCGGA CTACCAGTCG CTGCAGGACG AGGGTCACCA GGTCCTGGTC CGCGACCCGT TCAACATCCT CTACCTGGGC TTCAACGGCG GGAACGTCCC CGGCACCAGC GCCAACCCGG CGCTGCAGGA CCCGCGGGTG CGGCAGGCCA TCGCGCACGC GATCGATCGC GACACGATCG TCAGCTCGCT GCTCCCCGAG GGCGCCGAGG CGGCCATCGA GTTCATGCCG CCGACGGTCG ACGGCTACGC CGAGGACGTC ACGACCTACG ACCACGACCC GAACCGGGCG CGGCAGCTGC TGCAGGAGGC CGGGGCCGAG GGCACGACGC TGCGGTTCTA CTACCCGACC GAGGTCAGCC GCCCCTACCT GCCGGACCCG GCCGCGATGT TCCAGGTCAT CAGCCAGGAC CTGACCGACG CCGGCTTCAC CATCGAGCCG GTCGCGCTGC CGTGGAACCC GGACTACCTG AACGCCGTCC AGTCCGGTCA GGCTGACATC CACCTGCTCG GGTGGACCGG TGACTACAAC GACGCCTACA ACTTCATCGG CACCTTCTTC GCCGAGGCGT CGAACAACCA GGCGTCGGCG GAGTTCGGTG CGTTCAGCGC GCCGGAGATC TTCCAGGCGC TCGCCCGTGC CGACCAGGAG CCGGACCCGG CCGCCCGTAC CGCGCTCTAC CAGGAGGCGA ACCGGCTGAT CATGGACTAC CTGCCCGGTG TCCCGATCTC CCACTCGCCG CCGGCTCTGG TCGTCGCGGA GAACGTCACG GGTCTGGAGC CCAGCCCGCT GACCGCCGAG GTCTTCTCCA CGGTCTCCAT CAGCGAGTGA
|
Protein sequence | MQRRPQRALA ASAAALTAVT LAACASSDRD TGGGGEGGGE ARSGGTMVFG ATGDPAMLDP AFGSDGETFR VSRQVFEGLL GNELGGTDPV PELAEDWEVS EDGLEYTFNL QQGVQFHDGT NFNAEAVCFN FDRWYNFEGL ATSPSASYYY QAVFGGFAST PDTPSIYESC EATDENTAVI RLTQVTSKFP AALALPAFSI QSPTALQEYD ADNLSGAEDA LAYPEYALEH PTGTGPFRFE SWDRGNGQVT LVRNEDYWGE PALLDELIIR TIPDGNTRRQ ELQAGSIDGY DFVAPADYQS LQDEGHQVLV RDPFNILYLG FNGGNVPGTS ANPALQDPRV RQAIAHAIDR DTIVSSLLPE GAEAAIEFMP PTVDGYAEDV TTYDHDPNRA RQLLQEAGAE GTTLRFYYPT EVSRPYLPDP AAMFQVISQD LTDAGFTIEP VALPWNPDYL NAVQSGQADI HLLGWTGDYN DAYNFIGTFF AEASNNQASA EFGAFSAPEI FQALARADQE PDPAARTALY QEANRLIMDY LPGVPISHSP PALVVAENVT GLEPSPLTAE VFSTVSISE
|
| |
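The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database: the helper names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# Values taken from the Gene Information table for Gobs_0480.
length = gene_length(524598, 526307)
print(length)                  # 1710, matching "Gene Length"
print(protein_length(length))  # 569, matching "Protein Length"
```

Passing the full Gene sequence string to `gc_fraction` gives a value that can be compared against the reported GC content; the block-separating spaces do not need to be removed first.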