Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_2049 |
Symbol | |
ID | 8753720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 2129364 |
End bp | 2131151 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003409108 |
Protein GI | 284990554 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.352532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACAGCA CGCTGGAGCG GGGCCCTGCC TGGGCGCGGG GCACCCGCGG CCGGGTGGCC GGCATCGCCG CCGTCGTCCT GCTGGTCGTC GGGGTGGCGG TCAGCTGCGC CAAGGCCGGC GAGGACCGCG CCACCCTCCC CGTCTTCGGC GGCGGTCCGG ACGCCGGGGT CACCGGGGTC CGGGCGCCGT CCGAGGCCTC CGGGGGCACG CTGCGGGTGG TGACCGGCGA GGTCGACAAC CTCGACCCGC AACGCTCCTA CCTCCCCGGA GTGTGGAACC TCATGCGGCT CTACACCCGC ACCCTGGTCA GCTACTCCTC CGAGCCCGGC CGCACCGCCG AGCTGGTGCC CGACCTCGCG ACCGACCTGG GGACCACGCC CGACGGCGGG GCCACCTGGA CCTTCACCCT CCGCGAGGGG GTGCGGTTCG AGACCGGCCA GCCGATCACC TCGCGCGACG TCAAGTACGG CATCGAGCGC TCGTTCGCCT CGGACGTGGT CGTCGGCGGC CCGACCCGTG TGGTCGAGCT GCTCGACGAC CCGGGCAACC CCTACGCCGG CCCGTACCAG GACGAGACGC CCGGCCGGCT GGGCCTGGCC TCGGTCGAGA CGCCCGACGA CCGCACCATC ACCTTCCGGC TGCGCGCGCC CCAGCCGGAC TTCCCGTACG TCATGGCGCT GCCCTCGAGC AGCCCGGTCC CGGCGGACCA CGACACCGGC GCCGACTACG GCCTCGACCC GGTCTCCTCC GGCCCCTACC TGGTCGCCAC GCGGGACGAC GTGACGGGCA TCGTGCTCGA GCGCAACCCG CAGTGGGACC CGGCCACCGA TGACGTCCGC ACCGCGCTGC CCGACCGGGT CGTCGTGCGC ACCGGGCTGA CCGGTGTGCA GCGCGACCAG GCGCTGCTGG CCGGGTCCGC TGACGTCGAC ATCTCCGGCG CCGGCATCCA GGCGCCGACC ACCGCCCGGC TGGGCGAGGA CACCGGCGGC GAGGTCCGGC TGGTCGACCG GATCGACGAG GTGACCAACA ACGTCGTCCG GCTGCTCGCG CTGCCGACGG ACGTCGCGCC GTTCGACGAC CCGGCCTGCC GGGCCGCGGT GGCCGCGGTC GTGGACCGCG CGGCGGTGCA GGAGGTGCTC GGCGGGCCCG CGGAGGCCGT GCGCACCTCC CAGCTGTGGC CGCGCGGCGT GGACGGCGGG CCGGAGGAGC CCGACCCGAC CGCCGACCCC GCCACCGCCG AGGAGTCCCT GGCCGCCTGC GGCCGGCCGG AGGGCTTCGG CACCGTGCTC GCCGTCCCGG ACGCCCCGAC CGGCGTGGCC GTGGCCGAGG AGGTGGCCGG GCAACTCTCC GGCATCGGCA TCGAGGCCGA GGTACGGGCG CTGGACGCAC CGAGCTTCTA CGCGACCGAG GTCGGCAACC CCGACCGGGT CCGCGACAAC GGGATCGGCA TCGTGCTGGC CACCTGGACC GCCGACTTCC CGACCCCCGG CTCGTTCCTC GTGCCGCTGG TCGACGGCCG GTCAGTCAGC ACCGTCGGCA ACACCAACTT CGCCCGCCTG CGGGACCCCG GCATCGACGG GCTCATCGAC GCCGCCCGCG CGGCCGCCGG CGACCCCGAG GCGGCCCGCG CGGCCTGGCG CGAGGTGGCG ACCGCGGCCA CCGCGACCGG CGCGTACGTG CCGCTGGCCG AGTCGCGGGT GCAGCTGCTG GCCGGCCAGC GGCTGCGCAA CGGCGTGGTG ATGGGCCCGT ATACGAGCTA CGACCTGGCG ACCGCCGGCG TCCGCTGA
|
Protein sequence | MDSTLERGPA WARGTRGRVA GIAAVVLLVV GVAVSCAKAG EDRATLPVFG GGPDAGVTGV RAPSEASGGT LRVVTGEVDN LDPQRSYLPG VWNLMRLYTR TLVSYSSEPG RTAELVPDLA TDLGTTPDGG ATWTFTLREG VRFETGQPIT SRDVKYGIER SFASDVVVGG PTRVVELLDD PGNPYAGPYQ DETPGRLGLA SVETPDDRTI TFRLRAPQPD FPYVMALPSS SPVPADHDTG ADYGLDPVSS GPYLVATRDD VTGIVLERNP QWDPATDDVR TALPDRVVVR TGLTGVQRDQ ALLAGSADVD ISGAGIQAPT TARLGEDTGG EVRLVDRIDE VTNNVVRLLA LPTDVAPFDD PACRAAVAAV VDRAAVQEVL GGPAEAVRTS QLWPRGVDGG PEEPDPTADP ATAEESLAAC GRPEGFGTVL AVPDAPTGVA VAEEVAGQLS GIGIEAEVRA LDAPSFYATE VGNPDRVRDN GIGIVLATWT ADFPTPGSFL VPLVDGRSVS TVGNTNFARL RDPGIDGLID AARAAAGDPE AARAAWREVA TAATATGAYV PLAESRVQLL AGQRLRNGVV MGPYTSYDLA TAGVR
|
| |