Gene Information |
Locus tag | Gobs_1079 |
Symbol | |
ID | 8752740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 1139341 |
End bp | 1140999 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003408209 |
Protein GI | 284989655 |
COG category | |
COG ID | |
TIGRFAM ID | |
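
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon (assumed present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1139341, 1140999)
print(length, protein_length(length))  # prints "1659 552"
```

Under these assumptions the result agrees with the Gene Length (1659 bp) and Protein Length (552 aa) fields.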
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.133059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGGTTGA GCAAGCGCTC TGCTGCGCTC ATCGCGACCG GTCTGACCGG CGCGATGGTG CTGTCCGCCT GTGGCGGTGG CGGCGACGAG GACGGCGGAT CGGGGGCTGC CGCCGCGGAC GGCGGCAGTT TCACCGTCTA CATCGCCGAG CCCGAGAACC CGCTCATCCC CGGCCAGACC ACCGAGACCG AGGGTGCCCA GGTCCTCTAC TCCCTCTTCA CCGGACTCGT CCAGTACGAC TCGGAGACCA ACGAGGCCGT CTACACCGGG GTCGCCGACT CGATCGAGTC CGCGGACCAG ACCACCTGGA CCGTCAAGCT CAAGGACGGC TGGACCTTCC ACGACGGCAG CCCGGTGCGC GCCCAGTCGT TCGTCGACGC CTGGAACTGG ACCGCCTACA GCCCCAATGC GGCCAACAGC TCGTACTTCT TCGCCAACAT CGCCGGCTAC GACCAGCTGC AGGCCCCGAC CGACGACGCC GGCAACGTGA CCGGTGACCC GGCCGCCACG GCGATGAGCG GTCTGCGGGT GGTCGACGAC CTGACCTTCG AGGTCACCCT GTCCGCCCCG TACGCCCAGT GGCCCACCAC GGTCGGTTAC AGCGCCTTCT ACCCGCTGCC CCCGGCCTTC TTCGAGGACT CGGCGGCCTT CGGCGAGCAG CCGATCGGCA ACGGCCCGTT CCGTGCCGAT GAGCCCTTCG TCCCCGGTAC CGGCGTCACG CTGACCCGCT ACGACGAGTA CGGCGGCGAC AAGCCGGCCA ACGCCGCGTC TGTGCAGTAC GTGGTCTACG CCGAGCAGGC GACCGCGTAC CGGGACCTGC AGGCCGGGAA CCTCGACATC ATGGACGAGC TCCCGCCGGA CGCCCTCGCT TCCGCCGAGG CCGAGCTGGG CGACCGCCTC CTGCAGGTCG AGCAGGGGGA CATCACCTCG CTGGGCTTCC CGACCTACGA CGAGCGCTTC GCCGACCCGA ACGTGCGGCG TGCCTTTTCG ATGGCGATCG ACCGCGAGTC GATCACCGAG GCGATCTTCC AGGGCACCCG CATCCCGGCG ACGTCCTTCA TCAACCCGGT GGTCGACGGC TACCGCGAGG GCGCCTGTGA CGTCTGCGAG CTGAACGTCG AGGAGGCCAA CCGGCTCCTG GACGAGGCGG GCTTCGACCG CAGCCAGCCG GTCGACCTGT GGTTCAACGC CGGCGCCGGC CACGAGATCT GGATGGAGGC CGTCGGCAAC CAGATCCGGG AGGGCCTCGG CGTCGACTAC ACGCTCCAGG GCAACCTCGA CTTCGCCGAG TACCTGCCGC TGCAGGACGA GCAGGGCATG ACCGGCCCGT TCCGGTCCGG CTGGATCATG GACTACCCGG TCGCCGAGAA CTTCCTCGGC CCGCTGTACT CCAGCACCGC GCTGCCGCCC GGCGGGTCGA ACGTGACCTT CTACAGCAAC CCCGAGTTCG ACGCGCTGCT GCAGCAGGGC AACTCGGCCG ACACCAACGA GGCGGCGATC CAGGCCTACC AGGCCGCGGA GGACCTCCTG CTCCGCGACA TGCCGGCCAC CCCGCTGTTC TACCGGGTCA ACCAGAGCGC CCACTCGGAG AACGTCGACA ACGTGGTCGT CGACGCCTTC AACCGGATCG ACACGGCAGC CGTCCAGGTC GTCGGCTGA
|
Protein sequence | MRLSKRSAAL IATGLTGAMV LSACGGGGDE DGGSGAAAAD GGSFTVYIAE PENPLIPGQT TETEGAQVLY SLFTGLVQYD SETNEAVYTG VADSIESADQ TTWTVKLKDG WTFHDGSPVR AQSFVDAWNW TAYSPNAANS SYFFANIAGY DQLQAPTDDA GNVTGDPAAT AMSGLRVVDD LTFEVTLSAP YAQWPTTVGY SAFYPLPPAF FEDSAAFGEQ PIGNGPFRAD EPFVPGTGVT LTRYDEYGGD KPANAASVQY VVYAEQATAY RDLQAGNLDI MDELPPDALA SAEAELGDRL LQVEQGDITS LGFPTYDERF ADPNVRRAFS MAIDRESITE AIFQGTRIPA TSFINPVVDG YREGACDVCE LNVEEANRLL DEAGFDRSQP VDLWFNAGAG HEIWMEAVGN QIREGLGVDY TLQGNLDFAE YLPLQDEQGM TGPFRSGWIM DYPVAENFLG PLYSSTALPP GGSNVTFYSN PEFDALLQQG NSADTNEAAI QAYQAAEDLL LRDMPATPLF YRVNQSAHSE NVDNVVVDAF NRIDTAAVQV VG
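
The sequence fields above are written in whitespace-separated blocks of ten characters, and the record lists translation table 11. The following is a minimal sketch of how such a field could be cleaned, its GC fraction computed, and its CDS translated. It assumes that the first codon is an initiator and is therefore read as methionine, and that translation stops at the first stop codon. The helper names are hypothetical; the amino-acid string is the codon-to-residue mapping that table 11 shares with the standard code.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as Met (assumption) and
    stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"  # assumption: the first codon (e.g. GTG) initiates translation
    return "".join(residues).split("*")[0]


# First three blocks of the Gene sequence field above.
print(translate_cds("GTGAGGTTGA GCAAGCGCTC TGCTGCGCTC"))  # prints "MRLSKRSAAL"
```

The ten residues printed match the first block of the Protein sequence field. Applying `gc_fraction` to the full Gene sequence field would be the corresponding check on the GC content field.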