Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_0803 |
Symbol | |
ID | 8752460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 855796 |
End bp | 857481 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003407938 |
Protein GI | 284989384 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.310817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTGACG GCCCCCTGAT CGTCCAGTCG GACAAGACCC TGCTGCTCGA GGTCGACCAC CCCGCCGCCC GCGACTGCCG CGCGGCGATC GCGCCGTTCG CGGAGCTGGA GCGCTCCCCG GAGCACGTGC ACACCTACCG GGTGACGCCG CTGGCGCTGT GGAACGCGCG GGCCGCCGGG CACGACGCCG AGCAGGTCGT CGACGCACTG GTCCGCCACT CCCGCTACCC GGTGCCGCAC GCGCTGCTGG TCGACATCGC CGACACGATG GACCGGTTCG GCCGGCTCAC GCTGGCCAAC AACCCGGTGC ACGGGCTGGT GCTCACCACC TCCGACCGCG CGGTGCTCGA GGAGGTCGTC CGCAGCAAGC GGGTGGCGCC GATGCTGGGC GCGCGGATCG ACGAGGACAC CGTGGTCGTC CACCCGTCGG AGCGCGGGCG GCTCAAGCAG GCCCTGCTCA AGATCGGCTG GCCCGCCGAG GACCTGGCCG GCTACGTCGA CGGCCAGGCG CACCCGATCG AGCTGGCCCA GGACGGCTGG CACCTGCGCG ACTACCAGCA GGAGGCGGTC GAGGGGTTCT GGGCCGGCGG CTCGGGCGTC GTCGTCCTCC CCTGTGGCGC GGGCAAGACG CTGGTCGGCG CCGCGGCGAT GGCCGAGGCG AAGGCGACCA CGCTGATCCT GGTGACCAAC ACCGTCGCCG GGCGGCAGTG GAAGCGCGAG CTGATTGCCC GCACCTCGCT GACCGAGGAG GAGATCGGCG AGTACTCCGG CGAGCGCAAG GAGATCCGCC CGGTCACCAT CGCCACCTAT CAGGTGATCA CCACCCGTCG GAAGGGCGAG TACCGGCACC TGGACCTGTT CGACGCCCAG GACTGGGGCC TGATCGTCTA CGACGAGGTG CACCTGCTGC CCGCACCGAT CTTCCGGCTC ACCGCCGACC TGCAGTCCCG CCGTCGGCTG GGCCTGACCG CCACGCTGGT GCGCGAGGAC GGCCGCGAGG ACGACGTCTT CTCTCTCATC GGCCCGAAGC GCTACGACGC ACCGTGGCGT GACATCGAGG CGCAGGGCTA CATCGCGCCG GCCGAGTGCA TCGAGGTGCG GGTGTCCCTC GACGACGAGG AGCGGATGAC CTACGCGGTC GCCGAGCCCG AGGAGCGGTA CCGGATCGCC GCGACCGCGA CGTCGAAGCT GCCGGTCATC CGGCGGGTGC TCGACCGGCA CCCCGACGAG CAGAAGCTGG TCATCGGCGC CTACCTCGAC CAGCTCGACG AGCTCGGCCG GGCGCTCGAC GCCCCGGTCA TCCAGGGGTC GACGACCAAC CGGGAGCGGG AGAAGCTCTT CGACGCCTTC CGCGCCGGGG AGGTCAAGAC CCTGGTGGTG TCCAAGGTCG CCAACTTCTC CATCGACCTG CCCGAGGCCG CCGTCGCCGT CCAGGTGTCG GGGACCTTCG GCTCGCGCCA GGAGGAGGCC CAGCGACTGG GCCGGGTGCT GCGCCCCAAG GCCGACGGCC GGCAGGCGCA CTTCTACACG GTGGTCAGCC GCGACACCCT CGACGCCGAG TACGCCGCCC ACCGGCAGCG CTTCCTGGCC GAGCAGGGCT ACGCGTACAC CATCGTCGAC GCCGCCGACC TGGCCGGCCC CGGTGAGGTC AACGGCCCCG ACTGGGTGGA CGAGCCCGCC GACTGA
|
Protein sequence | MSDGPLIVQS DKTLLLEVDH PAARDCRAAI APFAELERSP EHVHTYRVTP LALWNARAAG HDAEQVVDAL VRHSRYPVPH ALLVDIADTM DRFGRLTLAN NPVHGLVLTT SDRAVLEEVV RSKRVAPMLG ARIDEDTVVV HPSERGRLKQ ALLKIGWPAE DLAGYVDGQA HPIELAQDGW HLRDYQQEAV EGFWAGGSGV VVLPCGAGKT LVGAAAMAEA KATTLILVTN TVAGRQWKRE LIARTSLTEE EIGEYSGERK EIRPVTIATY QVITTRRKGE YRHLDLFDAQ DWGLIVYDEV HLLPAPIFRL TADLQSRRRL GLTATLVRED GREDDVFSLI GPKRYDAPWR DIEAQGYIAP AECIEVRVSL DDEERMTYAV AEPEERYRIA ATATSKLPVI RRVLDRHPDE QKLVIGAYLD QLDELGRALD APVIQGSTTN REREKLFDAF RAGEVKTLVV SKVANFSIDL PEAAVAVQVS GTFGSRQEEA QRLGRVLRPK ADGRQAHFYT VVSRDTLDAE YAAHRQRFLA EQGYAYTIVD AADLAGPGEV NGPDWVDEPA D
|
| |