Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3738 |
Symbol | |
ID | 8755423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 3904713 |
End bp | 3907820 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003410688 |
Protein GI | 284992134 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGACA CGCCCACGGG GCTGTACGAG CACCTGGTCA CCGATCGGCT CAACAAGGAC CTGCCCGCGG CCGCTCCGGA CCTTATCCAG CTCGGCACGC TCGACCCGGC CGATGCGCAC GAGCCGCTGA CCCGCCACAT CGCCGTACTC GTCAGCCGGG CCTTGCGGGC GGCCGGGGGG AGCGACGCAA CGGCCATCAA TCGGCAGGTC GAGCTAGCCA ACCGCCTCGT TGCCGCCATC GGCGACCTGG CGCCGGATGT CGCCGACATG GACGACGCCG TCACCGCCCC GGCCCGAACC CTTCTGGCGA TCGCTCCGCC AAGTGGCATC CCCGGTCCCA CGTCATTCCC CGAGCGCCCC GCAGTCCCGC TGTCGACGAG CGCCTTGCTC GTCAACGGCC GCGGACAACC GCGGATCGGC TACGAGGTGA CCCGCGAGAT GGCCTCGGCT GACCGGGTGG ACCTGCTGTG TGCCTTCATC AAGTGGCAGG GCCTACGGGT CATCGAGAAG CAGCTCACCG AGCTGCGCGA GCGCGGCGGC CGGCTGCGTG TCATCACCAC CACGTACATG GGCGCCACTG ACCAGCGCGC ACTCGACCGC CTGGTCGAGC TGGGCGCAGA GGTCAAGGTC TCCTACGAGA CCCGGACGAC GCGGCTGCAC GCCAAGGCTT GGCTCTTCCA CCGCGTCACC GGCCTGTCGA CGGCATACGT CGGCTCGTCG AACCTCTCCC GCACCGCCCT CACCGACGGT CTCGAGTGGA ACGTGCGGTT GTCCAGCGTC GAGCAGGCGC ACCTGCTCAC CACGTTCGCC GAGACCTTCA ACAGCTACTG GCTCGACCCG TCGTTCGAGA CCTACGAACC GCAGCGTGAC GGCGACCGGC TCCGCGAGGC GCTGGCCGCG GAGCGCGGCG GCCCGTCCGA CCTGCCCATC GAGATCACGA CGCTCGACGT GCGCCCCTTC GGCTACCAGC AGGAGATCCT CGAGGCACTC GACGCCGAGC GGACCGTCCA CGGCCGGTGG CGCAACCTCG TGGTGATGGC GACCGGCACT GGCAAGACGG TCGTTTCCGC GCTGGACTTT CGGCGGCTGC GCGAGAGCGG CATGGTCGAC CGGCTCCTCT TCGTCGCCCA CCGCGAGGAG CTGCTGACGC AGAGCCACTC TACGCTCAGG CACGTGCTGC GCCAGGGTGA CTTTGGCGAG CTCTTCGTCG GCGGGCAGCG TCCTCGCGAG TGGCGGCACG TGTTCGGCTC CGTGCAGTCG CTCAACCAGC TCGACCTCGG CGAGCTCGAC CCCAGTCACT TTGACGTCGT CATCGTCGAC GAGTTCCACC ACGCCGAGGC GAGCACCTAC CGGAGACTGC TCGAGCACCT CAAGCCGACC GTCCTCCTCG GCCTGACGGC GACGCCGGAA CGAACGGACG GCGCCGACGT GCGCACCTGG TTCGGCGGAC GCACCGCTGT CGAGCTGCGA CTGTGGGAGG CACTCGAGCA GAACCTCCTC GCGCCGTTTC AGTACTTCGG CGTCCACGAC GACGTGGCGC TCGACCGGCT GCGGTGGAAG CGCGGCCGCG GCTACGACGT CACCGAGCTG AGCAACGTCT ACACCGGCGA CGACCACCGC GTCCGGCTCG TCCTGCAGGC GGTCAAGGAC AAGGTCGAGG ACCCCGGCCG GATGCGGGCG CTGGGCTTCT GCGTCAGCAT TCAGCACGCG CAGTACATGG CCGACCGGTT CACCAGGGCC GGCATCCCGA GCCGGGCTGT GACGTCGACG TCGTCTCGTG AGGAGCGGGC GGCGGCACTG GCCGCCCTGC GCGACCGGGC GGTGAACGTC CTCTTCACCG TCGATCTGTT CAACGAGGGC CTCGACATCC CGACCGTCGA CACCGTGCTG TTCCTACGGC CGACCGAGAG CGCGACGGTG TTCCTCCAGC AGTTGGGCCG CGGCCTGCGG CTGGCCGAGG ACAAGGCCTG CCTGACCGTC CTCGACTTCA TCGGCGCCCA GCACAAGAAG TTCCGCTTCG ACCTGCGCTT TCGTGCGCTG ACCGGCAGCT CTCGACGCGG GCTACAGCGC GACGTCGAGC AGGGCTTCCC CACCCTGCCG GCGGGCTGCC ACATCGAGTT GGACCGGGTG GCGCAGCGGA TCGTCCTCGA CAACATCAAG CAGTCGCTGA GCGTGCCGTG GCAAGAACTG GTCAGCGAGG CGAAGCGCGA GCAGTCACCG TCGCTGTCCG AGTTCCTCGA GGAGACGGGC GTCGAGCTCG AGGACCTGTA CCGCGGCAAC GGTCGCAGTT GGTTGGACCT CAAGCGGGCC GCCGGCTGGG TCGACGACGC GCCAGGACCG GACGACTCCG CGCTGGGTGC CGCGCTGGGC CGCATGCTGC ACGTCGACGA CCTGGAGCGG TTGCGGTTCT TCGGGTCAAT TACGCGGTTG GACACCGTGG CGCAGACCAG CGTCCGCATA CGGCGGCTGG CCGCGATGCT GCACTTCTCG CTGTTCGGCT CACAGAGGCC GTTCAGCAGC GGCGAGGCGT CGCTCGCCCG ACTGTTGGCC CACCGGGGAC GCGCGGAAGA GCTCGTCGAG TTGAGCGACG TCCTGCACGG GCGAATTCAC CGGGTGGCCC GACCGTTGGC TGAAGCCGGC GACCGTCCAC TGCACGTGCA CGCCCGCTAT AGCCAGAACG AGGCTCTGGC GGCCTTCGGC AAGACGAATT TGAGCGGCTC ATTCGGTGCG GGCGTGCACT GGGTGGAGGA GGACCAAGCC GACGTCTTCT TTGTGACCTT AAAGAAGACT GAGACGCATT ACTCGCCCAC GACGATGTAC GCGGACCATG CCATCTCTCC AACCATGTTC CAGTGGGAGT CTCAGAACAC GACCTCTGAG CGCACCGCGG TAGGCAAGCG ATACATCCAC CACCGAGACA TGAGCACGTC CGTGCACCTG TTTGTCCGCG AGACCAAGAC GGCGGATGGA ACACTCGGGA CGCCGCCGTA CCTCTACGCC GGGCCAATGT CTTACGTCTC CCACACCGGC GAACGCCCCA TGCGGATCCT CTGGCGACTC GAACACGCAC TCCCGGCCGA TGTCTTCCAC GCCGCTAGAG TCGCGTGA
|
Protein sequence | MGDTPTGLYE HLVTDRLNKD LPAAAPDLIQ LGTLDPADAH EPLTRHIAVL VSRALRAAGG SDATAINRQV ELANRLVAAI GDLAPDVADM DDAVTAPART LLAIAPPSGI PGPTSFPERP AVPLSTSALL VNGRGQPRIG YEVTREMASA DRVDLLCAFI KWQGLRVIEK QLTELRERGG RLRVITTTYM GATDQRALDR LVELGAEVKV SYETRTTRLH AKAWLFHRVT GLSTAYVGSS NLSRTALTDG LEWNVRLSSV EQAHLLTTFA ETFNSYWLDP SFETYEPQRD GDRLREALAA ERGGPSDLPI EITTLDVRPF GYQQEILEAL DAERTVHGRW RNLVVMATGT GKTVVSALDF RRLRESGMVD RLLFVAHREE LLTQSHSTLR HVLRQGDFGE LFVGGQRPRE WRHVFGSVQS LNQLDLGELD PSHFDVVIVD EFHHAEASTY RRLLEHLKPT VLLGLTATPE RTDGADVRTW FGGRTAVELR LWEALEQNLL APFQYFGVHD DVALDRLRWK RGRGYDVTEL SNVYTGDDHR VRLVLQAVKD KVEDPGRMRA LGFCVSIQHA QYMADRFTRA GIPSRAVTST SSREERAAAL AALRDRAVNV LFTVDLFNEG LDIPTVDTVL FLRPTESATV FLQQLGRGLR LAEDKACLTV LDFIGAQHKK FRFDLRFRAL TGSSRRGLQR DVEQGFPTLP AGCHIELDRV AQRIVLDNIK QSLSVPWQEL VSEAKREQSP SLSEFLEETG VELEDLYRGN GRSWLDLKRA AGWVDDAPGP DDSALGAALG RMLHVDDLER LRFFGSITRL DTVAQTSVRI RRLAAMLHFS LFGSQRPFSS GEASLARLLA HRGRAEELVE LSDVLHGRIH RVARPLAEAG DRPLHVHARY SQNEALAAFG KTNLSGSFGA GVHWVEEDQA DVFFVTLKKT ETHYSPTTMY ADHAISPTMF QWESQNTTSE RTAVGKRYIH HRDMSTSVHL FVRETKTADG TLGTPPYLYA GPMSYVSHTG ERPMRILWRL EHALPADVFH AARVA
|
| |