Gene Information |
Locus tag | Gobs_0796 |
Symbol | |
ID | 8752453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 848542 |
End bp | 850338 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | type III restriction protein res subunit |
Protein accession | YP_003407931 |
Protein GI | 284989377 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.42368 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCGAGTG CAGCACGCGC GGAGGCGGTC GCCCCGCAGA GGACCAGCCG ACCGCTCCGG GTCTGGCAGC AGGCCGCTCT GGAGAAGTAC GAGCAGGAGT CCCCCAAGGA CTTCCTGGTC ACCGCGACCC CGGGCGCCGG CAAGACGACC TTCGCCCTCA CCCTCGCGTT CCGGCTGCTG CAGCGGCGCG AGGTCGCCCG CGTGGTCGTC GTCTGCCCCA CCGACCACCT GCGCCTGCAG TGGGCCGACG CCGCCGACCG GATGGGCATC GTCCTCGACC CGGGGCTGAC CAACGCCGTC GGCCCGGTGC GCGCCGGCAC CCAGGGCTAC GTGACCACCT ACGCGCAGGT CGCCGGCAAG CCGATGCTGC ACGCCGCCCG CTCCACCGCG GTCAAGACGC TGGTCATCCT CGACGAGGTG CACCACGCCG GTGACGGGCT CTCCTGGGGC GAGGCGGTCG AGGAGGCCTA CGGCTTCGCC GCGCGCCGGT TGTGCCTCAC CGGGACGCCG TTCCGCACCA AGCCCGACGA GCGCATCCCC TTCGTCCGCT ACGAGGAGGA CGGCTTCGAG GGCGACGACG GCGAGGGCGG CATGGGGCTG GTCAGCCGCG CCGACTACAC CTACGGCTAC AAGGAGGCGC TGGCCGACAA CGTCGTCCGC CCGGTCGTCT TCGCCGCCTA CACCGGGACG TCGCGGTGGC GGAACTCCGC CGGCGAGGTG GTCGCCGCCT CGCTGTCCGA GGCCGGCACC CGGTCGGTGG AGATGCAGGC CTGGCGCACC GCGCTGGACC CCAAGGGCCA GTGGGTGCCG CACGTCATCG CGGCCATGGA CGACCGGATC ACCCACCTGC GCGAGGACGG CGGCATGCCC GACGCCGCCG GGCTGATCCT CGCCAGCGAC CAGGACGACG CACGCGCCTA CGCCAAGATC GTCCGCCGGG TCACCGGCAA GGCGCCCGAG CTGATCCTCT CCGACGACCC CAAGGCGTCG AAGAGGATCG AGCGCTTCGC CAACGGCTCG GCCCGGATCG CGGTCTGCGT GCGGATGATC TCCGAGGGCG TCGACGTCCC GCGGGCCGCC GTCCTCGCCT GGATGACCTC CTACCGGACG CCGCTGTTCT TCGCGCAGGC CGTCGGCCGC GTGGTCCGTG CCCGGGCCTC GCACGAGTCG GCGACGGTGT TCCTCCCCGC CGTCCGGCCG CTGCTGGGCC TCGCCGCCTC GATGGAGGAG CAGCGCAACC ACGTCATGCC GCCGCCGAAG ACGGTGCAGG GCGACGAGCT GGACCTGGAG CCGCTGCCTC CCAAGGAGCG CGAGCCGGTC ACCATGAAGC AGTTCCAGGC GCTGGAGGCC GACGCCCGGT TCGCGCACGT GCTGGCCAGC GGCACCGCGC ACACCGGTGA GGGCCGACCC GCTGCCGAGC CGCTCGCCGC CGAGGAGGAT GACTTCCTCG GCATCCCGGG CCTGCTCACC GCCGAGCAGA CGGCGTCGCT GCTGGCCAAG CGGGACGACG AACTGCGGAT CCGGATCGCC CAGCGCGCCC ACTCCGACGA CGACACCGGC GAGCTGCCCA TCGTCGAGGA CCACCCCGAC GAGGACGACG CCGGCCGCTC CTGGCGTGAC GCCGCCGAGC TCCGGCGGGA GGTCAACCGG CTCGTCGCCC GGGTCGCGGC CAAGACCTCC AGACCGCACG GGGTGGTGCA CACCGAGCTG CGCAAGGCCG TGCCCGGCCC ACCATCGGCG TCGGCGTCGG TCGACGTGCT GCGCGCCCGC CGCGAGCGCC TCCGCACGAT GCTCTGA
Protein sequence | MASAARAEAV APQRTSRPLR VWQQAALEKY EQESPKDFLV TATPGAGKTT FALTLAFRLL QRREVARVVV VCPTDHLRLQ WADAADRMGI VLDPGLTNAV GPVRAGTQGY VTTYAQVAGK PMLHAARSTA VKTLVILDEV HHAGDGLSWG EAVEEAYGFA ARRLCLTGTP FRTKPDERIP FVRYEEDGFE GDDGEGGMGL VSRADYTYGY KEALADNVVR PVVFAAYTGT SRWRNSAGEV VAASLSEAGT RSVEMQAWRT ALDPKGQWVP HVIAAMDDRI THLREDGGMP DAAGLILASD QDDARAYAKI VRRVTGKAPE LILSDDPKAS KRIERFANGS ARIAVCVRMI SEGVDVPRAA VLAWMTSYRT PLFFAQAVGR VVRARASHES ATVFLPAVRP LLGLAASMEE QRNHVMPPPK TVQGDELDLE PLPPKEREPV TMKQFQALEA DARFAHVLAS GTAHTGEGRP AAEPLAAEED DFLGIPGLLT AEQTASLLAK RDDELRIRIA QRAHSDDDTG ELPIVEDHPD EDDAGRSWRD AAELRREVNR LVARVAAKTS RPHGVVHTEL RKAVPGPPSA SASVDVLRAR RERLRTML
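The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon (consistent with the TGA at the end of the gene sequence). The function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 848542
END_BP = 850338
GENE_LENGTH_BP = 1797
PROTEIN_LENGTH_AA = 598


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length, and protein length are consistent")
```

Under these assumptions, 850338 - 848542 + 1 gives 1797 bp, and 1797 / 3 - 1 gives 598 aa, matching the listed values.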