Gene Information |
Locus tag | Gobs_2220 |
Symbol | |
ID | 8753891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 2306165 |
End bp | 2307772 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | HNH endonuclease |
Protein accession | YP_003409274 |
Protein GI | 284990720 |
COG category | |
COG ID | |
TIGRFAM ID | |
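The coordinates, gene length, and protein length listed above are mutually consistent, and the check is simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(2306165, 2307772)
print(length)                     # 1608
print(protein_length_aa(length))  # 535
```

Under these assumptions the 1608 bp span corresponds to 536 codons, of which the last is the stop codon, leaving 535 residues.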
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.176223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGTCGTACC CCCGGCCTAG CGTGAGGGCG TGTTCCCTCG GCGGGGGTTT CGGGGTCGGC CTGACGGTGG CCGACCGGAC GCCGCCGCCC CCTGGGTTCG GCCCACCGGC CAACTCCCCA CCGGCGCTGG CCGAGGTGCT GCCGGTGTAC GCGCGGACGG CGGAGGAGAA GGTCGCGGAG CTGCAACGGG TGCAGCAGCT GGAATCCGGA CTCGCCGCCT ACAAGCTGGA GCTGATCGCG TCGTTCGCTG CCGACCGCCC GGCTCAGCTC GATCGCCGGC CCGGGCAACC GGGCGCCGCC GCCGGAGATG ACTCGACGCC GGACGGTGTG TCGGAGTTCT TCGCCGACGA GCTGGCGTTG ACGCTGAACT GCGCCCGGGC GTCGGCGACC ACGCTGACCG AGCACGCGCT CACGCTCACC GGCTCGCTGC GGGCCACGCT GGAGGAGCTG GCGCAGAGCC GACTGGACTG GCCCCGCGCC CGGACCATGG CCGAGGAGCT GGGCGAGAAG GTGGGCGGCA CTCACCCGCA GGTGATCGCC GCGGTCGAGG CCGCGGTGCT GCCCGAGGCG CCGTCGCTGT CCGTCCGCCG GCTCAAGGAC CGGCTGCGCC AGGAGCTGGC CGCCCGGGAC GCCGCGGCCT CCGACCGGGC GCGCGAGGAT GCCCAACGGG CGGTGACCGT GCGCCGCCGG CCGGTGGGCG GCGGCGTCAG CGAGCTGATC GCCGGCATGC CCGACGAGCT GGCCGCGGCG TGTCAGGCGA CGATCGACGA GCTGGCCTGG AGGGCGAAGA AGGCCGGCGA CGACCGTCCG ATCGGGATGC TGCGGGTCGG GGTGCTCGCC GACCTGATCC AGCGGCCCTG GCTGGTGCCC GAGCCGGTGG CCGCCCACGT CGAGGTGCAG GTGCCGCTGC GTGCGCTCAC CCCCGGCGGG TTCCTGGCGC AGGGCTCCCC GCTGCCGCCG GCCTACACCC GGCCGGGGTC GGTGGCCGGA CCCACCGGCG CGGTGGCGGG GGTACCGATC ACCGCCGCGC ACGTCCGCAA CCTGCTCGCC CAGTTTGACG CGATCGGCCT GCAGGCCCCG CCGGGTGGGT CGATCAGCTT CTCCTTCGCC GACGACCGCG GGGCGCTGCG GGCGGTCGCG ACCCTGCGCG AACTGCGGCA GGCTGCCAGC CGGGGTTGCC CTGTCCACCG CGACGGCGCC TGTGACTGCG CGGTCATCGA CCGCCCCGAG GCCACCGACG CCTACGCACC CACCGCCGCG CAAGGCCGCT TCCTCACCAC CCGCGACCGC ACCTGTCGGC ACCCGGGCTG CAGCAACCGC GCCGGGTGGG CCGACGCCGA CCACGTCATC CCCTACGCCC AGGGCGGAGA GACCGACTGC GCCAACCTGT GCTGCCTGTG CCGCCGGCAC CACCGGCTCA AGACCTTCGC CCCGGGCTGG ACCTACGCCA TGACCGCCGA CGGCATCCTC ACCGTCACCA CACCCGCCGG CGTGACCCGC ACCAGCCGAC CACCTGGCCT GCACCTCACC GGCCCCCGAG TGCTCACCCG GCCACCGGAC CAGCCGCCAG CGGCACCCGA CCCCGCCGAC GACCCACCAC CGTTCTGA
Protein sequence | MSYPRPSVRA CSLGGGFGVG LTVADRTPPP PGFGPPANSP PALAEVLPVY ARTAEEKVAE LQRVQQLESG LAAYKLELIA SFAADRPAQL DRRPGQPGAA AGDDSTPDGV SEFFADELAL TLNCARASAT TLTEHALTLT GSLRATLEEL AQSRLDWPRA RTMAEELGEK VGGTHPQVIA AVEAAVLPEA PSLSVRRLKD RLRQELAARD AAASDRARED AQRAVTVRRR PVGGGVSELI AGMPDELAAA CQATIDELAW RAKKAGDDRP IGMLRVGVLA DLIQRPWLVP EPVAAHVEVQ VPLRALTPGG FLAQGSPLPP AYTRPGSVAG PTGAVAGVPI TAAHVRNLLA QFDAIGLQAP PGGSISFSFA DDRGALRAVA TLRELRQAAS RGCPVHRDGA CDCAVIDRPE ATDAYAPTAA QGRFLTTRDR TCRHPGCSNR AGWADADHVI PYAQGGETDC ANLCCLCRRH HRLKTFAPGW TYAMTADGIL TVTTPAGVTR TSRPPGLHLT GPRVLTRPPD QPPAAPDPAD DPPPF
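The gene sequence above starts with GTG while the protein starts with M. Under translation table 11, GTG is one of the alternative start codons and is read as methionine when it initiates a CDS. The following is a rough Python sketch of how the listed sequences relate, assuming the gene sequence is shown in coding orientation (it translates directly, even though the strand is "-"). The helper names `clean`, `gc_fraction`, and `translate_cds` are hypothetical and are not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs only in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons listed by NCBI for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-letter blocks of the gene sequence in this record.
print(translate_cds("GTGTCGTACC CCCGGCCTAG CGTGAGGGCG"))  # MSYPRPSVRA
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field, and passing it to `translate_cds` gives a string that can be compared against the protein sequence once its spaces are removed with `clean`.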