Gene Information |
Locus tag | Gobs_3785 |
Symbol | |
ID | 8755470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 3965252 |
End bp | 3966748 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | HNH endonuclease |
Protein accession | YP_003410732 |
Protein GI | 284992178 |
COG category | |
COG ID | |
TIGRFAM ID | |
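
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Both are common conventions but are not stated in this record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3965252
end_bp = 3966748

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1  # drop the stop codon

print(gene_length)     # 1497
print(remainder)       # 0
print(protein_length)  # 498
```

Under these assumptions the result agrees with the reported Gene Length (1497 bp) and Protein Length (498 aa).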
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0713491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTCAG CCGGTCCCGC GCGTACGCCG CTGGAGGCCG AGCTGGTCGG CCGGCTGCTC GAGCGGCCGC CGACCAGCGC GCGGCTGCCT GTGGGGCTGC TGACCCGGGC GGAGAAGGCC GCCGAGCTGC AGCGCCTGCA GGCCCGCAAG GCGATGGACG CCGCCTACGA GGCCGAGCTC GTCATGGGCC TGGCCGACGA CACCCCGGAC TCCCTCGACC CGCCGCCGGG CCACCCCGGC GCCAGGAAGG GCTCGTGGGC CCCGGATCCC GAGCTGCCCG GGGTGAGCGA GTTCTTCACC TCCGAGCTGG CAGTGGTGCT CAACTGCGGC CGGGGCACCG CCTCCCACCT GGCGCACCGC GCCTGGACCT ACCGGGGGAA CCTGCCGGCC ACCTGGGCCG CGCTGGCCGA TGGGGTTCTG GACGAGCCCC GCGCGAAGGT CCTCGCCGAC GTCCTCACCC ACACGACACC GGCGATCGCC CGGGGAATCG AGTCGCGGCT GCTGCCCGAG GCGCCCGGCC TGTCCACCGG CCGGTTGCGG GCCCGGGCGC TGGCACTTCT GCTGGAACTC GATACCGACG CCGTCGACGC GCGGCGCAAG GACGCTCGTC GGCAGGCCGA CGTGCGCTCC TATCCCTCAC ACCTGGAGGG CATGAGCACG CTGGCTGCGG ACCTGCCCAC CCCGGTGTCG GCCGAGTGCC TCGACGTGGT CGACCGGTTG GCAGCGATGC TCAAGACCGA TGGCGACCCC CGGCCGATCG GCGAGCTGCG CGCCGTGGTG CTGGCTGACC TGATCCGCCG TCCCTGGGAC ACCAGCCGGA CGCCGGTCAC AGCTCAGCTG ACGATCACCG CCGCACTCGA CGCGCTGGCC GGCCGGACCG ACCAGCCCGG GGAGGTCAAC GGGCAGCCGA TCACCGCCGC CCAGCTGCGC GAGCTGCTCA TCCGGCTCGG TGCCCTGGGG CTGCAGACAC CCGAGGGCGG CACGGTGACC CTCGCGGTCA CCGACGACGG CGCTCTGGTG GCCACCACCA CCCTCGACCA GCTGCGCCGT CTGGCCCGCC GTGGCTGCGC CACCCACCAC GAGCAGGACT GCGGCTGCCC GGTGCTCGAC CGACCGGCAC CCACCGACGC CTACCCACCC ACCGCCGCCC AGGACGCCTT CGTCACCACC CGCGACCGCG CCTGCCGCTT CCCCAACTGC GGCCAGCGCG TCGGCTGGAC CGACCGCGAC CACGTCGTCC CGCACGCCGA CGGCGGCGCC ACTGACTGCG CCAACCTGTG CTGCCTGTGC CGCAGCCACC ACCGCCTCAA GACCCACGCC CGCGGCTGGC GATTCGCCAT GGACAACGAC GGCGCCCTGC ACGTCACCAC ACCATCGGGC GTCACCCGCA CCACCCGACC ACCCGGCCTG CGACCATCCC AGCCACCCGG ATCAACAGCG GCCGCCTCGA CTCCGCCAGC GGTGTCCATC TCGGACGACG ATCCGCCACC CTTCTGA
|
Protein sequence | MRSAGPARTP LEAELVGRLL ERPPTSARLP VGLLTRAEKA AELQRLQARK AMDAAYEAEL VMGLADDTPD SLDPPPGHPG ARKGSWAPDP ELPGVSEFFT SELAVVLNCG RGTASHLAHR AWTYRGNLPA TWAALADGVL DEPRAKVLAD VLTHTTPAIA RGIESRLLPE APGLSTGRLR ARALALLLEL DTDAVDARRK DARRQADVRS YPSHLEGMST LAADLPTPVS AECLDVVDRL AAMLKTDGDP RPIGELRAVV LADLIRRPWD TSRTPVTAQL TITAALDALA GRTDQPGEVN GQPITAAQLR ELLIRLGALG LQTPEGGTVT LAVTDDGALV ATTTLDQLRR LARRGCATHH EQDCGCPVLD RPAPTDAYPP TAAQDAFVTT RDRACRFPNC GQRVGWTDRD HVVPHADGGA TDCANLCCLC RSHHRLKTHA RGWRFAMDND GALHVTTPSG VTRTTRPPGL RPSQPPGSTA AASTPPAVSI SDDDPPPF
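
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, not part of the original record, showing how the gene sequence could be de-blocked, how a GC fraction could be computed, and how the nucleotides could be translated with the standard codon assignments (which translation table 11 shares with table 1 for amino-acid identities). The function names `clean_sequence`, `gc_content`, and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments, codons enumerated in TCAG order.
# Assumption: translation table 11 uses these same amino-acid assignments;
# it differs from table 1 mainly in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned nucleotide sequence codon by codon.

    Translation stops at the first stop codon, which is not included in
    the returned protein. Trailing bases that do not fill a codon are ignored.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three ten-base blocks of the gene sequence in this record.
first_blocks = clean_sequence("ATGAGGTCAG CCGGTCCCGC GCGTACGCCG")
print(translate(first_blocks))  # MRSAGPARTP
```

The printed peptide matches the first block of the protein sequence listed above. Running `gc_content` on the full cleaned gene sequence gives a value that can be compared with the 74% reported in the Gene Information table.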