Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3644 |
Symbol | |
ID | 8755329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 3823090 |
End bp | 3824316 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | HNH endonuclease |
Protein accession | YP_003410600 |
Protein GI | 284992046 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.499825 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGCAGC TGCTGGCCTC CCTCGACGCC CTGGCCGCCG AGGACCTGGC CCCGTTGTTC GGCCCGGCGC TGCTGGACCG GCTGGGTCGG CTGCTGGTGG CGCAGAACCG GTTCGCCGCC GAGGTGGCCC GCACCGTGCG CGAGGCCGAG GTGTCCGGTG CCGCGGAGGT CGACGGGCTG AAGACGATGA CCTCCTGGCT GCGCGGGCAC GCGCATCTGT CCACGTCCGA GGCCGCGCGG GTGGTGCGCG CGGGTCGGGC GCTGGCGCAC CTGCCGGCCA TGGCCACGGC GTTCGCCGCC GGTGACGTGA CCGCCGAGCA GGCCGGGCTG CTCGGCCGGG TCGCCGAGCC GGAGGCTCTG GCTCTGGCCG CCGGGCAGGG CGTGGACCTG GGCGGCGTGG ACACCGTGCT CACCGAGGTG GCTGTGACCC GTCCGCACGC CGACACCGCC AAGGCGGTGC ACCACTACCT GGACCACCTC GACGCCGACG GCCCGGAGCC CGACCCCACC GAGGGGCGGC GGCTGAGCAT CGCCCGGCAC GCCGACGGGT CGATCTCCGG CCGCTTCGAC CTCGATGCGG TGGGTGGGGA GAAGCTGCAG ACGGCGCTGG AGTCCGTGGT GCAGAGCGGG CGGTGTGCCG GGGATGAGCG GACCCGGGCC CAGCAGCAAG CGGATGCGCT GGTGCAGCTG TGCGACAACC AATTGGCCTC CGGTCAGCTG CCCATGCTCC GTGGGCACAA GCCGCAGGTG TTGGTCAAGG TCGGCATCGA GGACCTGGTC GACGCGGCCA CCGGTGCCGG CGCCGCGAAG CTGGGGTTCG GCGCCACTAT TTCCGCCGCC CGGGCGCGGA GGATCGCGTG CGACGGCACC CTCACCCGGA TCGTGATGGG CCCGGACGGG AAACCGCTGG ACTACGGCCG CAGCGTGCGC CTGGTGCCGC CGCACGTGCG CCGGGCCGCG GAAGTGCGGG ACGGTGGGTG CGTGTTCGCC GGCTGCGGTG CGCCGACCTC CTGGTGCGAC GTCCACCATC TGCTGGAGTG GGCCAACGGC GGCCAGACCA GCCTCGACAA CAGCGCGCTG CTGTGCGAAC GGCACCACAC GAAGGTCCAC CACGGCTTCC GGGTCGAGCG ACAACCCGAC GGCCGATGGC GCACCTGGCG CCCCGACGGC ACCGAGATCC GCACCGGACC GGGCCGCACC GGCCCACCCC TTCCCGCCGC CGCCTGA
|
Protein sequence | MEQLLASLDA LAAEDLAPLF GPALLDRLGR LLVAQNRFAA EVARTVREAE VSGAAEVDGL KTMTSWLRGH AHLSTSEAAR VVRAGRALAH LPAMATAFAA GDVTAEQAGL LGRVAEPEAL ALAAGQGVDL GGVDTVLTEV AVTRPHADTA KAVHHYLDHL DADGPEPDPT EGRRLSIARH ADGSISGRFD LDAVGGEKLQ TALESVVQSG RCAGDERTRA QQQADALVQL CDNQLASGQL PMLRGHKPQV LVKVGIEDLV DAATGAGAAK LGFGATISAA RARRIACDGT LTRIVMGPDG KPLDYGRSVR LVPPHVRRAA EVRDGGCVFA GCGAPTSWCD VHHLLEWANG GQTSLDNSAL LCERHHTKVH HGFRVERQPD GRWRTWRPDG TEIRTGPGRT GPPLPAAA
|
| |