Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_4103 |
Symbol | |
ID | 8755794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 4314656 |
End bp | 4317529 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | 1,4-alpha-glucan branching enzyme |
Protein accession | YP_003411039 |
Protein GI | 284992485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCCC CGGACGAGAC CGAGCGCACC ACCGACTCCC CCGGAAGCGG CGGGCGCGCG CCCGCGAGCC GCGAGGAGCT GCAGGCGGCC GTCGAGGAGA AGACCGCCGC GGCCGAGGCC GCCGAGGCGG AGCCGGTCGT CAACGCCACC CCGGCACGCA AGCGGGCCGC GGCGAAGAAG GCCCCGGGGG CGGGGAAGGC CCCGGCGAAG AAGGCGACTG CCAAGAAGAC CGCCGCGGCG TCCGATGGCG AGGCCCCGGC GAAGAAGGCG ACCAAGCGTG CGCCGCGGAA GAGGGCGGCC CAGGCCGACG CGCCCCAGGG CGAGGTCGTC GCCGAGCAGG GCGGCACCGC CGCACCGGTG GTCGCCCCGG CGCCGATCGC CGAGCCGGGC AGCCCGAGCG GCGCCCCGGC CGGCCAGCCG GAGCCCCCGA GCCCGGACCC GGCGGAGCCG AGCACGCCGC AGGCCCCGGA CGAGGACGGC GGCGCGGGCC CGGCCGAGAC CCAGCCTTCG GAGACACCGT CCGCGAACAC CCCGCCATCG GCGGCCCGGC CCGTCGACAC GCCCCGGCCC GCCGAGGACG CCACGGCCGG CACCGGGTCG GAGGTGGCGG CGGCGACCGT CCCCGAGGGG CAGGCCCCCG CCGAGGAGGG CCTCGCCCAG CAGGAGGCGC CCGCCGAGGT CAGCGAGGAG CAGCTGCGCG CCGTCGTCGA CGGGTGGTCC TACGCCCCGC ACAGCGTGCT CGGCGCGCAC CCCGCCCGCG ACGGCTGGGT GGTCCGCACG CTGCGGCCCG ACGCGGTCTC GGTGACCGTC GTCGACGAGG ACGGCTCCCG CCACGACACC CGTCAGCTGC ACCCGGGTGG CGTCTTCGAG GCCCACCTGC CCACGCAGCC CGGGGACTAC CGGGTCGAGG TGACCTACGG CGACGGCGCG GACGGCACCA ACACCTTCGT CGTCGACGAC CCGTACCGCT GGCTGTCCAC CATCGGTGAG CTCGACCAGC ACCTCATCCG CGAGGGCCGG CACGAGCAGC TGTGGGAGGT GCTCGGCGCC CACGTGCGGC GCTACGACAC CCCGCGCGGC CAGGTCGAGG GCGTCTCCTT CGCCGTCTGG GCGCCCAGCG CCCAGGGCGT GCGGGTGACC GGCGACTTCG ACTACTGGGA GGCGCGCGCC TACCCGATGC GGTCGCTGGG CTCCTCCGGC GTCTGGGAAA TCTTCATCCC GGGCGTCCAG GTGGGCGTGA AGTACCGCTA CCACGTGCTC GGTCGCGACG GCGTGTGGCG GCACAAGTCC GACCCGCTGG CCTTCGAGAC CGAGGTGCCG CCGCTCAACG CCTCGATCGT CACCGAGTCC CACCACGAGT GGGCCGACGA CGAGTGGTTG GCCGAGCGGG CCCGCGGCGG CTGGCACCAG CGGCCGATGA GCGTCTACGA GGTGCACGCG GGTTCGTGGC GGCAGGGCCT GTCCTACCGC GAGCTGGCCG ACGAGCTGGT CGGCTACGTC GTCGAGCACG GCTTCACCCA CATCGAGTTC ATGCCGCTGG CCGAGCACCC CTTCGGCGGC TCCTGGGGCT ACCAGGTCAC CTCCTACTAC GCGCCCACCT CGCGCTTCGG CAGCCCCGAC GACCTGCGCT ACCTCATCGA CCGGGCGCAC CAGGCCGGCA TCGGCGTCAT AGTCGACTGG GTCCCGGCGC ACTTCCCCAA GGACGACTGG GCGCTGGCCC GCTTCGACGG CACCCCGCTG TACGAGCACG GCGACCCGCG CCGCGGCGAG CAGCTGGACT GGGGGACCTA CGTCTTCGAC TTCGGCCGCT CCGAGGTCCG CAACTTCCTC GTCGCCAACG CGCTGTACTG GTGCAAGGAG TTCCACGTCG ACGGCATCCG CGTCGACGCG GTCGCCTCGA TGCTCTACCT GGACTACTCC CGCGACGAGT GGGTGCCCAA CGTGCACGGC GGCCGGGAGA ACCTGGAGGC GATGGCGTTC CTGCAGGAGA TGAACGCCAC CGTCTACCGC GAGGTCCCCG GCGTCGTCAC CATCGCCGAG GAGTCGACCG CCTGGCCCGG CGTCACCCGG CCCACCCACC TCGGCGGCCT GGGCTTCGGC TTCAAGTGGA ACATGGGCTG GATGCACGAC TCGCTGGGCT ACATGTCCAA GGAGCCGGTG TACCGCGGCT ACCACCACGG CCAGTTGACG TTCTCGATGG TCTACGCCTA CTCCGAGAAC TACGTGCTGC CGATCAGCCA CGACGAGGTC GTCTACGGCA AGGGCTCGCT GCTGCGGAAG ATGCCCGGGG ACCGGTGGCA GCAGCTGGCC AACCTGCGCG GCTACCTCGC CTACATGTGG GCCCACCCCG GCAAGCAGCT GCTGTTCATG GGGTCGGAGT TCGCCCAGGA CGCCGAGTGG GCCGAGAGCC GCTCGCTGGA CTGGTGGCAC CTCGACGACC CGGCGCACCG CGGCGTCCTG CAGCTGGTGA CCGACCTCAA CGCCCGGTAC AAGGAGACCG CCGCGCTGTG GTCCCAGGAC GTCGACCCCG CCGGCTTCCA GTGGATCGAC GCCAACGACG CCTCGGGGAA CGTGCTGTCC TTCCTGCGCT ACGGCCGCAC CGAGGCCACC GGCGACGGCT CGGACGGCGC CGGTGGCGAG GCGCTGGCCT GCGTCGCGAA CTTCTCGGGC ACCCCGCACC ACGGCTACCG CGTCGGCCTG CCGCGGCCGG GCACCTGGCG CGAGGTGCTC AACACCGACG CCGAGGGCTA CGGCGGCTCG GGCGTCGGCA ACCACGGCTC CGTCGAGGCG GTCGAGCAGC CCTGGCACGG TCAGCCCTAC TCGGCCACCC TCGCCGTCCC GCCCCTGGGC ACCGTCTGGT TCGTGCACGA GTAA
|
Protein sequence | MTSPDETERT TDSPGSGGRA PASREELQAA VEEKTAAAEA AEAEPVVNAT PARKRAAAKK APGAGKAPAK KATAKKTAAA SDGEAPAKKA TKRAPRKRAA QADAPQGEVV AEQGGTAAPV VAPAPIAEPG SPSGAPAGQP EPPSPDPAEP STPQAPDEDG GAGPAETQPS ETPSANTPPS AARPVDTPRP AEDATAGTGS EVAAATVPEG QAPAEEGLAQ QEAPAEVSEE QLRAVVDGWS YAPHSVLGAH PARDGWVVRT LRPDAVSVTV VDEDGSRHDT RQLHPGGVFE AHLPTQPGDY RVEVTYGDGA DGTNTFVVDD PYRWLSTIGE LDQHLIREGR HEQLWEVLGA HVRRYDTPRG QVEGVSFAVW APSAQGVRVT GDFDYWEARA YPMRSLGSSG VWEIFIPGVQ VGVKYRYHVL GRDGVWRHKS DPLAFETEVP PLNASIVTES HHEWADDEWL AERARGGWHQ RPMSVYEVHA GSWRQGLSYR ELADELVGYV VEHGFTHIEF MPLAEHPFGG SWGYQVTSYY APTSRFGSPD DLRYLIDRAH QAGIGVIVDW VPAHFPKDDW ALARFDGTPL YEHGDPRRGE QLDWGTYVFD FGRSEVRNFL VANALYWCKE FHVDGIRVDA VASMLYLDYS RDEWVPNVHG GRENLEAMAF LQEMNATVYR EVPGVVTIAE ESTAWPGVTR PTHLGGLGFG FKWNMGWMHD SLGYMSKEPV YRGYHHGQLT FSMVYAYSEN YVLPISHDEV VYGKGSLLRK MPGDRWQQLA NLRGYLAYMW AHPGKQLLFM GSEFAQDAEW AESRSLDWWH LDDPAHRGVL QLVTDLNARY KETAALWSQD VDPAGFQWID ANDASGNVLS FLRYGRTEAT GDGSDGAGGE ALACVANFSG TPHHGYRVGL PRPGTWREVL NTDAEGYGGS GVGNHGSVEA VEQPWHGQPY SATLAVPPLG TVWFVHE
|
| |