Gene Information |
Locus tag | Arth_3691 |
Symbol | |
ID | 4443692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4151937 |
End bp | 4153715 |
Gene Length | 1779 bp |
Protein Length | 592 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639691515 |
Product | glyoxylate carboligase |
Protein accession | YP_833166 |
Protein GI | 116672233 |
COG category | [R] General function prediction only |
COG ID | [COG3960] Glyoxylate carboligase |
TIGRFAM ID | [TIGR01504] glyoxylate carboligase |
|
|
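The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Arth_3691
length_bp = gene_length(4151937, 4153715)
print(length_bp)                  # 1779
print(protein_length(length_bp))  # 592
```

Both results match the Gene Length (1779 bp) and Protein Length (592 aa) rows.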
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAAGA TGCGCACCGT AGACGCTGCC GTCGCGATCC TGGAAAAGGA GGGCGCCACT GAGGCGTTCG GCCTGCCAGG CGCGGCGATC AACCCGTTCT ACTCCGCCAT GCGGAACCAC GGCGGCATCC GCCACACGCT GGCCCGCCAC GTTGAGGGCG CCAGCCACAT GGCGGACGGC TACTCCCGTG CCGCGGATGG AAACATCGGC ATCTGCATCG GCACCTCCGG CCCGGCCGGC ACGGACATGA TCACCGGGCT GTACGCGGCC TGGGCCGATT CCATCCCCAT GCTCTGCATC ACCGGCCAGG CCCCCGTTGC CAAGCTGCAC AAGGAAGACT TCCAGGCCGT GGACATCGAG TCCATCGCCA AGCCTGTCAC CAAGATGGCC ATGACCATCC TGGAGCCCGG CCAGGTTCCG GGCGCCTTCC AGAAGGCCTT CCAGCTGATG CGCTCCGGCC GTCCCGGCCC GGTGCTGCTG GACCTGCCGA TCGACGTGCA GATGGCAGAG ATCGAATTCG ACATCGACAC CTACGAGCCC CTGCCCGTGG AGAAGCCCAA GGCCAGCCGC AAGCAGCTGG AAAAGGCCCT GGACATGCTG ACCGCAGGCG AGCGCCCGCT GATCGTGGCC GGCGGCGGCG TCATCAACGC CGGCGCTTCC GAGCAGCTGG TGGAACTGGC CGAACTGCTG GGCGTTCCGG TCATCCCCAC CCTGATGGGC TGGGGCGCCA TCTCCGACGA CCACCCCCTG ATGGCCGGCA TGGTGGGCCT GCAGACCTCG CACCGCTACG GCAACGAGAA CTACCTGCGC AGCGACTTCG TGATCGGCGT CGGCAACCGC TGGGCCAACC GCCACACCGG AGGGCTGGAC ACCTACACGG CCGGCCGCAC GTTCGTGCAC ATCGACATCG AGCCCACGCA GATCGGCCGC GTGTTCTCCC CGGACTTCGG CATCGCGTCC GACGCCGGCG CAGCGCTGAC CGGGCTGGTT GAACTGGCCC GCGAACGCCA GGCCGCCGGA TCCCTGCCGG ACTACTCCGC CTGGGCCGCC GAATGCCAGG AGCGCAAGGC CACCCTGCAC CGCAAGACGC ACTTCGAGAA CATCCCCATC AAGCCGCAGC GCGTGTACGA GGAGATGAAC AAGTCCTTCG GCAAGGACAC CACCTACGTT TCCACCATCG GCCTGTCCCA GATCGCCGGC GCCCAGATGC TGCACGTTTT CGGCCCGCGC AAGTGGATCA ACGCCGGCCA GGCAGGGCCC CTGGGCTGGA CCGCTCCGGC CGCCCTGGGC GTGGTGCGCG GCAAGCCGGA CGAGACCGTT GTTGCTTTGT CCGGTGACTA CGATTTCCAG TTCATGATCG AGGAACTGGC CGTGGGCGCG CAGTTCAACC TGCCGTACAT CCACGTGGTG GTGAACAACT CCTACCTGGG CCTGATCCGC CAGTCCCAGC GCGGCTTCAA CATGGAACAG AACGTGTCCC TGGCTTTCGA GAACATCAAC TCCCCGGAAA CAAACGGCTA CGGCGTGGAC CACGTCAAGG TGGCCGAGGG CCTGGGCTGC AAGGCCGTCC GGGTGGAGAA CCCGAACGAT CTGGGTGCCG CCTTCGACAA GGCGAAGGCA CTGGCCGGCG AGTTCAAGGT TCCCGTGGTG GTTGAAGTGA TCCTGGAAAA GATCACCAAC ATCTCCATGG GCGTGGAAAT CAACGCGGTG AACGAGTTCG AGGAACTGGC CGAGACCAGC GCCGACGCCC CCACCGCCAT CCTGGCGATG CAGGCCTAA
|
Protein sequence | MTKMRTVDAA VAILEKEGAT EAFGLPGAAI NPFYSAMRNH GGIRHTLARH VEGASHMADG YSRAADGNIG ICIGTSGPAG TDMITGLYAA WADSIPMLCI TGQAPVAKLH KEDFQAVDIE SIAKPVTKMA MTILEPGQVP GAFQKAFQLM RSGRPGPVLL DLPIDVQMAE IEFDIDTYEP LPVEKPKASR KQLEKALDML TAGERPLIVA GGGVINAGAS EQLVELAELL GVPVIPTLMG WGAISDDHPL MAGMVGLQTS HRYGNENYLR SDFVIGVGNR WANRHTGGLD TYTAGRTFVH IDIEPTQIGR VFSPDFGIAS DAGAALTGLV ELARERQAAG SLPDYSAWAA ECQERKATLH RKTHFENIPI KPQRVYEEMN KSFGKDTTYV STIGLSQIAG AQMLHVFGPR KWINAGQAGP LGWTAPAALG VVRGKPDETV VALSGDYDFQ FMIEELAVGA QFNLPYIHVV VNNSYLGLIR QSQRGFNMEQ NVSLAFENIN SPETNGYGVD HVKVAEGLGC KAVRVENPND LGAAFDKAKA LAGEFKVPVV VEVILEKITN ISMGVEINAV NEFEELAETS ADAPTAILAM QA
|
| |
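The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the coding sequence. It is a minimal illustration, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in which start codons it permits; this gene starts with ATG, so that does not matter here).

```python
BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ..., GGG
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, and its last three codons
print(translate("ATGACCAAGA TGCGCACCGT AGACGCTGCC"))  # MTKMRTVDAA
print(translate("CAGGCCTAA"))                         # QA
```

The two outputs agree with the first ten residues (MTKMRTVDAA) and the last two residues (QA) of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content row.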