Gene Arth_3691 details

Gene Information

Locus tag: Arth_3691
Symbol:
ID: 4443692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4151937
End bp: 4153715
Gene Length: 1779 bp
Protein Length: 592 aa
Translation table: 11
GC content: 66%
IMG OID: 639691515
Product: glyoxylate carboligase
Protein accession: YP_833166
Protein GI: 116672233
COG category: [R] General function prediction only
COG ID: [COG3960] Glyoxylate carboligase
TIGRFAM ID: [TIGR01504] glyoxylate carboligase
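
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(4151937, 4153715)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the coordinates reproduce the listed gene length of 1779 bp and protein length of 592 aa.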


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAAGA TGCGCACCGT AGACGCTGCC GTCGCGATCC TGGAAAAGGA GGGCGCCACT 
GAGGCGTTCG GCCTGCCAGG CGCGGCGATC AACCCGTTCT ACTCCGCCAT GCGGAACCAC
GGCGGCATCC GCCACACGCT GGCCCGCCAC GTTGAGGGCG CCAGCCACAT GGCGGACGGC
TACTCCCGTG CCGCGGATGG AAACATCGGC ATCTGCATCG GCACCTCCGG CCCGGCCGGC
ACGGACATGA TCACCGGGCT GTACGCGGCC TGGGCCGATT CCATCCCCAT GCTCTGCATC
ACCGGCCAGG CCCCCGTTGC CAAGCTGCAC AAGGAAGACT TCCAGGCCGT GGACATCGAG
TCCATCGCCA AGCCTGTCAC CAAGATGGCC ATGACCATCC TGGAGCCCGG CCAGGTTCCG
GGCGCCTTCC AGAAGGCCTT CCAGCTGATG CGCTCCGGCC GTCCCGGCCC GGTGCTGCTG
GACCTGCCGA TCGACGTGCA GATGGCAGAG ATCGAATTCG ACATCGACAC CTACGAGCCC
CTGCCCGTGG AGAAGCCCAA GGCCAGCCGC AAGCAGCTGG AAAAGGCCCT GGACATGCTG
ACCGCAGGCG AGCGCCCGCT GATCGTGGCC GGCGGCGGCG TCATCAACGC CGGCGCTTCC
GAGCAGCTGG TGGAACTGGC CGAACTGCTG GGCGTTCCGG TCATCCCCAC CCTGATGGGC
TGGGGCGCCA TCTCCGACGA CCACCCCCTG ATGGCCGGCA TGGTGGGCCT GCAGACCTCG
CACCGCTACG GCAACGAGAA CTACCTGCGC AGCGACTTCG TGATCGGCGT CGGCAACCGC
TGGGCCAACC GCCACACCGG AGGGCTGGAC ACCTACACGG CCGGCCGCAC GTTCGTGCAC
ATCGACATCG AGCCCACGCA GATCGGCCGC GTGTTCTCCC CGGACTTCGG CATCGCGTCC
GACGCCGGCG CAGCGCTGAC CGGGCTGGTT GAACTGGCCC GCGAACGCCA GGCCGCCGGA
TCCCTGCCGG ACTACTCCGC CTGGGCCGCC GAATGCCAGG AGCGCAAGGC CACCCTGCAC
CGCAAGACGC ACTTCGAGAA CATCCCCATC AAGCCGCAGC GCGTGTACGA GGAGATGAAC
AAGTCCTTCG GCAAGGACAC CACCTACGTT TCCACCATCG GCCTGTCCCA GATCGCCGGC
GCCCAGATGC TGCACGTTTT CGGCCCGCGC AAGTGGATCA ACGCCGGCCA GGCAGGGCCC
CTGGGCTGGA CCGCTCCGGC CGCCCTGGGC GTGGTGCGCG GCAAGCCGGA CGAGACCGTT
GTTGCTTTGT CCGGTGACTA CGATTTCCAG TTCATGATCG AGGAACTGGC CGTGGGCGCG
CAGTTCAACC TGCCGTACAT CCACGTGGTG GTGAACAACT CCTACCTGGG CCTGATCCGC
CAGTCCCAGC GCGGCTTCAA CATGGAACAG AACGTGTCCC TGGCTTTCGA GAACATCAAC
TCCCCGGAAA CAAACGGCTA CGGCGTGGAC CACGTCAAGG TGGCCGAGGG CCTGGGCTGC
AAGGCCGTCC GGGTGGAGAA CCCGAACGAT CTGGGTGCCG CCTTCGACAA GGCGAAGGCA
CTGGCCGGCG AGTTCAAGGT TCCCGTGGTG GTTGAAGTGA TCCTGGAAAA GATCACCAAC
ATCTCCATGG GCGTGGAAAT CAACGCGGTG AACGAGTTCG AGGAACTGGC CGAGACCAGC
GCCGACGCCC CCACCGCCAT CCTGGCGATG CAGGCCTAA
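
The GC content field can be recomputed from the gene sequence as listed. This is a minimal sketch, assuming the sequence is pasted exactly as shown (blocks of 10 bases separated by spaces and line breaks); `clean_sequence` and `gc_content` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example with the first line of the gene sequence only; paste the full
# listing to compare against the GC content field reported for the gene.
first_line = "ATGACCAAGA TGCGCACCGT AGACGCTGCC GTCGCGATCC TGGAAAAGGA GGGCGCCACT"
print(len(clean_sequence(first_line)), round(gc_content(first_line), 1))
```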
 
Protein sequence
MTKMRTVDAA VAILEKEGAT EAFGLPGAAI NPFYSAMRNH GGIRHTLARH VEGASHMADG 
YSRAADGNIG ICIGTSGPAG TDMITGLYAA WADSIPMLCI TGQAPVAKLH KEDFQAVDIE
SIAKPVTKMA MTILEPGQVP GAFQKAFQLM RSGRPGPVLL DLPIDVQMAE IEFDIDTYEP
LPVEKPKASR KQLEKALDML TAGERPLIVA GGGVINAGAS EQLVELAELL GVPVIPTLMG
WGAISDDHPL MAGMVGLQTS HRYGNENYLR SDFVIGVGNR WANRHTGGLD TYTAGRTFVH
IDIEPTQIGR VFSPDFGIAS DAGAALTGLV ELARERQAAG SLPDYSAWAA ECQERKATLH
RKTHFENIPI KPQRVYEEMN KSFGKDTTYV STIGLSQIAG AQMLHVFGPR KWINAGQAGP
LGWTAPAALG VVRGKPDETV VALSGDYDFQ FMIEELAVGA QFNLPYIHVV VNNSYLGLIR
QSQRGFNMEQ NVSLAFENIN SPETNGYGVD HVKVAEGLGC KAVRVENPND LGAAFDKAKA
LAGEFKVPVV VEVILEKITN ISMGVEINAV NEFEELAETS ADAPTAILAM QA
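
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes translation table 11, whose amino acid assignments match the standard genetic code (the tables differ in which codons may act as start codons). It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGACCAAGA TGCGCACCGT AGACGCTGCC"))
```

Running `translate` on the full gene sequence and comparing the result with the protein sequence (after removing its spaces and line breaks) gives a quick consistency check between the two listings.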