Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1901 |
Symbol | |
ID | 4445555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2139544 |
End bp | 2140731 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639689711 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_831383 |
Protein GI | 116670450 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA CCAACCTCGA CACCGTCGTT GTCGACTTCT ACCGGACGAA CCTGATCTTC GTTCGGCTCA GCACCGACGA GGGACTCACT GGCATCGCCG AGGCAACCCT CGAAGGCCAG GAACATGCGG TCCGCGGCGC CGTCGCCGTG CTCGCCGACG CGGTCCGCGG CAAGGACCCA ACCCGGATTT CGCAAACCAT CTATGAACTC AACCGCGATG CCTACTGGCG CGGCGGGCCG GTCTCGATGA CAGCGCTCAG CGCCCTCGAA ATGGCAATGT GGGACGTCTC CGCCCGCGCA CTTGGCGTCC CTGTCCACCG CATGCTGGGT GGACAGGTCC GCGACAGAGT CCGCGCTTAC GCCAACGGCT GGTTCTCCGG AGCCAAAACG GCTGAGGACT TCGCCGAGGC AGCCGTCCAG ACGGTCGCCC AAGGCTTCCG CGGACTCAAG TGGGATCCAT TCGAAGCCGC GGACCTCACC CTCGAGCCGC GGGACCTGCG GCGCATGCTC GAGCCCGTCG CCGCTGTCCG CGAGGCAGTG GGCGACGACG TCGAGCTATT CATCGAAGGA CACGGCCGGT TCGATGTACC GACAGCGATC CGGGTCGCAC GCGAAATCGA GCAGTTCCAG CCGGTGTTCT TCGAAGAACC ATGCCCGCCG GACGGGATCG ACGCGCTCCT TGAGATACGC TCCAAATCTC CTGTACCGAT CGCTGCCGGG GAACGTTGGT TCGGACGGAA CACCTTTGTC CCTGCCCTCG CGCGGAATGC CGTGGACTAC ATACAGCCGG ACGTCACGCA CGCCGGCGGC CTGCTGGAAC TGTCCTTCAT CTCCACGCTC GCCGCGGCCC ATTACATTCC GTTTGCACCG CATAACCCAA GCGGACCGCT CAGTACCGCG GCGACGTTGC AGCTCGGCGC GATGCTGCCC AATTTCCGCT ATCTGGAAAT CATGGCCTCG GACGTACCCT GGCGAACCGA GATCTCCAAC GAGCGCCTCC AGCTGACGGA GGAGGGTGAC ATCCTCATTC CTGAAGGCAT CGGTCTGGGC ATCGAACTTG ACTTCGAAGC GATCGCCGAA CACCCCTACA CGCCACACCC GATGCGGATC TTCCATGATG CCGTCGCAGA CATCCGCCCC CCGGACGCCC GCTCCTACTT CAACCTCGAG CGCAGCCCGG CCATTTGA
|
Protein sequence | MKITNLDTVV VDFYRTNLIF VRLSTDEGLT GIAEATLEGQ EHAVRGAVAV LADAVRGKDP TRISQTIYEL NRDAYWRGGP VSMTALSALE MAMWDVSARA LGVPVHRMLG GQVRDRVRAY ANGWFSGAKT AEDFAEAAVQ TVAQGFRGLK WDPFEAADLT LEPRDLRRML EPVAAVREAV GDDVELFIEG HGRFDVPTAI RVAREIEQFQ PVFFEEPCPP DGIDALLEIR SKSPVPIAAG ERWFGRNTFV PALARNAVDY IQPDVTHAGG LLELSFISTL AAAHYIPFAP HNPSGPLSTA ATLQLGAMLP NFRYLEIMAS DVPWRTEISN ERLQLTEEGD ILIPEGIGLG IELDFEAIAE HPYTPHPMRI FHDAVADIRP PDARSYFNLE RSPAI
|
| |