Gene Arth_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1901 
Symbol 
ID4445555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2139544 
End bp2140731 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content64% 
IMG OID639689711 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_831383 
Protein GI116670450 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATCA CCAACCTCGA CACCGTCGTT GTCGACTTCT ACCGGACGAA CCTGATCTTC 
GTTCGGCTCA GCACCGACGA GGGACTCACT GGCATCGCCG AGGCAACCCT CGAAGGCCAG
GAACATGCGG TCCGCGGCGC CGTCGCCGTG CTCGCCGACG CGGTCCGCGG CAAGGACCCA
ACCCGGATTT CGCAAACCAT CTATGAACTC AACCGCGATG CCTACTGGCG CGGCGGGCCG
GTCTCGATGA CAGCGCTCAG CGCCCTCGAA ATGGCAATGT GGGACGTCTC CGCCCGCGCA
CTTGGCGTCC CTGTCCACCG CATGCTGGGT GGACAGGTCC GCGACAGAGT CCGCGCTTAC
GCCAACGGCT GGTTCTCCGG AGCCAAAACG GCTGAGGACT TCGCCGAGGC AGCCGTCCAG
ACGGTCGCCC AAGGCTTCCG CGGACTCAAG TGGGATCCAT TCGAAGCCGC GGACCTCACC
CTCGAGCCGC GGGACCTGCG GCGCATGCTC GAGCCCGTCG CCGCTGTCCG CGAGGCAGTG
GGCGACGACG TCGAGCTATT CATCGAAGGA CACGGCCGGT TCGATGTACC GACAGCGATC
CGGGTCGCAC GCGAAATCGA GCAGTTCCAG CCGGTGTTCT TCGAAGAACC ATGCCCGCCG
GACGGGATCG ACGCGCTCCT TGAGATACGC TCCAAATCTC CTGTACCGAT CGCTGCCGGG
GAACGTTGGT TCGGACGGAA CACCTTTGTC CCTGCCCTCG CGCGGAATGC CGTGGACTAC
ATACAGCCGG ACGTCACGCA CGCCGGCGGC CTGCTGGAAC TGTCCTTCAT CTCCACGCTC
GCCGCGGCCC ATTACATTCC GTTTGCACCG CATAACCCAA GCGGACCGCT CAGTACCGCG
GCGACGTTGC AGCTCGGCGC GATGCTGCCC AATTTCCGCT ATCTGGAAAT CATGGCCTCG
GACGTACCCT GGCGAACCGA GATCTCCAAC GAGCGCCTCC AGCTGACGGA GGAGGGTGAC
ATCCTCATTC CTGAAGGCAT CGGTCTGGGC ATCGAACTTG ACTTCGAAGC GATCGCCGAA
CACCCCTACA CGCCACACCC GATGCGGATC TTCCATGATG CCGTCGCAGA CATCCGCCCC
CCGGACGCCC GCTCCTACTT CAACCTCGAG CGCAGCCCGG CCATTTGA
 
Protein sequence
MKITNLDTVV VDFYRTNLIF VRLSTDEGLT GIAEATLEGQ EHAVRGAVAV LADAVRGKDP 
TRISQTIYEL NRDAYWRGGP VSMTALSALE MAMWDVSARA LGVPVHRMLG GQVRDRVRAY
ANGWFSGAKT AEDFAEAAVQ TVAQGFRGLK WDPFEAADLT LEPRDLRRML EPVAAVREAV
GDDVELFIEG HGRFDVPTAI RVAREIEQFQ PVFFEEPCPP DGIDALLEIR SKSPVPIAAG
ERWFGRNTFV PALARNAVDY IQPDVTHAGG LLELSFISTL AAAHYIPFAP HNPSGPLSTA
ATLQLGAMLP NFRYLEIMAS DVPWRTEISN ERLQLTEEGD ILIPEGIGLG IELDFEAIAE
HPYTPHPMRI FHDAVADIRP PDARSYFNLE RSPAI