Gene SeAg_B3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B3101 
Symbol 
ID6796951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp3027967 
End bp3029307 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content55% 
IMG OID642777261 
Productglucarate dehydratase 
Protein accessionYP_002147870 
Protein GI197250815 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR03247] glucarate dehydratase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACTC AATTTACGAC GCCTGTAGTG ACTGAAATGC AGGTTATCCC GGTTGCGGGT 
CATGACAGTA TGCTGATGAA CCTGAGCGGC GCGCACGCCC CGTTCTTCAC GCGCAACATC
GTCATTATTA AAGATAACTC CGGCCATACC GGGGTCGGCG AGATTCCGGG CGGCGAAAAA
ATCCGCAAAA CGCTGGAAGA TGCGATCCCA CTGGTGGTGG GAAAAACGCT GGGTGAATAT
AAAAATGTCC TGACCGCCGT TCGCAACCAG TTTGCCGATC GCGATGCGGG CGGACGCGGT
TTACAAACGT TCGATCTCCG TACCACTATC CATGTGGTGA CTGGTATTGA AGCGGCAATG
CTTGATCTTT TGGGCCAACA CCTGGGCGTC AACGTCGCTT CGCTGTTAGG CGACGGTCAG
CAGCGCAGCG AAGTCGAAAT GCTGGGTTAT CTGTTCTTTG TCGGCAATCG CAAGGCCACG
CCGCTGCCGT ATCAGAGCCA GCCGGATGAG CAATGCGACT GGTATCGTCT GCGCCATGAA
GAGGCGATGA CGCCGGAAAC GGTAGTACGT CTGGCGGAAG CCGCCTATGA AAAATACGGC
TTCAACGACT TCAAACTGAA AGGCGGCGTG CTGGCGGGCG AAGAAGAGGC CGAGTCAATC
GTGGCGCTGG CGAAACGTTT CCCACAAGCG CGCGTTACGC TCGATCCAAA CGGTGCCTGG
TCGTTGAACG AAGCGATCAG CATTGGTAAA TACCTGAAAG GTTCTCTGGC CTATGCAGAA
GATCCGTGCG GCGCGGAGCA GGGTTTTTCT GGTCGTGAAG TGATGGCGGA ATTCCGTCGC
GCGACCGGAT TACCGACGGC CACCAATATG ATAGCGACCG ATTGGCGTCA AATGGGGCAT
ACGCTGTCGC TGCAATCTGT AGATATCCCG CTGGCGGACC CGCACTTCTG GACTATGCAA
GGCTCTGTAC GCGTGGCGCA AATGTGTCAT GAGTTCGGTC TGACCTGGGG ATCGCACTCT
AACAACCACT TTGATATTTC GTTGGCGATG TTTACCCATG TTGCCGCGGC GGCGCCGGGC
AAGATCACCG CGATCGATAC CCACTGGATC TGGCAGGAAG GCAACCAACG TCTGACTAAA
GAACCGTTTG AAATTAAAGG CGGCATGGTG CAAGTACCGG CCAAACCGGG TCTGGGCGTT
GAGCTCGATA TGGATCAGGT GATGAAAGCG CATGAGCTCT ATCAAAAGCA TGGCTTAGGC
GCGCGTGACG ACGCGATGGG AATGCAGTAC TTAATTCCTG GCTGGACGTT TGATAATAAG
CGTCCTTGCA TGGTGCGTTA A
 
Protein sequence
MSTQFTTPVV TEMQVIPVAG HDSMLMNLSG AHAPFFTRNI VIIKDNSGHT GVGEIPGGEK 
IRKTLEDAIP LVVGKTLGEY KNVLTAVRNQ FADRDAGGRG LQTFDLRTTI HVVTGIEAAM
LDLLGQHLGV NVASLLGDGQ QRSEVEMLGY LFFVGNRKAT PLPYQSQPDE QCDWYRLRHE
EAMTPETVVR LAEAAYEKYG FNDFKLKGGV LAGEEEAESI VALAKRFPQA RVTLDPNGAW
SLNEAISIGK YLKGSLAYAE DPCGAEQGFS GREVMAEFRR ATGLPTATNM IATDWRQMGH
TLSLQSVDIP LADPHFWTMQ GSVRVAQMCH EFGLTWGSHS NNHFDISLAM FTHVAAAAPG
KITAIDTHWI WQEGNQRLTK EPFEIKGGMV QVPAKPGLGV ELDMDQVMKA HELYQKHGLG
ARDDAMGMQY LIPGWTFDNK RPCMVR