Gene Dshi_2447 details

Gene Information

Locus tag: Dshi_2447
Symbol:
ID: 5714103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2589923
End bp: 2591098
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 62%
IMG OID: 641268370
Product: putative mandelate racemase/muconate lactonizing protein
Protein accession: YP_001533782
Protein GI: 159044988
COG category:
    [M] Cell wall/membrane/envelope biogenesis
    [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
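
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2589923
end_bp = 2591098
gene_length_bp = 1176
protein_length_aa = 391


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions, 2591098 - 2589923 + 1 = 1176 bp, and 1176 / 3 - 1 = 391 aa, which agrees with the listed values.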


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.549803
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGA TCAAATCCGT CCGTACGCGC GTCTGGAACT GGACCGGCCC GACCGTCCCG 
CCGCAGGGCA ATTTCTGCAC CAATGCGTCC GATGCGCTGT GGATCCAGGG CGACGCCATG
GCCTCCTTCC GGTTCCACCA ATGGCTGACC TGCGAGGTCG AGACCGAGGA TGGCACCATC
GGGATCGGCA ACGCGGCGCT GGCGCCCAAC GTCGTCAAAC AGGCCATCGA CGAATGGTAT
GCGCCCCTGG TCATCGGCGA GGACCCGTTC GATTATGCCT ACCTGTGGGA AAAGATGTAT
CGCCGCACCC ATGCCTGGGG CCGCAAGGGG ATCGGCATGA CCGCGATCAG TGCCATCGAC
ATCGCGATCT GGGACCTGAT GGGCAAGCTG GTCGGCAAGC CGGTGTTCAA GCTTCTGGGC
GGGCGCACCA AGGAGAAGAT CCCGGTCTAC TACTCCAAGC TTTACGCCGA CAGCATCCCC
GCGATGCAGG CAGAGGCCGA GGAGGCGCAA AAGCATGGCT ACCAAGGCTA CAAGACCCGG
TTCGGCTACG GTCCCAAGGA CGGCCCGGCG GGGATGCGCG AGAACCTCAA GCGCGTGGAG
GCCCTGCGCG AGGTACTGGG CTATGACGTG GACCTGATGC TTGAGTGCTA CATGGGCTGG
AACCTCGATT ACACCAAGCG GATGCTGCCC AAACTGGAGC GGTTCGAGCC GCGCTGGCTC
GAAGAGCCGG TGATTGCCGA CGACGTGGCG GGCTATGCGG AGCTGAACGC CATGGGGATC
GTGCCGATCT CGGGCGGGGA GCATGAATTC AGCGTCATGG GCTGTGCGGA GTTGATCAAC
CGCAAGGCCG TCAGCGTGCT GCAATACGAC ACCAACCGGG TGGGCGGCAT CACCGCGGCG
CAGAAGATCA ACGCCATCGC CGAGGCCGCG CAGATCATCG TCATCCCCCA TGCGGGCCAG
ATGCACAACT ACCACCTGAC CATGGCCAAC ATGAACTGCC CCATCAGCGA GTATTTCCCT
GTCTTCGACG TCGAAGTGGG CAATGAGCTG TTCTACTACA TCTTCGACGG CGATCCCGAG
GCGGTGGACG GCTATCTGCA ACTGGATGAC GACACGCCCG GGCTCGGGAT CACCATCAGC
GACGCTCACC TGAAACATTT CGAGATCACA GAATGA
 
Protein sequence
MTKIKSVRTR VWNWTGPTVP PQGNFCTNAS DALWIQGDAM ASFRFHQWLT CEVETEDGTI 
GIGNAALAPN VVKQAIDEWY APLVIGEDPF DYAYLWEKMY RRTHAWGRKG IGMTAISAID
IAIWDLMGKL VGKPVFKLLG GRTKEKIPVY YSKLYADSIP AMQAEAEEAQ KHGYQGYKTR
FGYGPKDGPA GMRENLKRVE ALREVLGYDV DLMLECYMGW NLDYTKRMLP KLERFEPRWL
EEPVIADDVA GYAELNAMGI VPISGGEHEF SVMGCAELIN RKAVSVLQYD TNRVGGITAA
QKINAIAEAA QIIVIPHAGQ MHNYHLTMAN MNCPISEYFP VFDVEVGNEL FYYIFDGDPE
AVDGYLQLDD DTPGLGITIS DAHLKHFEIT E
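
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the space-grouped layout used above and writes out the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last stretches of the gene are used to keep the example short.

```python
# Standard codon assignments in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 36 bases of the gene sequence above.
first_bases = "ATGACCAAGA TCAAATCCGT CCGTACGCGC"
last_bases = "GACGCTCACC TGAAACATTT CGAGATCACA GAATGA"

print(translate(first_bases))  # MTKIKSVRTR
print(translate(last_bases))   # DAHLKHFEITE
```

The two outputs match the start and end of the protein sequence listed above. Applying `gc_fraction` to the full gene sequence would give the value to compare with the reported GC content.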