Gene Csal_2194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2194 
Symbol 
ID4026385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2469665 
End bp2471023 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content68% 
IMG OID637967399 
ProductUDP-N-acetylmuramoyl-tripeptide--D-alanyl-D- alanine ligase 
Protein accessionYP_574244 
Protein GI92114316 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0770] UDP-N-acetylmuramyl pentapeptide synthase 
TIGRFAM ID[TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCA CGAACGTGCA GACCATCGCG ACCTGGCTGG GGGCGCCACC GCCCGATGTC 
GACGCTCCCG TGACCGGTGT GACGACCGAT ACACGGCATG TTTCGCCGGG CGACGTGTTC
GTCGCCCTGG TCGGTGAGCG TTTCGACGGG CACGAGTTTC TCGAGCAGGC GCGGGCCAGC
GGGGCAGTGG CAGCGGTGGT ATCGCGTCGC GTGGCATCTC CGCTGGCCCA GATCGAAGTC
GCGGATACGC GTCTGGCGCT GGGAATACTG GCGCGTGCCG CACGGCGTGC CTGGTCGGGA
GAGCTGGTTG CAGTGACCGG CAACAGCGGC AAGACGACGG TCAAGGAGAT GCTGGCCGCG
ATCCTGTCGC GTGTCCATCC CACGCTGGCC ACGCGTGGCA ACCTCAACAA CGATATTGGT
GCCCCGCAGA CACTGCTCGC CCTGTCGTCG GCGCATCGTC GCGCGGTCAT CGAACTGGGC
GCCAATCACC TGGGCGAGAT CGCCTGGACC ACATCGCTGG CGATTCCCGA CGTGGCCGTG
ATCACCAACG TCACTGGCGC GCATGTCGGC GAGTTCGGCG GGATCGGTCG CATCGCCCAG
GCCAAGGCGG AAATCCTCCT CGGGCTCTCG GCATCGGGGA CGGCGGTGCT CAACCGCGAT
GATCGTTTCT ACCCCATGTG GCGTGAATTG GCCGGCGACG CCGAGGTGCT CGACTTTGGC
CTCGATCCGG CTGCCCGAGT GCGCGCCGAG GCGCTGGCCT GCGACGACGA CGGGCGTTAT
GCTTTCACGC TATTCGTCGA CGGCGTGTCG CTGGGGCGTG TTCGCCTGGC ATTGCTGGGC
CGCTACAACG TACGCAACGC GCTGGCCAGT GCCGCCGCTG CCTGGGCGCT GCGCGTGCCG
GCCGAGGACA TCGTCGCCGG CCTGGAGGCC TGCCGGACGA TGCCGGGGCG CATGATGAAC
GTGCCGGGCA TTCGCGGTAC GCGCCTGCTG GACGATACCT ACAATGCCAA TCCCGGCGCG
GTGCGAGCCG CGTTGGCCGT GCTCGCGGAG ATGCCGGCGC CACGCTGGTG TTTTCTCGGC
GCCCTGGGCG AGCTGGGCGC CGAGAGCGAA CGGCTGCACG CGGACCTTGG ACGCGTCGCG
CGGGAACTGA AGATCGACTT TCTCGGCACC TTCGGCGCCG ATGCTCGCCC GGCCGTGGCG
GCATTCGGTG ATCGCGGGTG TCATTTCGAT GATTGGGCGG CGCTGGTGCG CTATGCCCAC
GACCATCTTC CCCCCGGGGC CAGTGTCCTG GTCAAGGGAT CGCGCAGTGC CGGCATGGAA
CGCTTGATTG CCGACCTGCG CACGGATGCA CCAAGGTGA
 
Protein sequence
MNTTNVQTIA TWLGAPPPDV DAPVTGVTTD TRHVSPGDVF VALVGERFDG HEFLEQARAS 
GAVAAVVSRR VASPLAQIEV ADTRLALGIL ARAARRAWSG ELVAVTGNSG KTTVKEMLAA
ILSRVHPTLA TRGNLNNDIG APQTLLALSS AHRRAVIELG ANHLGEIAWT TSLAIPDVAV
ITNVTGAHVG EFGGIGRIAQ AKAEILLGLS ASGTAVLNRD DRFYPMWREL AGDAEVLDFG
LDPAARVRAE ALACDDDGRY AFTLFVDGVS LGRVRLALLG RYNVRNALAS AAAAWALRVP
AEDIVAGLEA CRTMPGRMMN VPGIRGTRLL DDTYNANPGA VRAALAVLAE MPAPRWCFLG
ALGELGAESE RLHADLGRVA RELKIDFLGT FGADARPAVA AFGDRGCHFD DWAALVRYAH
DHLPPGASVL VKGSRSAGME RLIADLRTDA PR