Gene Cag_0051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0051 
Symbol 
ID3747250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp58082 
End bp59512 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content51% 
IMG OID637772577 
ProductUDP-N-acetylmuramoylalanyl-D-glutamyl-2, 6-diaminopimelate-D-alanyl-D-alanyl ligase 
Protein accessionYP_378373 
Protein GI78188035 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0770] UDP-N-acetylmuramyl pentapeptide synthase 
TIGRFAM ID[TIGR01143] UDP-N-acetylmuramoyl-tripeptide--D-alanyl-D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.577094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAGCAA CCTTACAGCG TAACGATTTA GAAGCGGTTG GCGAATTAGT GTTTCACGGA 
GAAGCACCTC CATCATTCGA GTTAGCGGAA CCGCATGTGG TGATTGATTC GCGTGAAGTA
AGCGAAGGTG GACTTTTTGT TGCCTTGCAT GGTGAGCGCA CCGATGGACA CCGTTACGTT
AACGATGTGT TTCAGCATGG AGCTACGTGG GCAATGGTGA ATCGCTCATG GTATGAAGCC
GAAGGGCATC CTCTACCTCC GCACCATAAA GGGTTTCTTG TGGTTGAGGA TACCGTAGCT
GGTTTACAGC ACTTAGCAGT TCGTTACCGC AACACCTTTT CCATTCCCAT TGTTGCCATT
GGCGGCAGTA ATGGCAAAAC CACCACGAAA GAGATGGTGG CGGCAGTGTT AGCAAGTGAT
AGCAGTGCGG TAAGTATGAG CCAAGGGAAT CGCAACAATC ATCTTGGTGT ACCGCTAACG
CTTTTGCAAA TGCGCCACTC AACCGAGCGT GCAGTGGTTG AGGTGGGCAT TAACCATCCC
AATGAAATGG CTATGCTGGC TGAGCTGGTA GCACCAACCC ATCTGCTCTT AACCAACATT
GGGCATGAGC ACCTTGAATT TCTTGGCGAT CTTGATGGGG TTGCAAAAGC CGAAACGCAG
CTCTACGATT ATGCTCGGCA GCATGGAGCT ACTGCCTTTA TTAACGCCGA TGATGAGCGC
TTACGTGCGG CTGCCGAAGG AATGCCTTTC CGCATTGATT ATTCATTACA TGAAGCCGTT
GACTCGCTTG TTTGGGCTGA GGATGTTACG GTTGAGCGTG ATGGTCGTCT CTCGTTTTTA
TTAGTAACCA AAGGGCGAAG TGAGCAAGAG CGCCTTCGTT TACATTTTAC AGGGCGCCAT
AATGTGCTGA ACGCTGTAGC AGCGGCAACA GTTGGCTTGC AGTTTGGCAT TTCGCTCCAT
CACATTTGCG AAGGACTTGC TGGGTTGCAG CCTGCACCGG GCTGGAAGCG CCTTGAAGTG
GTGGAGGTTG GCGGCGTGCG TTTGCTTAAC GACACCTACA ACGCTAACTC CGACTCCATG
CGTCGCGCCA TTGATGCCCT TTGCGATATG CCATGCAATG GACGCCGTAT TGCGGTGGTT
GGCGATATGC TTGAATTAGG TGATGCGGCT GAAGTTGAAC ATCAAGCGGT TGCCCACTAT
ATTCAGCGCT CTCTGGTTAC CAAGCTCTTT ACCTTTGGCA CGCAAGCTGC CGCTATTTGC
CGCCATGCGC CTGAGTTGTG CTATGGCTCG TATAGCGAGC ATAGTGCGTT GCTTGATGAC
CTTTTACATG TACTTTCAGA GGGTGATGTG GTTTTGGTAA AGGGGTCGCG CGGTATGCGT
CTTGAGCTGA TTGTTGATGG AGTGGTTCAT GCTTTGCAAC CAAAATCATA G
 
Protein sequence
MKATLQRNDL EAVGELVFHG EAPPSFELAE PHVVIDSREV SEGGLFVALH GERTDGHRYV 
NDVFQHGATW AMVNRSWYEA EGHPLPPHHK GFLVVEDTVA GLQHLAVRYR NTFSIPIVAI
GGSNGKTTTK EMVAAVLASD SSAVSMSQGN RNNHLGVPLT LLQMRHSTER AVVEVGINHP
NEMAMLAELV APTHLLLTNI GHEHLEFLGD LDGVAKAETQ LYDYARQHGA TAFINADDER
LRAAAEGMPF RIDYSLHEAV DSLVWAEDVT VERDGRLSFL LVTKGRSEQE RLRLHFTGRH
NVLNAVAAAT VGLQFGISLH HICEGLAGLQ PAPGWKRLEV VEVGGVRLLN DTYNANSDSM
RRAIDALCDM PCNGRRIAVV GDMLELGDAA EVEHQAVAHY IQRSLVTKLF TFGTQAAAIC
RHAPELCYGS YSEHSALLDD LLHVLSEGDV VLVKGSRGMR LELIVDGVVH ALQPKS