Gene Cagg_2856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2856 
Symbol 
ID7267562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3507042 
End bp3508130 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content57% 
IMG OID643567677 
ProductD-alanine/D-alanine ligase 
Protein accessionYP_002464154 
Protein GI219849721 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.227321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC CAATCACAAT TGCCGTCATT TTTGGCGGTC AAAGTGGCGA ACACGAAGTA 
TCGCTCGTGT CGGCGCATGC CGTGATGAGC AATCTCGATC CGGATCGCTA CCACATTGAA
GCGCTTGGGA TCGGTAAAGA TGGCCGCTGG TGGCATGGTC CCGGTGCGTT GGCGATGCTC
ATGGCAGCCG CCGATCCGGC CCGTTTACCG GCCAGTGCGA CAATGGTTGC TCCCGGCCCG
GTGCGCGAAC TTGGTCGTGT GGGCGAACCG GGTTGGTCGT TGCCTCCCGC TGATGTCATC
TTCCCGGTGC TGCATGGTCC TTATGGTGAA GATGGGACTA TTCAAGGTCT CTTTGAACTG
GCCGGGCACC CTTACGTTGG GTGTGGAGTG GCGGCAAGTG CGGTAGGGAT GGATAAGGCA
TTCATGAAGG CTGCGTTTGC GGCTGCCGAT TTGCCGATCT TACCGTGGGT GCTGGTGCGT
CGGCATGAAT TGGCCGCTCT TGAGATGGTC TGTGAGCGGA TAGAGGCAAC GTTGCACTTC
CCGTTGTTTG TTAAACCGGC CAATTTAGGG AGCAGTGTGG GGGTGAGTAA GGTCCGTGAT
CGAGCTGAAC TGATCACCGG CTTGCACGAA GCTGCCCGTC ATGACCGACG GATCGTCGTG
GAGCAGGGCA TTTCGGCGCG CGAAATAGAA ATCTCGGTAT TAGGAAACGA ACACGTTGAG
GTGAGTATTC CCGGTGAAAT TATTCCTGCC GGTGAGTGGT ACGACTACCA CGCGAAGTAT
ATCGCCGGTG GTTCACAGAC GATAGTACCG GCGGCATTAA CCGAAATGCA GATTGCACAG
GTGCAAACGC TCGCCAAACG CGCGTTTCAA GCGATCGATG GGGCTGGGCT GGCCCGAGTG
GATTTTCTCC TTGACCGTGA ACAAGGCACG CTCTGGTTGA ACGAGGTCAA TACCATGCCC
GGCTTCACTC CGATCAGTAT GTACGCCAAA ATGTGGGAGG CTAGCGGCTT GCCCTATCCA
CAGTTGCTCG ACCGACTAAT CGAGTTGGCA TTGGCTCGGT CGGCGCGTGC CGGCTACGTA
AAGGTGTGA
 
Protein sequence
MSKPITIAVI FGGQSGEHEV SLVSAHAVMS NLDPDRYHIE ALGIGKDGRW WHGPGALAML 
MAAADPARLP ASATMVAPGP VRELGRVGEP GWSLPPADVI FPVLHGPYGE DGTIQGLFEL
AGHPYVGCGV AASAVGMDKA FMKAAFAAAD LPILPWVLVR RHELAALEMV CERIEATLHF
PLFVKPANLG SSVGVSKVRD RAELITGLHE AARHDRRIVV EQGISAREIE ISVLGNEHVE
VSIPGEIIPA GEWYDYHAKY IAGGSQTIVP AALTEMQIAQ VQTLAKRAFQ AIDGAGLARV
DFLLDREQGT LWLNEVNTMP GFTPISMYAK MWEASGLPYP QLLDRLIELA LARSARAGYV
KV