Gene Acid345_4592 details


Gene Information

Locus tag: Acid345_4592
Symbol:
ID: 4071537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5439063
End bp: 5440244
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 58%
IMG OID: 637986632
Product: D-alanine--D-alanine ligase
Protein accession: YP_593666
Protein GI: 94971618
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes
TIGRFAM ID: [TIGR01205] D-alanine--D-alanine ligase
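
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 5439063
END_BP = 5440244
GENE_LENGTH_BP = 1182
PROTEIN_LENGTH_AA = 393


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5440244 - 5439063 + 1 gives 1182 bp, and 1182 / 3 - 1 gives 393 aa, matching the listed values.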


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.79248
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.646635
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCAAAGA AGATCCGGGT GGGCGTGTTG TTTGGTGGAC GCAGCGGCGA GCACGACGTC 
TCGCTGCTGT CGGCGGCGTC GGTGATCAAC GCAGTCGATA AGAAGAAGTT CGAGGTCGTT
CCGATCGGCA TCACCAAAGA AGGTAAGTGG CTCACGGCCT CGCATGCTGA GTCGTTGCTG
AATGGCAAGA AAGCCGAGCA GCAACTGCGC GCGGGTGATC CCGAGGCGAC GCCCGGTGCC
GCGGTGCTCG CGAAGGGCGA AGCTGTGCTG GTCCCTCCGG TTCCGCAGTC AGACCGCTCG
GCGATCGTTC CGTTTGAGAC CGATGCGGCA GGCCATTCCG CTTCGTCTAT CGATGTTGAC
GTCATCTTCC CCGTGCTCCA CGGCACCTTC GGCGAAGACG GCACCATCCA GGGACTCTTC
GAACTGGCGA ACATACCCTA CGTCGGCGCA GGCGTACTTG GCTCGGCGGC CGGCATGGAC
AAAGACATCA TGAAGGCGCT CTTCAAGAAC GCTGGCCTTC CGATCGTGAA GCACATGACC
GTGCTGCGCA GCCAGTGGCA ACGCGAGCCA AAGAAAGTGA AGAAGCTGGT CGAGTCGAAA
CTGAAGTATC CGGTGTTCGT AAAGCCGGCA AACCTCGGCT CGTCGGTCGG CATTTCCAAG
GCGCATGACT CGAAAGAGTT AGGGCCGGCG CTCGATGAGG CCGCGAAATT CGACCGCAAG
CTCGTCATCG AACAAGGCGT AGGCGGCAAG AAAAAGAAAG CGCGTGAGAT CGAGTGTTCC
GTCCTCGGCA ACGACGAGCC GAAGGCATCC GTCCCTGGCG AAATCGTGCC GATCAAAGAA
TTTTACGATT ATGCCGCCAA GTATCTGGAT GAAGGCTCGG CGCTCGAAAT CCCTGCGAAG
CTTCCCAAAT CCACACAGAA ACAGGTGCAG GAACTCGCAA TCCGCGCCTT TCAGGCGGTG
GATGGCAGCG GCCTGGGACG AGTGGATTTC CTCATCGATC CCAAGAGCAA CAAACTTTAT
GTCAACGAGA TCAATACGAT GCCGGGGTTC ACTTCGATCA GCATGTATCC CAAGTTGTGG
GAAGCCTCGG GCCTGAAATA TACGGACCTG ATTACGCGCT TGATTGAGTT GGCGCTGGAG
CGCCACGCCG ACAAACAAAA AAATCAGTAC ACCAAGGACT GA
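
The gene sequence is printed in space-separated groups of 10 bases, so it needs to be joined before any computation. A minimal sketch of how a GC fraction could be computed from such a block follows. The helper names are hypothetical, and only the first line of the sequence is included for brevity; the full block would have to be pasted in to compare against the listed GC content.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only (illustrative input).
first_line = "TTGGCAAAGA AGATCCGGGT GGGCGTGTTG TTTGGTGGAC GCAGCGGCGA GCACGACGTC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```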
 
Protein sequence
MAKKIRVGVL FGGRSGEHDV SLLSAASVIN AVDKKKFEVV PIGITKEGKW LTASHAESLL 
NGKKAEQQLR AGDPEATPGA AVLAKGEAVL VPPVPQSDRS AIVPFETDAA GHSASSIDVD
VIFPVLHGTF GEDGTIQGLF ELANIPYVGA GVLGSAAGMD KDIMKALFKN AGLPIVKHMT
VLRSQWQREP KKVKKLVESK LKYPVFVKPA NLGSSVGISK AHDSKELGPA LDEAAKFDRK
LVIEQGVGGK KKKAREIECS VLGNDEPKAS VPGEIVPIKE FYDYAAKYLD EGSALEIPAK
LPKSTQKQVQ ELAIRAFQAV DGSGLGRVDF LIDPKSNKLY VNEINTMPGF TSISMYPKLW
EASGLKYTDL ITRLIELALE RHADKQKNQY TKD
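
The gene sequence starts with TTG while the protein starts with M. This is consistent with translation table 11, where TTG can serve as an alternative start codon read as methionine. The sketch below shows one way to reproduce that behaviour. It is an assumption-laden illustration, not the method used to generate this record: the codon table is the standard assignment, the start codon set is the one commonly listed for table 11, and the function name is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons commonly listed for translation table 11 (assumption).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq_block: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq_block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "TTGGCAAAGA AGATCCGGGT GGGCGTGTTG TTTGGTGGAC GCAGCGGCGA GCACGACGTC"
print(translate_cds(first_line))  # MAKKIRVGVLFGGRSGEHDV
```

The output matches the first 20 residues of the protein sequence listed above. Input containing characters other than A, C, G, and T would raise a KeyError in this sketch.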