Gene Sare_5102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_5102 
Symbol 
ID5704070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5776775 
End bp5777758 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content71% 
IMG OID641274494 
ProductD-alanine--D-alanine ligase 
Protein accessionYP_001539835 
Protein GI159040582 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes 
TIGRFAM ID[TIGR01205] D-alanine--D-alanine ligase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.842129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000223224 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGGTACGA CCGCTGCCGA TCCGGCTGTC CTCTTGACCG ACCTGCGCGT GCTGGTACTC 
GCCGGCGGCC TTTCCTATGA ACGGGACGTC TCGCTGCGAT CCGGCCGGCG GGTGCTGGAC
GCGCTGCGTG CGGTGGGCGT GGAGGCGGAG CTACGCGACG CGGACGTCGC CCTGCTGCCG
TCACTCGTCG CCGATCCACC GGACGCGGTG GTCATCGCCC TGCACGGTGC CACTGGTGAG
GACGGTTCAC TTCGCGGTGT GCTGGACCTC TGCAACGTCC CGTACGTCGG CTGCGACGCC
CGCTCGTCAC GCCTCGCGTG GGACAAACCC TCGGCCAAGG CCGTGTTGCG GGAAGCGGGC
ATCCCCACCC CGGACTGGGT GGCACTACCT CATGATCGCT TCTCCGAGCT CGGTGCGGTG
GCGGTACTGG ACCGCATCGT CGACCGCTTG GGGCTCCCGC TGATGGTGAA GCCCGCGCAG
GGCGGCTCGG GTCTGGGCGC CGCCGTGGTC CGGGATGGTC CGGCCCTACC GGCCGCGATG
GTCGGTTGTT TCGCCTACGA CTCGACCGCC CTCGTCGAAC GCTACCTGCC CGGAACGGAC
GTGGCGGTAT CCGTGATCGA CCTCGGCGAG GGGCCGCAGG CCCTGCCGGC GGTGGAGATC
GTGCCCCGAA ACGGTGTGTA CGACTACGCC GCCCGGTACA CGGCCGGCCG TACCACCTGG
CACACGCCGG CCCGCCTGGA CACCGAGGTG GCCGAAGCGG TCGCCACGGT CGCCGTCGCC
GCCCACACCG CGCTCGGGTT GCGCGACCTC AGCCGGGTCG ACCTGATCGT GGATGCCGAC
CACCAGCCGC ACGTCCTCGG GGTGAACGTC GCACCCGGCA TGACGGAGAC CTCACTGCTA
CCGCTCGCGG CCCAGGCCGC GAGTCTCGAC TTCGGCCGAA TGATCGGAAC CTTGGTCTCT
CGGGCCGTTG CCCGGGCCAC CTGA
 
Protein sequence
MGTTAADPAV LLTDLRVLVL AGGLSYERDV SLRSGRRVLD ALRAVGVEAE LRDADVALLP 
SLVADPPDAV VIALHGATGE DGSLRGVLDL CNVPYVGCDA RSSRLAWDKP SAKAVLREAG
IPTPDWVALP HDRFSELGAV AVLDRIVDRL GLPLMVKPAQ GGSGLGAAVV RDGPALPAAM
VGCFAYDSTA LVERYLPGTD VAVSVIDLGE GPQALPAVEI VPRNGVYDYA ARYTAGRTTW
HTPARLDTEV AEAVATVAVA AHTALGLRDL SRVDLIVDAD HQPHVLGVNV APGMTETSLL
PLAAQAASLD FGRMIGTLVS RAVARAT