Gene Sare_0859 details

Gene Information

Locus tag: Sare_0859
Symbol:
ID: 5705124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 960190
End bp: 962028
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 71%
IMG OID: 641270378
Product: glutathione synthase
Protein accession: YP_001535768
Protein GI: 159036515
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
            [TIGR03103] GNAT-family acetyltransferase TIGR03103
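
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a protein length that excludes the terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(960190, 962028)
print(length_bp)                  # 1839
print(protein_length(length_bp))  # 612
```

Both results match the Gene Length (1839 bp) and Protein Length (612 aa) fields of this record.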


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00490863
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGAGGA CCCTCGCGAC CGGGGTGGCG CTGGCGGATC AGGTACGCGG CCGAAGTCGT 
CGGTTCGAGC GGGTCGGCCC CGGGGGCGAC CCGGTGGCCG CGGCCGCCGC GCCGGCCGGT
TCGGCCGAGG CGGACGACAC CGCTGACCCG CACGTCGAGG GCATGGTGCT CGACTGCGGC
TGGGGCCGGC TCGTGTTCGG CCAGACCTTC GCCGACCAGG CAGCCGTCGC CGACGTGTTG
CGCTCCGAGG CGGCCGGCGC CCGGGACATC TGCATCTATC TGCGCGACCC GCACGTGCTC
GTCTCGCGGT TGCCGGACGA GTTGTTCATC GACCCGTCAC TGACCTTCCG GCTGCCGCTA
CACGGCGAGG GACTCGCCGA CCCGGAAGTC CCCGGCCTGC GGATCCGTCC GCTGCAGGAC
GCGGGGGACG CGGAGGAGGT CAACCGGATC TACGCGGCCA ACAGCATGGT GACCGCGCCG
GTTGAGGTAC TCGTCGCCAA CGCCGCCACC GATGGTTTCC TGCACCTGGT CGCCGAGAAC
GCGACCGGCG AGATCGTCGG CACCATCACC GGTGTGGACC ACGTCGCCGT CTTCGACGAC
CCGGACCGGG GCGCGAGTCT CTGGTGCCTG ACCGTGGACT TCAACGCCGC TCCGCCCGGC
ACCGGCCAGG CGCTGATCAC CGAACTGGCC GCCCAACTGG TCAGGCGGGG GCTGGCGTAC
GTGGACCTGT CGGTGCTCGC CGAGAACGAA GGCGCCATCC GGCTCTACGA GCGGCTCGGC
TTCTACCGCA CCACCACGCT CTGCGTGAAA CGGAAGAACC CGATCAATGA ACGGCTGTTC
CTGCCCGCCA TGCCGGAGGG GTACGACGAG CTCAACCCGT ACGCACAGAT CGTCGCGGAC
GAGGCGATGC GCCGGGGAAT CCGGGTGGAG GTGACTGACC CGACCTGGGG TGAGCTGCGC
CTGACCAGTG GCGGCCGAAC GATCCTCACC CGCGAGTCAC TGTCCGAGCT GACCTCGGCG
GTCGCCATGA GCCGCTGTGA CGACAAGCGG GTCACCCGTC GAATCCTCAG CCAGGCCGGG
CTGTCCGTAC CGCGTGGCCG GACAGCCACC GGGGACGGGG CCGACGCGGC CTTCCTGGCC
GAGGTCGGCG AGTTGGTCGT CAAGCCGGCC CGGGGCGAGC AGGGCAAGGG GATCACGGTC
GGGGTGCGTA CGCCCGAGGC CCTGCACGCC GCCGTCGAAC TGGCGGCCCG GTTCTGCCCC
GAGGTGCTTC TCGAGGAGTT GTGCGCCGGT GAGGACCTGC GGGTGATCAT GATCGACCAC
GAGGTGGTGG CCGCCGCGGT TCGCCGGCCG GCGACGATCA TCGGTGACGG GGTACACGAC
GTCGCCGAAC TGATCGAGCG GCAGAGCCGT CGCCGCGCCG CCGCGACGGG CGGCGAGTCC
CGCATCCCAC TGGACGAGAT GACCCGCGAG GTGGTCGCCG AAGCCGGGTA CGCACTCACC
GACATCCTGC CGGAGGGGGA GCAGCTCATC GTGCGTCGGA CCGCGAACCT GCACACCGGG
GGCACGATCC ACGACGTCAC CGCGGTCCTG CACCCGGAGA TCGCCGAGGC GTGCGTGACC
GCGAGTCGCG CCCTGGACAT CCCGGTAGCC GGGCTTGACC TGCTGGTACC CACCACGGAG
GAGTCCGCGC ACGTCTTTCT CGAGGCGAAC GAACGGCCCG GCCTGGCCAA CCACGAACCG
CAGCCGACCG CCGAACGCTT CGTCGACCTC CTTTTTCCCG GGACCCGGGC ACCTCAACGC
CTCTGGTCGC CGGCGGGTGC GGCAAGCTCT GGAGTATGA
 
Protein sequence
MSRTLATGVA LADQVRGRSR RFERVGPGGD PVAAAAAPAG SAEADDTADP HVEGMVLDCG 
WGRLVFGQTF ADQAAVADVL RSEAAGARDI CIYLRDPHVL VSRLPDELFI DPSLTFRLPL
HGEGLADPEV PGLRIRPLQD AGDAEEVNRI YAANSMVTAP VEVLVANAAT DGFLHLVAEN
ATGEIVGTIT GVDHVAVFDD PDRGASLWCL TVDFNAAPPG TGQALITELA AQLVRRGLAY
VDLSVLAENE GAIRLYERLG FYRTTTLCVK RKNPINERLF LPAMPEGYDE LNPYAQIVAD
EAMRRGIRVE VTDPTWGELR LTSGGRTILT RESLSELTSA VAMSRCDDKR VTRRILSQAG
LSVPRGRTAT GDGADAAFLA EVGELVVKPA RGEQGKGITV GVRTPEALHA AVELAARFCP
EVLLEELCAG EDLRVIMIDH EVVAAAVRRP ATIIGDGVHD VAELIERQSR RRAAATGGES
RIPLDEMTRE VVAEAGYALT DILPEGEQLI VRRTANLHTG GTIHDVTAVL HPEIAEACVT
ASRALDIPVA GLDLLVPTTE ESAHVFLEAN ERPGLANHEP QPTAERFVDL LFPGTRAPQR
LWSPAGAASS GV
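
The relationship between the gene sequence and the protein sequence above can be illustrated by translating codons directly. The following is a minimal Python sketch, not the database's own method. It assumes the codon assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its permitted start codons), and it defines GC content as the percentage of G and C bases. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases and the final line of the gene sequence in this record.
print(translate("ATGTCGAGGA CCCTCGCGAC CGGGGTGGCG"))            # MSRTLATGVA
print(translate("CTCTGGTCGC CGGCGGGTGC GGCAAGCTCT GGAGTATGA"))  # LWSPAGAASSGV
print(gc_percent("ATGTCGAGGA CCCTCGCGAC CGGGGTGGCG"))
```

The two translated fragments match the start and end of the protein sequence listed above. The final line translates in frame because the 1800 bases that precede it are a multiple of three. Applying `gc_percent` to the full gene sequence gives a value that can be compared with the GC content field. The sketch raises `KeyError` on ambiguous bases such as N, which do not occur in this sequence.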