Gene Sare_4496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4496 
Symbol 
ID5707382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5081093 
End bp5084074 
Gene Length2982 bp 
Protein Length993 aa 
Translation table11 
GC content69% 
IMG OID641273910 
Productglycyl-tRNA synthetase, beta subunit 
Protein accessionYP_001539259 
Protein GI159040006 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0751] Glycyl-tRNA synthetase, beta subunit
[COG0752] Glycyl-tRNA synthetase, alpha subunit 
TIGRFAM ID[TIGR00211] glycyl-tRNA synthetase, tetrameric type, beta subunit
[TIGR00388] glycyl-tRNA synthetase, tetrameric type, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.957039 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACC TGACCATGCA GGACGCGCTA TCCCGACTCG CCGCCTACTG GGGTGGACTG 
GGCTGCCTGA CCGTGCAACC GATGAACACC GAGGTCGGAG CGGGCACCCT CAACTCAGCG
ACCGCCCTGC GGGTCCTCGG TCCGGAGCCG TGGAAGGTCG CCTACGTCGA GCCGTCGGTA
CGCCCGGATG ACGCCCGATA CGGCCAGAAC CCCAACCGGA TCCAGTGCCA CACCCAGTTC
CAGGTGATCC TCAAGCCCGA GCCCGGCAAC GCCCAGGAGC TCTACCTCGG CTCCCTCCAA
GCGCTCGGTA TCGACATCAC CAAGCACGAC GTGCGATTCG TGGAGGACAA CTGGGCGTCC
CCGGCGCTGG GCGCGTGGGG TCTGGGCTGG GAGGTGTGGC TGGACGGCCT GGAGATCACC
CAGTTCACCT ACTTCCAGCA GGCCGGTGGC ATCACACTCA ACCCCGTCTC GGTGGAGATC
ACCTACGGCA TGGAGCGCAT CCTCATGGCG CTGCAAGGAG TACGCCATTT CAAGGACATC
ACGTACGCGC CCGGCATCAG CTACGGCGAG GTGTTCGGGC AGGCCGAGTA CGAGATGTCG
CGCTATTACC TCGACGACGC CGACGTGCCC ACCAACCGGG AGTTGCTGTC GGCGTACGCG
GCCGAAGCCC AGCGGATGAT CGACGCCGGT CTGCCGATCC CGGCGCACTC GTTCGTGCTG
AAGATGTCGC ACGCCTTCAA CGTCCTCGAC GCCCGCGGCG CCGTTTCCAC CGCCGACCGG
GCCAGCGAGT TCGCCCGCAT GCGCCGCCTG TCCAACGCCG TCGCCAAGCT GTGGATAGAG
ACCCGGGAGA AGGAAGGCTT CCCGCTCGGG CAGGTGGTGC CGCCAGAGAT GACAGCCACC
ACCTGGCGTG AGCAGCCGGA GCAGACCGGG CCACCGCTGC TCGTCTTCGA AGTCGGCGTC
GAGGAGATGC CGCCCGCCGA GGCCCGCGAC GCGCGCCAGC AGGTCGAGAA CGCACTGACC
GCACGGCTCG CGGCAGGCCG GCTCCCGCAC GGCGCGATCC GCGTCCACGC CACGCCCCGG
CGCATCACCG CCATCGTCAC CGACGTCGCC GACCGTGAAA CCGACGCCGT CACAACCATG
CGCGGCCCAC GCGTCTCGGT CGCCTTCAAG GACGGCACGC CGACTCCAGC GGCCCAAGGG
TTCGCACGCT CCCACGGCGT CAGCGCGGAG GAACTGGCCA CCATCGTCGT CGACGGACAG
GAGTATCTCT GCGTCCACCG CGAGGAGCCC GGCCGTCCCG CCGTCGAAGC GCTCGCCGAG
ACCCTGTCCA CGGTCGTTGG CGGCCTGCGA TCGGCGAAAA ACATGCGCTG GAGCGATCCC
GAGCTGGTGT TCACCCGGCC CATCCGCTGG CTGCTGGCGA TGTGGGGCAA CGCGCCGGTT
CCGGTCAATG CCTCCCGCCT GCGGGCAGAC ACCTCGACAC GAGTGCACCG CAACGCCGAC
CAGCCGATCG TGCCCGTCAG CTCGGCCGGC GCGTACCTGC CAGCACTCGC GGCTGCGGGG
ATCGTGCTCG ACGAGAGGGC CCGCAGGCAG GTCATCATCG ACGCGGCCCG CACGGCGGCT
GCCACCGTGG GCGGCAGCAT CGACGTCGAC GGCGAGGCGC CCCTACTCGA CCAGATCACG
TTCCTGGTCG AGCAACCGAC CGCCATCCTG GGCGACTTCG ACCCCGGCTA CCTCGGACTA
CCCGACGCGA TCCTCACCAC GGTGATGCGC AAGCACCAGC GCTACCTACC CGTACGCGAC
ACCGACGGCC AGCTCCTGCC GCACTTCGTC GTGATCGCCA ACGGCGTGAT CGACCGGGCT
ACCGTCCAGG CAGGCAACGA GGCCGTGCTG CGGGCCCGGT TCGAAGACGC GGCGTTCTTC
TACCGCGCCG ACCAGACCGT CCCGATCGCC ACCATGCGCA AACGCCTGCA CCGGCTCACC
TTCACCGACA AACTCGGCTC CATGGCCAAC CGGGCCGACC GCATCGCAGG TCTCGCCCAA
GCGCTCGCGT CTGACGTGTC CCTGAAGGAC AGCGAGCGTA CGACGCTGGA ACGAGCCGCG
CAGATCGTCA AGTTCGACCT CGGGTCACAG ATGGTCACCG AGATGACCAG CCTCGCCGGC
ATCATGGGCC GCGAGTACGC GCTGCACGCC GGCGAGACCG CCGACGTTGC CGCCGCGATG
TTCGAGGCCG AGCTACCCCG CCACGCCGAC GACCAACTCC CGCACAGCAC GCCGGGCGCC
CTGCTGGCCC TGGCAGACAA GCTCGACTCG TTAATCGGCC TCGCCGCCAC AGTGGGCCTG
CCCACCAGCA GCAGCGACCC GTTCGCGCTG CGCCGGGCCG CAGTCGGGAC GATCGCGATC
CACCGCGGCA CGCCCGCCCT CGCCGGGCTG TCCCTGCGTG ACGCCCTCGC CGAAGCCGCG
CGCCTGCAAC CCGTGCCCGT CACCAGCGAG ACGATCGACG CCGTCAGCGA GTTCGTGACA
CGGCGGCTGG AGCAGCAGCT CCTCGACGAG GGGCACGCCG TCGACCGAGT GCGCGCAGTC
ATCATTCACG CCGACAAGCC CAGCACCGCC GACCGGGTGC TGCGCCAACT GACCAATCTC
GCCGATACGC CGGACTTCCG GCGGTTCGCC GCCGCCCTGC AGCGAGCCCG GCGCATCGCG
ACGACCGGCG GCGCTGAGTA CGACCCGGCT GCGCTGACCG AACCGGCGGA GCTGCATCTG
CACGAGACCG TCAGCAAGTC AGCTGGCGAG CTTCCTGAAC ACGCAGACCT CGGTGACCTT
GTCGCCGCCG GTGGTGCCCT GCCTGCCGCA GTGGACACGT CCTTCGACGG CATCCTCGTG
ATGGCTGACG ATCCGCAGGT GCGGGCACGC CGCTTGGGCT TGGTCGGGGC GGTCGCCGCG
CTCGGTGACG GCGTGCTCGA CTGGCAGGCG CTTCGCCTGT AG
 
Protein sequence
MTDLTMQDAL SRLAAYWGGL GCLTVQPMNT EVGAGTLNSA TALRVLGPEP WKVAYVEPSV 
RPDDARYGQN PNRIQCHTQF QVILKPEPGN AQELYLGSLQ ALGIDITKHD VRFVEDNWAS
PALGAWGLGW EVWLDGLEIT QFTYFQQAGG ITLNPVSVEI TYGMERILMA LQGVRHFKDI
TYAPGISYGE VFGQAEYEMS RYYLDDADVP TNRELLSAYA AEAQRMIDAG LPIPAHSFVL
KMSHAFNVLD ARGAVSTADR ASEFARMRRL SNAVAKLWIE TREKEGFPLG QVVPPEMTAT
TWREQPEQTG PPLLVFEVGV EEMPPAEARD ARQQVENALT ARLAAGRLPH GAIRVHATPR
RITAIVTDVA DRETDAVTTM RGPRVSVAFK DGTPTPAAQG FARSHGVSAE ELATIVVDGQ
EYLCVHREEP GRPAVEALAE TLSTVVGGLR SAKNMRWSDP ELVFTRPIRW LLAMWGNAPV
PVNASRLRAD TSTRVHRNAD QPIVPVSSAG AYLPALAAAG IVLDERARRQ VIIDAARTAA
ATVGGSIDVD GEAPLLDQIT FLVEQPTAIL GDFDPGYLGL PDAILTTVMR KHQRYLPVRD
TDGQLLPHFV VIANGVIDRA TVQAGNEAVL RARFEDAAFF YRADQTVPIA TMRKRLHRLT
FTDKLGSMAN RADRIAGLAQ ALASDVSLKD SERTTLERAA QIVKFDLGSQ MVTEMTSLAG
IMGREYALHA GETADVAAAM FEAELPRHAD DQLPHSTPGA LLALADKLDS LIGLAATVGL
PTSSSDPFAL RRAAVGTIAI HRGTPALAGL SLRDALAEAA RLQPVPVTSE TIDAVSEFVT
RRLEQQLLDE GHAVDRVRAV IIHADKPSTA DRVLRQLTNL ADTPDFRRFA AALQRARRIA
TTGGAEYDPA ALTEPAELHL HETVSKSAGE LPEHADLGDL VAAGGALPAA VDTSFDGILV
MADDPQVRAR RLGLVGAVAA LGDGVLDWQA LRL