Gene Sare_3810 details


Gene Information

Locus tag: Sare_3810
Symbol:
ID: 5705305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4343765
End bp: 4345144
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 66%
IMG OID: 641273232
Product: glycyl-tRNA synthetase
Protein accession: YP_001538594
Protein GI: 159039341
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0423] Glycyl-tRNA synthetase (class II)
TIGRFAM ID: [TIGR00389] glycyl-tRNA synthetase, dimeric type
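
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(4343765, 4345144)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1380 459
```

Under these assumptions the result agrees with the reported Gene Length (1380 bp) and Protein Length (459 aa).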


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0220595
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGCAG ACCGTATCGA CGCCGTTGTC AGCCTCGCCA AGCGTCGGGG TTTCGTCTTT 
CCGTCCAGCG AGATCTACGG GGGGACCCGG TCGGCGTGGG ACTACGGCCC GCTCGGTGTG
GAGCTGAAGG AAAACGTCCG CCGGCAGTGG TGGCGGAGCA TGGTTCAGCA ACGCGACGAC
GTGGTGGGCC TCGACTCCGC GGTGATCCTG GCCCGGGACG TCTGGGCTGC CTCCGGCCAC
CTGGACGCGT TCGTCGACCC GTTGACCGAG TGTCAGTCCT GCCACAAGCG GTTCCGGGCC
GACCACCTGG AGGAGACCTA CGAGGCCAAG CACGGTCGCC CGCCGGCCTC GTTGAGCGAG
CTGAACTGCC CGAACTGCGG TAACAAGGGC ACCTTCACCG AACCGCGGAT GTTCAACGGC
CTGATGAAGA CCTACCTGGG CCCGGTGGAG AGCGACGAGG GTCTGCACTA TCTGCGACCG
GAGACCGCAC AGGGCATCTT CGTCAACTAC AAGAACGTCG AGACGGTGGC CCGCAAGAAG
CCGCCGTTCG GCATCGCCCA GACCGGCAAG TCCTTCCGTA ACGAGATCAC CCCCGGCAAC
TTCATCTTCC GGACCCGTGA GTTCGAGCAG ATGGAGATGG AGTTCTTCGT CGAACCGGGC
ACCGACGAGG GCTGGCACGA GTACTGGCTC ACCGAGCGTT GGAACTGGTA CCTCGACCTC
GGTCTCACCG AACGCAACCT GCGCCGGTAC GAGCACCCGC AGGAGAAGCT CTCGCACTAC
TCGAAGCGCA CCGTCGACAT CGAGTACCGG TTCCAGTTCG GCGGCACCGA GTTCGCTGAG
CTGGAGGGCA TCGCCAACCG CACCGACTTC GACCTGTCTA CGCACAGCAA GCACTCCGGA
GTGGATCTGT CCTACTTTGA CCAGGCCAAG GGCGAGCGGT GGATTCCGTA CGTGATCGAG
CCGGCGGCCG GTCTCACCCG CGCGGTGCTG GCGTTCCTGC TCGAGGCGTA TGACGAGGAC
GAGGCACCGA ACACCAAGGG CGGCGTGGAC AAGCGCACGG TGATGCGCTT CGACCCGCGG
CTTGCCCCGG TGAAGGCGGC GGTGCTGCCG CTGTCGCGCA ACGAGGCACT GTCGCCGAAG
GCCCGGCAAC TCGCGGCAGA CCTGCGTCAG CGCTGGGTGG TGGAGTTCGA CGACTCGCAG
GCCATCGGCC GCCGCTATCG CCGGCAGGAC GAGATCGGTA CCCCGTTCTG TGTGACGGTC
GACTTCGACA CCCTCGACGA CAACGCGGTG ACCGTGCGGA ACCGGGACAC CATGGCTCAG
GAGCGGATCT CCCTGGACCA GGTCGAGCGG TACCTCATCG AACGCCTTCC CGGCTGCTAG
 
Protein sequence
MPADRIDAVV SLAKRRGFVF PSSEIYGGTR SAWDYGPLGV ELKENVRRQW WRSMVQQRDD 
VVGLDSAVIL ARDVWAASGH LDAFVDPLTE CQSCHKRFRA DHLEETYEAK HGRPPASLSE
LNCPNCGNKG TFTEPRMFNG LMKTYLGPVE SDEGLHYLRP ETAQGIFVNY KNVETVARKK
PPFGIAQTGK SFRNEITPGN FIFRTREFEQ MEMEFFVEPG TDEGWHEYWL TERWNWYLDL
GLTERNLRRY EHPQEKLSHY SKRTVDIEYR FQFGGTEFAE LEGIANRTDF DLSTHSKHSG
VDLSYFDQAK GERWIPYVIE PAAGLTRAVL AFLLEAYDED EAPNTKGGVD KRTVMRFDPR
LAPVKAAVLP LSRNEALSPK ARQLAADLRQ RWVVEFDDSQ AIGRRYRRQD EIGTPFCVTV
DFDTLDDNAV TVRNRDTMAQ ERISLDQVER YLIERLPGC
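
The gene and protein sequences above are related by translation table 11, and the GC content field is a simple base count. The sketch below shows one way to reproduce both from the gene sequence, assuming only the Python standard library. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is written out by hand in NCBI's TCAG ordering; table 11 uses the same codon-to-residue assignments as the standard code and differs in the start codons it allows, which this sketch does not model.

```python
from itertools import product

# Codon table in NCBI TCAG order. Table 11 shares these assignments with the
# standard code; alternative start codons are not handled in this sketch.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three ten-base blocks of the gene sequence above.
fragment = "ATGCCAGCAG ACCGTATCGA CGCCGTTGTC"
print(translate(fragment))   # MPADRIDAVV
print(gc_content(fragment))  # 0.6
```

The translated fragment matches the first ten residues of the protein sequence. Applying `translate` and `gc_content` to the full gene sequence would be the way to compare against the reported 459 aa product and 66% GC content.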