Gene Sare_1145 details


Gene Information

Locus tag: Sare_1145
Symbol:
ID: 5704289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1294410
End bp: 1295468
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 67%
IMG OID: 641270660
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_001536044
Protein GI: 159036791
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
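
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record does end in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(1294410, 1295468)
print(length)                     # 1059
print(protein_length_aa(length))  # 352
```

Both results match the Gene Length and Protein Length fields reported in the record.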


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000761726
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGCTC AGCGGATGCT CACCGGTGAC CGTCCCACCG GGAAACTGCA CCTCGGCCAC 
TACGTCGGCA GCATCGCCAA CCGGGTGAAG TTGCACCAGC GGTACGAGAG CTTCTTCATC
ATCGCCGACC TGCACATGTT GACCACCAAG AACACCCGCG ACGACATCGC GAGGGCCACC
CAGAACGCCC GGGACATGGT CCTCGACTCC CTCGCCGCGG GGATAGACCC GGACACCGCC
ACCTTCTATC TCCAGTCGGC GATCCAGGAA GTCGGCGATC TCAACACCCT CTTCCAGAAC
CTGGTCACCG TGCCGCGCCT GGAGCGGGTG CCATCGCTCA AGGACATGGC CCGCGACGCT
GGTAAGGACG AGATGCCATA CGGTCTGCTC GGCTACCCGG TCCTGCAGGC CGCCGACATC
CTCTGCGTCA AGGCCCACGT GGTGCCCGTC GGCAAGGACA ACGCCGCGCA CGTCGAGGTC
ACCAGGGAAC TGGCCCGCCG CTTCAACCAC CTCTACGGCG AGGTCTTCCC CGTCCCTGAA
CTTGTCAGCG CCGAAACGCC CACCCTGGTC GGCACCGACG GCCGGGCCAA GATGAGCAAG
AGCCTGGGCA ACGTCATCGC GCTTTCCGAC GAGCCGGCCG ACGTTCGCCG CAAGGTCATG
GGCATGTACA CCGACCCGAA CCGGGTCCGT GCGGACGTGC CCGGCACGGT CGAGGGCAAC
CCGGTGTTCC AGTATCACGA CGTCTTCAAC CCGAACCGGG CCGAGGTCGC TGACCTCAAG
AGTCGCTATC GCGAGGGCAG GGTCGGCGAT GTCGAGGTCA AGGAGAAGCT GGCCACCGCG
TTGAACGCGT TTCTCGACCC GGTGCGCGAG CGGCGCGCCC GCTACGAGGC CGACCGGGGC
CTGGTCGACG AGCTGATCGT GGAAGGCACG GAACGCACCC GGCGGGTGGT GCGGCAGACC
GTGTTCGACG CACGCAAGGC AATGGGCCTC ACCGGCGTCT ACACGCAACT GCGCCGCAAG
GCGGAACGGT CCCGCAAGCC CGCGGTCACC ACCGCGTAG
 
Protein sequence
MTAQRMLTGD RPTGKLHLGH YVGSIANRVK LHQRYESFFI IADLHMLTTK NTRDDIARAT 
QNARDMVLDS LAAGIDPDTA TFYLQSAIQE VGDLNTLFQN LVTVPRLERV PSLKDMARDA
GKDEMPYGLL GYPVLQAADI LCVKAHVVPV GKDNAAHVEV TRELARRFNH LYGEVFPVPE
LVSAETPTLV GTDGRAKMSK SLGNVIALSD EPADVRRKVM GMYTDPNRVR ADVPGTVEGN
PVFQYHDVFN PNRAEVADLK SRYREGRVGD VEVKEKLATA LNAFLDPVRE RRARYEADRG
LVDELIVEGT ERTRRVVRQT VFDARKAMGL TGVYTQLRRK AERSRKPAVT TA
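
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of how that could be done, together with a simple GC-fraction calculation of the kind that underlies a GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record.
first_line = "ATGACCGCTC AGCGGATGCT CACCGGTGAC CGTCCCACCG GGAAACTGCA CCTCGGCCAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be compared against the Gene Length field.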