Gene Sare_0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0741 
Symbol 
ID5707773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp824775 
End bp825800 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content67% 
IMG OID641270260 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_001535651 
Protein GI159036398 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0159074 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00650672 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCCACG TACCCGCTCG CCCGCGTGTG CTGTCCGGCA TCCAGCCGAC GGCCGATTCG 
TTCCATCTCG GCAACTACCT GGGCGCGGTC GGGCACTGGG TTGCGTTGCA GGATACCCAC
GACGCCTTCT ACTGCGTGGT TGATCTGCAC GCGATCACCG CCGGGCACGA CCCCCGGATG
CTGGCACAGC GGTCCCGGAT TGCCGCGGCG CAGCTCCTTG CGGTGGGTCT GGACCCGGAG
CGCTGCACCC TCTTCGTGCA GTCCCAGGTA CCCGAGCACG CACAGTTGGC CTGGGTGTTG
GGATGCATCA CCGGGTTCGG CGAGGCGGGC CGGATGACCC AGTTCAAGGA CAAGTCGCAG
CGGCAGGGTA ACGAGCGGGC CAGCGTCGGC CTGTTCACGT ACCCGGTCCT CCAGGCAGCC
GACATCCTGC TCTACCAGGC CGACGCGGTG CCGGTCGGCG AGGACCAGCG GCAGCACCTG
GAACTCAGTC GCGACCTGGC GCAGCGTTTC AACACGCTGT TCGGGCCGAC CTTCACGGTT
CCCGAGGCAC ACATCGTCAA GGACACTGCG AAGATCACGG ACCTGCAGGA TCCAACGGCC
AAGATGTCGA AATCGTCGTC GTCTCCGGCG GGTATCGTCC TGTTACTGGA GGATGCGGCC
CGGTCGGCCA AGAAGATCCG CTCGGCGGTG ACCGACACCG GGCGGGAGGT CGTTTTCGAT
GCGCAGCAGA AGCCAGGTGT GTCCAACCTG CTGACGATCT ACTCGGCGCT GTCCGGCCGG
AGCATCGATG ATCTGGTCGC CGCGTATGCC GGCAAGGGCT ACGGCGACCT GAAGAAGGAC
CTCGGAGAGG TGGTACGCGA GTTCGTGGCG CCGATCCAGG ATCGTACCCG CGGCTACCTC
GCCGATCCGG CCCAACTCGA CAGGCTGCTG GTGACGGGGG CGCAGAAGGC GCGGGCGGTG
GCGGGACCGA CCCTGCGGGC CGTGTACGAG CGGGTGGGCT TCTTTCCGCC GGTGCTCGGC
GAGTAG
 
Protein sequence
MSHVPARPRV LSGIQPTADS FHLGNYLGAV GHWVALQDTH DAFYCVVDLH AITAGHDPRM 
LAQRSRIAAA QLLAVGLDPE RCTLFVQSQV PEHAQLAWVL GCITGFGEAG RMTQFKDKSQ
RQGNERASVG LFTYPVLQAA DILLYQADAV PVGEDQRQHL ELSRDLAQRF NTLFGPTFTV
PEAHIVKDTA KITDLQDPTA KMSKSSSSPA GIVLLLEDAA RSAKKIRSAV TDTGREVVFD
AQQKPGVSNL LTIYSALSGR SIDDLVAAYA GKGYGDLKKD LGEVVREFVA PIQDRTRGYL
ADPAQLDRLL VTGAQKARAV AGPTLRAVYE RVGFFPPVLG E