Gene Sare_3553 details

Gene Information

Locus tag: Sare_3553
Symbol:
ID: 5705046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4099288
End bp: 4101042
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 71%
IMG OID: 641272980
Product: prolyl-tRNA synthetase
Protein accession: YP_001538346
Protein GI: 159039093
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
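
The coordinates and lengths in this record are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it is the reading that reproduces the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
start_bp = 4099288
end_bp = 4101042

# Inclusive span of the gene in base pairs.
gene_length = end_bp - start_bp + 1

# Number of codons, including the terminal stop codon.
codons = gene_length // 3

# The stop codon is not translated, so the protein is one residue shorter.
protein_length = codons - 1

print(gene_length, protein_length)  # 1755 584
```

Both values match the Gene Length and Protein Length fields reported for Sare_3553.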


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.279116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0486648
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCTAC GGATGTCGAC CCTGCTGCTG CGGACCCTCC GTGAGGACCC AGCAGACGCA 
GAGGTACCGA GCCACCGGCT GCTCCTGCGC GCCGGCTACC TCCGCCGCGC CGCTCCCGGT
GGGTACACCT GGCTGCCGCT GGGCAAGCTC GTGCTCGACC GGGTGGCCGA GGTGATCCGG
ACCGAGATGC TCGCGATCGG TGACCAGGAG GTGCACTTCC CGGCGCTGCT GCCGGCCGAG
CCGTACCGGA CCAGCGGGCG GTGGACGGAG TACGGCGACG ACCTCATCAC CCTCGTCGAC
CGAAGGGGCG CCGAGCACCT GCTCGCCCCG ACCCACGAGG AGCCGGCGGC CCTGCTGGTC
AAGGAGCTGT TCACCTCGTA CCGGGACTTC CCCGTAGGGA TCTTCCAGAT ACAGACAAAG
TTCCGCGACG AGGCACGGCC CCGGGCCGGT CTGTTGCGTG GGCGCGAGTT CCTGATGAAG
GACGCGTACT CCTTCGACCT GGACGAGGCG GGTCTCCAGG CCGCCTACGA CCGACACCGG
AGCGCGTACC AAAAGATCTT CGCGCGGCTC GGCCTGGACT ACGCGGTCGT GCACGCCGTT
TCCGGGGCGA TGGGCGGTTC GGCTTCGGAG GAGTTCCTGG CCACGTCCAA GATCGGCGAG
GACGTCTACG TCGGCTGCAC CGCGTGCGAC CACACCGCGA ACACCGAGGC CGTGACCACA
CTCGCCCCGC CGGCCTCGAA CCCGGAGGAA CGGCCGGCGA CTCAGGTGCA CGACACCCCG
GACACACCGA CGATCGCCAG CCTGGTCGGC CTCGCCAACG CCCGTGCGCT GGCCGGCCGG
GACGACTGGG CCGCCGGCGA CACCCTGAAG AATGTCGTCC TCACGATCCG CCCGCCCGGC
GCGGCGAAGT CCGAGCTCCT GGTCATCGGT CTTCCCGGTG ACCGGGAGGT TGATCTCAAG
CGGGTCGCGG CGACACTCGC CCCGGCGACC GTCACCGTGT TCGACGGCTG GGCCGACCAC
CCCGAGTTGG TCCGTGGCTA CCTCGGGCCG CAGGTCATGG CCAAGCTCGG TGTCCGTTAC
CTGGTCGACC CCCGGGTGGT GCCCGGCACC GCCTGGCTGA CCGGCGCGAA CGAGCCCGGA
CGGCACGCGA CGAACGTCGT CTGCGGTCGA GACTTCCTGC CCGACGGCAC GATCGAGGCC
GCCGAGGTCC GTCCCGGCGA TCCCTGCCCG GCCTGCCGCA CCGGGCAGCT CACCCTACGT
CGGGGCATCG AGATCGGGCA CATCTTCCAG CTCGGTCGCC GCTACACCGA CGCGTTCACC
GTGGACGTGC TCGGCCCGGA GGGCCAGTCG GTCCGACCCA CGATGGGCTG CTACGGCATC
GGTGTGTCCC GGGCGGTCGC GGTGATCGCC GAGCAGCACC ACGACGAGCG GGGCCTCGTC
TGGCCGACCG AGGTCGCGCC ATGCGACGTA CACCTGGTGG CGGCCGGTAG GGGACCGCAG
GTGGAGACAG CACTCGGCCT CGGCAACCGT CTCGCCGAGG CCGGCCTACG GGTGCTGGTG
GACGACCGAG GGCACGTCTC CGCCGGGGTG AAGTTCACCG ACGCGGAACT GGTCGGCATT
CCCCGGACGG TCGTGGTCGG CCGCCGGCTC GCCGACGGGT ACGCCGAGGT GCGTGACCGG
CCCTCCGGCA AGCGTGCCGA CGTGCGGGTG GACGCCCTCG TCGAACACCT GGTCAACGAG
GTCCATAGCG GGTAG
 
Protein sequence
MLLRMSTLLL RTLREDPADA EVPSHRLLLR AGYLRRAAPG GYTWLPLGKL VLDRVAEVIR 
TEMLAIGDQE VHFPALLPAE PYRTSGRWTE YGDDLITLVD RRGAEHLLAP THEEPAALLV
KELFTSYRDF PVGIFQIQTK FRDEARPRAG LLRGREFLMK DAYSFDLDEA GLQAAYDRHR
SAYQKIFARL GLDYAVVHAV SGAMGGSASE EFLATSKIGE DVYVGCTACD HTANTEAVTT
LAPPASNPEE RPATQVHDTP DTPTIASLVG LANARALAGR DDWAAGDTLK NVVLTIRPPG
AAKSELLVIG LPGDREVDLK RVAATLAPAT VTVFDGWADH PELVRGYLGP QVMAKLGVRY
LVDPRVVPGT AWLTGANEPG RHATNVVCGR DFLPDGTIEA AEVRPGDPCP ACRTGQLTLR
RGIEIGHIFQ LGRRYTDAFT VDVLGPEGQS VRPTMGCYGI GVSRAVAVIA EQHHDERGLV
WPTEVAPCDV HLVAAGRGPQ VETALGLGNR LAEAGLRVLV DDRGHVSAGV KFTDAELVGI
PRTVVVGRRL ADGYAEVRDR PSGKRADVRV DALVEHLVNE VHSG
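
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned, checked for GC fraction, and translated is given below. The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool. The codon table is built from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for elongation codons; alternative start codons are not handled, which is sufficient here because the gene starts with ATG.

```python
from itertools import product

# NCBI codon ordering: bases in T, C, A, G order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCTGCTAC GGATGTCGAC CCTGCTGCTG CGGACCCTCC GTGAGGACCC AGCAGACGCA"
print(translate(first_line))  # MLLRMSTLLLRTLREDPADA
```

The translated residues agree with the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.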