Gene Sros_2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2107 
Symbol 
ID8665389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2261965 
End bp2263707 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content71% 
IMG OID 
ProductProline--tRNA ligase 
Protein accessionYP_003337835 
Protein GI271963639 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.205273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.90716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCTGC GCATGTCGTC GTTGTTTCTT CGAACACTGA GGGACGACCC GGCAGACGCG 
GAGGTCCCGA GCCACAAGTT GCTCGTCCGC GCCGGGTATG TCCGTCGTGT CGCACCCGGC
ATCTACTCCT GGCTGCCGCT CGGCAAGATG GTTCTGGAGA ACGTCACGCG CATCGTGCGC
GAGGAGATGA ACCGCATGGG CGGCCAGGAG GTGCTCTTCC CCGCCCTCCT GCCCCGGGAG
TACTACGAGG CCACCGGCCG CTGGACGGAG TACGGCGACA CGCTCTTCCG CCTCCAGGAC
CGCAAGGGGG CCGACTACCT CCTCGGCCCC ACCCACGAGG AGATGTTCAC CGACATGGTC
AAGGGGGAGT ACTCCTCCTA CAAGGACTAC CCGGTGACGC TCTACCAGAT CCAGACGAAG
TACCGCGACG AGGCCCGCCC CCGGGCCGGC ATCCTGCGCG GCCGCGAGTT CGTCATGAAG
GACTCCTACT CCTTCGACCT GGACGACGAC GGTCTCAAGC GCTCCTACGA GCAGCACCGC
GAGACCTACA TCAGGACCTT CGACCGCCTC GGCATCAGCT ACAAGATCTG CTTCGCCACC
TCCGGCGCGA TGGGCGGCTC CGCCTCCGAG GAGTTCCTCG CGCCCGCCGC GACGGGCGAG
GACACCTTCG TCGCCTGCCA CCAGTGCGGC TACGCGGCCA ACGCCGAAGC GGTCACCACC
CCCGCCCCCG CCGCGATCAC CGGCGAGCGG CCGGCCCTGC GGGTCCTCGA CACCCCTGAC
ACCCCGACCA TCGAGTCGCT GGTCGACCAC GTCAACGAGC ACCACGGCCT GGGCATCACC
GCCGCCGAGA CGCTCAAGAA CATCGTGGTC AAGGTCACCA CTCCCGGCTC CGGCAAGTCC
GAGACGCTGA TCGTCGGCGT GCCGGGCGAC CGCGAGGTCG ACTTCAAGCG GCTGGAGGCC
TCCCTGGCCC CCGGCGAGCC CGCCATCTTC GAGGCCGAGG ACTTCGCCAG GCACCCCGGC
CTGGTCCGCG GCTACATCGG CCCGCAGGTG CTCAAGGACC TCGGCATCCG CTACCTCGTC
GACCCCCGCG TGGTCGACGG CAGCGCCTGG GTGACGGGTG CCAACGAGCC CGGCAAGCAC
GCCGCCGGCG TGGTCGCCGG CCGCGACTTC ACCGCCGACG GCACGATCGA GGCCGCCGAG
GTCCGCGCGG GCGACGCCTG CCCGGTCTGC GGCTCCGGGC TGTCCATCGA CCGGGGCATC
GAGATCGGCC ACATCTTCCA GCTCGGCCGC AAGTACGCCG ACGCCGCGAA GCTGGACGCC
CTCGGCCCCG ACGGCAAGCC GATCCGCGTC ACCATGGGCT CCTACGGCGT CGGTGTCTCC
CGCGCCGTGG CCGTGCTGGC CGAGCAGAGG CACGACGAGC TCGGCCTGGT CTGGCCCCGC
GAGGTCGCCC CGGTGGACGT CCACGTCGTG GGCACCGGCA AGGACGGCCA GATCGAGGCC
GCGAACCGGC TGGCCGAAGA CCTTCAGGCC CGCGGCCTGC GCGTGCTGGT GGACGACCGC
GCCGGGGTCT CCCCGGGGGT GAAGTTCAAG GACGCCGAGC TGCTCGGCAT GCCCACGATC
CTGATCATCG GCCGGGGGCT CGCCCAGGGC GTCGCCGAGC TGCGCGACCG CGTCACCGGC
GTCAAGGAGG AGATCCCGAT CGACGAGGCC GCCGACCGGG TCGTGGCCGC CTGCCGCGCC
TGA
 
Protein sequence
MLLRMSSLFL RTLRDDPADA EVPSHKLLVR AGYVRRVAPG IYSWLPLGKM VLENVTRIVR 
EEMNRMGGQE VLFPALLPRE YYEATGRWTE YGDTLFRLQD RKGADYLLGP THEEMFTDMV
KGEYSSYKDY PVTLYQIQTK YRDEARPRAG ILRGREFVMK DSYSFDLDDD GLKRSYEQHR
ETYIRTFDRL GISYKICFAT SGAMGGSASE EFLAPAATGE DTFVACHQCG YAANAEAVTT
PAPAAITGER PALRVLDTPD TPTIESLVDH VNEHHGLGIT AAETLKNIVV KVTTPGSGKS
ETLIVGVPGD REVDFKRLEA SLAPGEPAIF EAEDFARHPG LVRGYIGPQV LKDLGIRYLV
DPRVVDGSAW VTGANEPGKH AAGVVAGRDF TADGTIEAAE VRAGDACPVC GSGLSIDRGI
EIGHIFQLGR KYADAAKLDA LGPDGKPIRV TMGSYGVGVS RAVAVLAEQR HDELGLVWPR
EVAPVDVHVV GTGKDGQIEA ANRLAEDLQA RGLRVLVDDR AGVSPGVKFK DAELLGMPTI
LIIGRGLAQG VAELRDRVTG VKEEIPIDEA ADRVVAACRA