Gene Sros_8726 details

Gene Information

Locus tag: Sros_8726
Symbol:
ID: 8672064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 9623943
End bp: 9626918
Gene Length: 2976 bp
Protein Length: 991 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Glycine--tRNA ligase
Protein accession: YP_003344105
Protein GI: 271969909
COG category:
COG ID:
TIGRFAM ID:
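
The listed gene and protein lengths follow from the coordinates by simple arithmetic. The Python sketch below shows that check. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but this matches the listed gene length) and that the CDS ends with a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length_bp = gene_length(9623943, 9626918)
print(length_bp)                  # 2976
print(protein_length(length_bp))  # 991
```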


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCACCA TGCAAGACGC TCTCCTCGCA CTGACGCGGT ACTGGACCGA GCAGGGATGC 
ATGGTGGTGC AGCCCATGAA CACCGAGGTC GGGGCGGGCA CCCTGAACCC GGCGACCGCG
CTGCGCGTGC TGGGGCCCGA GCCGTGGCGG GTGGCGTACG TCGAGCCGAG CGTCCGGCCC
GACGACTCCC GTTACGGCGA CAACCCGAAC CGGCTGCAGA CGCACACCCA GTTCCAGGTG
ATCCTCAAGC CCGAGCCGGG CGACCCGCAG GAGCTCTACC TCGGCAGCCT CAGGGCTCTG
GGCATCGACA TCGACCGGCA TGACGTCCGC TTCGTGGAGG ACAACTGGGC GTCCCCGGCG
CTCGGGGCGT GGGGGCTCGG CTGGGAGGTG TGGCTGGACG GCCTGGAGAT CACCCAGTTC
ACCTACTTCC AGCAGGCCGG CGGCATGACA CTGGACCCGG TGTCGGTAGA GATCACCTAC
GGGATGGAGC GCATCATCAT GGCGCTCCAG GGCGTGGACC ACTTCAAGGA CATCGCCTAC
GCGCCCGGCA TCTCCTACGG CGAGGCCTTC GGCCAGGCCG AGTACGAGAT GAGCCGCTAC
TACCTCGACG ACGCCGACGT CACCGCGCAG CGCGGCCTCT TCGAGGCGTA CGCGGCGGAG
GCCGACCGGC TGGTCGAGGC GCGCCTGCCC GTCCCCGCCC ACACGTACGT GCTGAAGTGC
TCCCAGGCGT TCAACGTCCT GGACTCGCGC GGGGCGATCT CCACGACCGA GCGGGCCCAG
GCGTTCGCCC GGATGCGCAG GCTCGCTCAC AGCGTCGCCA AGCTCTGGGT GGAGCGCCGC
GCCGAGCTCG GCTACCCGCT CGGCGGCGTG GAAGTCCCGC CGGCCGAGGC GGCGGCCGCG
GAGAGCCCGC GGCCCGGCAC GGACCAGACG CTCGCCTTCG AGATCGGCGT CGAGGAGCTG
CCCCCCGCCG AGACGACCCG GGCCGCCGAC GCCGTACGGC AGGCGCTCAC CGAGAAGCTG
GCCGCCACGC GGCTACGGCA CGGCTCCGTC ACCGTGATGT CCTCGCCGCG GCGCATCGTG
GCGCTCGTCG AGGACATCGC ACCGCGCGAG GACGACGACG AGCAGACCGT CCGCGGTCCG
CGCCTGTCGG CGGCGTACGA CGCCGAGGGC GCTCCGACCA AGGCTGCGCA GGGCTTCGCG
CGCGGCCAGG GCATCGACGT CGCCGAGCTG GCGCCTCTGA GCGCCGGCGG CGGCGAGTAC
GTGGGATACG TCAAGCACGT CCCCGGCCGT CCCGCCGGAG AGGTGCTCGC CGCGATCCTG
CCGGAGATCG TCACCGGACT GCGCGCCGAG AAGAACATGC GCTGGCGCTC GCCGGGCCTG
TCCTACAGCC GACCGATCCG CTGGATCACC GCGCTGCTGG GCGCGGAGGT CGTGCCGTTC
ACGGTCGCCG ACCTGGCCTC CGGCAGGTCC AGCCGCGTGC ACCGCACCGC CGCCGAACCC
GTCATCACCC TCAGCACCGC GACCGGCTAC GTGGAGACGC TGCGCCGCCA CGCGATCGAA
CCCGACGCCG CCGTACGGCG CGACGACATC GTCGAGCAGG CGGCCCGGCT CGCCGCGGGC
GCGGGCGGAA GGATCGACTT CGAGGCCGAG AGCGCCCTCG TGGACGAGGT CACGAACCTG
GTCGAGGCGC CCGTCGCGAT CCTGGGGACG TTCGACGAGA AGTACCTGGA GCTTCCGGCG
GCGATCCTGA CGACCGTCAT GAAGAAGCAT CAGCGCTACT TCCCGGTCCT GGACGGCGAC
GGCCGGCTGC TGAACCGGTT CGTGACCATC GCCAACGGCG CGTGCGACCA CGACGCGGTG
CGGGCCGGGA ACGGAGCGGT CCTGCGCGCC CGCTACGAGG ACGCCGGGTT CTTCTGGCGC
AACGACCTCG CGACCCCGCT CACCGAGATG AAGCGGCGTC TGGCGCGGCT CACCTTCGAG
ACCCGGCTGG GTTCGGTCGC CGAACGCGCC GACCGCATCG ACGCGATCGC CTCAGACCTG
GCCGCGCGGG CACTGCTCGG CGGGGACGAT GGCGAGACGC TGCGCCGGGC CGGCCGGCTC
GCCAAGTTCG ACCTGGGGTC CGAGCTCGTC ATCGAGCTGT CCAGCCTCGC CGGCGTGATG
GCCCGCGAGT ACGCGCGCCA GGCCGGCGAG CCGGAGGCGG TCGCCGAAGC CCTGTACGAG
ATGGAGCTTC CGCGCCAGGC GGGCGACGCG CTGCCGCGGA GCCGTCCCGG GGCGCTGCTC
GCGCTCGCCG ACCGGTTCGA CCTGCTGGCT GGGCTGTTCG CGATCGGCTC CGAGCCGACC
GGGAGCTCCG ATCCGTTCGG GCTCCGGCGG GCGGCCCTCG GAGTGATCAA CATCCTGCGC
GCCCACCCCG ATCTAGCGGG CCTCACGCTG CGCGAAGCGC TCGCGATCGC CGCGAGCCAC
CAGCCCGTCC CCACGGACGC CCGCCTGCCC GACCAGGTGC TGGACTTCGT CAGGCGCCGG
TTCGAGCAGC TCATGCTCGA ACAGGGCCAC CCCGCCGCGA ACATCCGGGC CGTCGCCGGC
CTCGTCGACA GCCCCGTCCG CGCCGAGCAG ACCCTCGACC ACCTCGCCGC CCTGCTGAGT
ACCGAGGACT TCCAGGAGCT CGAAGCCGCG GTCCAGCGCA TCCGCCGCAT CATCCCCGCC
GGTGCCACGC CCGGCTACGA CCCGGGACTG TTCGACAGCC CGGCGGAGGA GGGACTGGCC
ACCGCCCTGG AGAAGGCCCG CGCGGGCCTG CGCGGGGAGA CCGACCTGCG GCGCTTCGCC
GCCGAGGCGG CTGTCGTCGT CCATCCGGTC ACCGTGTTCT TCGACGAGGT GCTGGTCATG
GCGGACGACC CGGCCGTCAG AGCCAACCGG CTCGGTCTCC TCGCCGCCGT CCACGACCTG
GCCGACGGCT ACCTGGACTG GAAGGAGCTT TCCTGA
 
Protein sequence
MLTMQDALLA LTRYWTEQGC MVVQPMNTEV GAGTLNPATA LRVLGPEPWR VAYVEPSVRP 
DDSRYGDNPN RLQTHTQFQV ILKPEPGDPQ ELYLGSLRAL GIDIDRHDVR FVEDNWASPA
LGAWGLGWEV WLDGLEITQF TYFQQAGGMT LDPVSVEITY GMERIIMALQ GVDHFKDIAY
APGISYGEAF GQAEYEMSRY YLDDADVTAQ RGLFEAYAAE ADRLVEARLP VPAHTYVLKC
SQAFNVLDSR GAISTTERAQ AFARMRRLAH SVAKLWVERR AELGYPLGGV EVPPAEAAAA
ESPRPGTDQT LAFEIGVEEL PPAETTRAAD AVRQALTEKL AATRLRHGSV TVMSSPRRIV
ALVEDIAPRE DDDEQTVRGP RLSAAYDAEG APTKAAQGFA RGQGIDVAEL APLSAGGGEY
VGYVKHVPGR PAGEVLAAIL PEIVTGLRAE KNMRWRSPGL SYSRPIRWIT ALLGAEVVPF
TVADLASGRS SRVHRTAAEP VITLSTATGY VETLRRHAIE PDAAVRRDDI VEQAARLAAG
AGGRIDFEAE SALVDEVTNL VEAPVAILGT FDEKYLELPA AILTTVMKKH QRYFPVLDGD
GRLLNRFVTI ANGACDHDAV RAGNGAVLRA RYEDAGFFWR NDLATPLTEM KRRLARLTFE
TRLGSVAERA DRIDAIASDL AARALLGGDD GETLRRAGRL AKFDLGSELV IELSSLAGVM
AREYARQAGE PEAVAEALYE MELPRQAGDA LPRSRPGALL ALADRFDLLA GLFAIGSEPT
GSSDPFGLRR AALGVINILR AHPDLAGLTL REALAIAASH QPVPTDARLP DQVLDFVRRR
FEQLMLEQGH PAANIRAVAG LVDSPVRAEQ TLDHLAALLS TEDFQELEAA VQRIRRIIPA
GATPGYDPGL FDSPAEEGLA TALEKARAGL RGETDLRRFA AEAAVVVHPV TVFFDEVLVM
ADDPAVRANR LGLLAAVHDL ADGYLDWKEL S
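
The gene and protein sequences can be cross-checked against each other and against the GC content field. The Python sketch below strips the 10-character block spacing, computes a GC fraction, and translates codons. It assumes the gene sequence is shown in coding orientation. The codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs from the standard table only in its permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First two blocks and the final line of the gene sequence in this record.
print(translate("ATGCTCACCA TGCAAGACGC"))                   # MLTMQD
print(translate("GCCGACGGCT ACCTGGACTG GAAGGAGCTT TCCTGA"))  # ADGYLDWKELS
```

Passing the complete gene sequence to `gc_fraction` and `translate` gives values to compare with the GC content field and the protein sequence listed in this record.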