Gene Sros_4297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4297 
Symbol 
ID8667591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4799471 
End bp4802545 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content68% 
IMG OID 
ProductGlycine--tRNA ligase 
Protein accessionYP_003339929 
Protein GI271965733 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.83812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCTTGA AGAAGACTGA AGCCGTAAGC CCGATCCCCG CCATCATCGA GGTCGCCGCC 
GTGTTGACGA TGCAGGATGC TCTCCGTGCC CTCACTGACT ACTGGTCGCG TCACGGATGC
ACGCTGATAC AGCCGTACAA CTCCGAAGTC GGCGCGGGCA CGCTCAATCC GGCGACCGTA
CTGCGGGTGC TCGGTCCGGA GCCGTGGCGG GTGGCATATG TGGAGCCCAG CGTACGGCCC
GATGATGCCC GGTACGGAGA AAATCCGAAC CGCCTGCAGA CGCATACCCA GTTTCAGGTG
ATCCTCAAGC CGGACCCGGG TGATCCGCAG GAGCTTTATC TGGGGAGCCT GGCGGCCCTG
GGCATCGACC TGCATGCGCA CGATGTCAGA TTCGTGGAGG ACAACTGGGC CTCGCCAGCG
TTGGGCGCAT GGGGTCTCGG GTGGGAGGTG TGGCTCGATG GCCTCGAGAT CACCCAGTTC
ACCTATTTTC AACAGGCCGG CGGGATAACG CTCGATCCCG TGCCGGTAGA GATCACCTAT
GGTATCGAGC GGATCATGAT GGCCCTGCAG AGAGTCGATC ATTTCAAGGA GATCCGTTAC
GCCACAGGTG TTTCGTACGG TGAGATATTC GGCCAGGCCG AGTACGAAAT GAGCCGCTAC
TACCTCGACG AAGCGAGCAC GGACGCCAAC GGGCGCCTCT TCGAGGAATA TGCCGCAGAG
GCCCGGCGGA TGCTGGACGC CCGGCTGCCG GTGCCCGCGC ACAATTACGT GTTGAAGTGC
TCGCACACCT TCAATGTGCT CGACTCCCGA GGGGTGGTCT CCACCACGGA GCGGGCAGCG
GCCTTCAAGC GGATGCGAGG GCTGGCACGG GACGTGGCAC ACCTGTGGGT GGAGCGGCGG
GCCGAACTCG GCCACCCGCT CGGAGTGGTC GGTCCTCCGG CGCCGGCCGA GCTGCCCGTA
TCGACCGCGG TGCCCGGCAC TCCCGCCACC TTGGTCTTCG AGATCGGCAC CGAGGAGCTG
CCGGCCGGCG ACGTGACCCG GACGGCCGCG ACGGTGCGTG ACGCGGTAGC TCGGAAGCTC
GCCGCGACGC GGCTCGGCCA CGGTGGGATA CGCACGTACG CCACCCCCCG CCGCGTGGTC
ACAATTGTGG AACAGGTGCA GCCGCGCGAG CCGGACAGCG AGCAGACCGT ACGCGGTCCA
CGCGTGAGTG TCGCCTACGC CGACGACGGC GAGCCGACCA AGGCCGCGGT CGGCTTCGCC
CGCGGCCGGC AGGTGGATGT CGCCAAGCTG CGACGGATCG ATGTGGACGG CGTGGCACAT
GTCGGGGTGG TCAGGCCAGA CCCGGGCCGA GACGCCGTGG CAGTTCTCAG CGAGCTCCTG
GGACAGGTCG TCGCGGAATT GCGCGCCGAC AAGAACATGC GATGGAACGA CCCGAAGTTG
TCGTTCGTCC GGCCGTTGCG GTGGTTGCTC GCACTTCTCG GCGACGTCCC CGTGCCTCTG
GCGGTCTCGG CGCTGGCCGG CGGGACCACC ACCCGGGTGC TCCGGAACGC GGCCGAGCCC
TCGGTCGAAG TGCCGAGCGC GGACGGCTAC CTGGAGTTGC TGGCAGGAGA AGGGATCGTA
CTGGATCCCG CCGAGCGGCG GACCCGGATC GTCAGGGCGG CGCAGCGGCT GGCCAGGGAC
GTGGGCGGCG TCGTCGACGT TGAAGGCGAG TCGGCTCTCA TCGACGAGAT CACCAACTTG
GTCGAGGAAC CGACGCCGAT CCGGGGCGGA TTCGCCGAGG AGTACCTCGC GCTGCCATCC
GAGATCCTCA TGACGGTCAT GCGCAAGCAC CAGAGGTACT TGCCCGTGCG CGCCGCGGAC
GGCACGCTGA TGCCGTACTT CATCACGGTC GCGAATGGCT CCTGCGATCA GGAGGTGGTC
GGAACCGGGA ACGAAGCGGT GCTGCGCGCT CGATACGAGG ACGCGCTCTT CTTCTGGCGC
GCGGATCTTG ACGTGTCCCC GGACAGGTTC CGTGCGGCTC TGGACAAGCT GTCCTTCGCG
GAGAATCTCG GCTCCATGGC CGATCGTGCC GATCGGATCG CCGCTGTCGC GCAGGACCTG
GGTGCGTTGC CCATGGTCTT CATGACCGAG GCCGAAAGGC GGACCCTGAC CCGCGCCGCC
GCACTGGCGA AGTTCGATCT CGCGTCACAC ATGGTGGTGG AGCTCACCAG CCTCGCCGGC
ACCATGGCTC GAGAGTATGC CCTCCGGGCC GGCGAACCAG AGCCGGTTGC GCAGGCGCTC
CATGAGATGG AACAGCCCCG GTCCGCCGGC GGACAGCTGC CACGCAGCAC CCCGGGCGCG
CTGCTCGCAC TCGCGGATCG GCTCGACCTG CTCGTGGGGA TGTTCGGCAT CGGGTCCGGC
CCCACCGGCC GGTCCGACCC CTTCGGTCTG CGCCGAGCGG CATCGGGCCT GGTGAGCATC
CTCCGCGACC ACACGGCTCT ACGGTCGATG GACGTGAGCG CCTGCCTGGC CGTCGCCGCG
GAACACCTCC GGGCGCGAGC CGTCGACGTT CCGGCGACGA CGCTGGACGA GATCCGGGAA
TTCGTCACCC GCCGGTACGA GCAGCAACTT CTGGACTCCG GACAGGACCA CCGGTACGTC
GCCGCTGTGC TCCCCCTGGC CGGCAGTCCG GCCCGAGCAG ACGAAACGCT TGCCGAGCTG
CACAAGCGCG CGAAGGCACC CGACTTCGCC GAGCTTGTTG CCGCTCTCCT GCGAGTGAGA
CGGATCGTGC CGGCGGGCAC CGCGGCGGGA TACGACGCGA GCCGGCTGGT CGAGCCCTGT
GAGCTCTCCC TCCACCAGGT CCTGGGCGAG GTACGGACCG CCCTGGGCGA GCAGCCCACG
CGGTTGCGGG ACTTCGTCGA TACCGCTGCC GCGCTCGTCG GACCGGTCAA CACCTTCTTC
GATGAGGTCC TGGTCATGGA TGAGGACGCC GACCGGCGTG CGACCCGGCT CGGTCTCCTC
GCCCACATCC GCGATCTGTC GGCAGGTGTC CTCGACTGGT CCGCCCTGGG CGCCATGCCG
GCCGCCGGCA AGTAG
 
Protein sequence
MCLKKTEAVS PIPAIIEVAA VLTMQDALRA LTDYWSRHGC TLIQPYNSEV GAGTLNPATV 
LRVLGPEPWR VAYVEPSVRP DDARYGENPN RLQTHTQFQV ILKPDPGDPQ ELYLGSLAAL
GIDLHAHDVR FVEDNWASPA LGAWGLGWEV WLDGLEITQF TYFQQAGGIT LDPVPVEITY
GIERIMMALQ RVDHFKEIRY ATGVSYGEIF GQAEYEMSRY YLDEASTDAN GRLFEEYAAE
ARRMLDARLP VPAHNYVLKC SHTFNVLDSR GVVSTTERAA AFKRMRGLAR DVAHLWVERR
AELGHPLGVV GPPAPAELPV STAVPGTPAT LVFEIGTEEL PAGDVTRTAA TVRDAVARKL
AATRLGHGGI RTYATPRRVV TIVEQVQPRE PDSEQTVRGP RVSVAYADDG EPTKAAVGFA
RGRQVDVAKL RRIDVDGVAH VGVVRPDPGR DAVAVLSELL GQVVAELRAD KNMRWNDPKL
SFVRPLRWLL ALLGDVPVPL AVSALAGGTT TRVLRNAAEP SVEVPSADGY LELLAGEGIV
LDPAERRTRI VRAAQRLARD VGGVVDVEGE SALIDEITNL VEEPTPIRGG FAEEYLALPS
EILMTVMRKH QRYLPVRAAD GTLMPYFITV ANGSCDQEVV GTGNEAVLRA RYEDALFFWR
ADLDVSPDRF RAALDKLSFA ENLGSMADRA DRIAAVAQDL GALPMVFMTE AERRTLTRAA
ALAKFDLASH MVVELTSLAG TMAREYALRA GEPEPVAQAL HEMEQPRSAG GQLPRSTPGA
LLALADRLDL LVGMFGIGSG PTGRSDPFGL RRAASGLVSI LRDHTALRSM DVSACLAVAA
EHLRARAVDV PATTLDEIRE FVTRRYEQQL LDSGQDHRYV AAVLPLAGSP ARADETLAEL
HKRAKAPDFA ELVAALLRVR RIVPAGTAAG YDASRLVEPC ELSLHQVLGE VRTALGEQPT
RLRDFVDTAA ALVGPVNTFF DEVLVMDEDA DRRATRLGLL AHIRDLSAGV LDWSALGAMP
AAGK