Gene Sros_8944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8944 
Symbol 
ID8672282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9880866 
End bp9882620 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content70% 
IMG OID 
ProductArginine--tRNA ligase 
Protein accessionYP_003344319 
Protein GI271970123 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC CGCAGCTCGT CCTCACCGAG CGCGTCCAGC AGGCGCTGGC CGCCGCCTTC 
GGTACGGATT ACGCCGACGC AGACCCGCTG ATCCGTCCCT CGCAGTTCGC CGACTACCAG
GCGAACGTGG CGATGAGCCT GAGCAAGCGA CTGCGACGAG CCCCGCGGGA AGTCGCCCAG
GAGATCGCCG ATCACCTCTC CGGCCGCGCC GAAGGCACCC AGGGTGACGC CTTCCCCGGC
ACCGTCGAGG TCAGCGGTCC CGGCTTCCTG AACATCACGC TCTCCGACGA CTGGATCGCC
GAGCAGACCG CCGAGGTCAT GGCCGACCCG CGCAGCGGCG TCGCCCAGGC CACCCCGCCG
CAGACCGTCG TGATCGACTA CTCCGCGCCC AACGCGGCCA AGGAGATGCA CGTCGGCCAC
CTGCGCACGA CCATCGTCGG CGACGCCCTG GCCCGCGTGC ACGAGCACCT GGGCAACAAG
GTCATCCGGC AGAACCACCT GGGCGACTGG GGCACCCCGT TCGGCATGCT CATCGAGCAC
CTGCTGGACA TCGGCGAGGC GACCGCCGTC GCCCAGCTCG AGGCCGGTGA GGGCAACGCC
TACTACCAGG CGGCCCGCGC CAAGTTCGAC ACCGACCCGG AGTTCAACCA GCGCGCCCGC
ACGCGGGTGG TGACCCTCCA GGCCGAAGAG CCGGAGACCA TGCGCCTGTG GCACATCTTC
ATGGACGCCA CCGTCCGCTA CTTCAACAAG GTCTACAGCC AGCTCGGCGT GACGCTCACC
GACGACGACA TCGCCGGTGA GAGCATGTAC AACCACATGC TCGCCCGGGT CTGCGACGAG
CTCCAGGAGC GCGGCATCGC CGTCGTCAGC GACGGCGCGC TCTGCGTCTT CCCGCCCGGC
TTCACCGGCC GCGAGGACCA GCCGCTGCCG TTCATGATCC GCAAGAGCGA CGGCGGGTAC
GGCTACGCCA CCACCGACAT GGCCACCATC CACTACCGCG TCCAGGACCT CAAGGTCGAC
CGGATCCTCT ACGTGATCGG GGCGACCCAG GCCCTGCACA TGTCGATGTT GTTCGCCTCC
GCCAGGATGG CCGGCTGGCT GCCCGACCAC GTCAGGGCCG AGCACGTCCA GATCGGCAGC
GTGCTTGGCA GCGACGGCAA GATGTTCAAG ACCCGCAGCG GCGAGTCCAT CAAGCTGCTG
GACCTGCTGG ACGAGGCGGA GACCCGCGCC GCCGCCGTGC TCGCCGGCCG CGACTACGAC
GATGCCGCGC GCGCCGAGAT CGGGCACGCC GTCGGCATGG GGGCGGTCAA GTACGCCGAC
CTGTCGGTCA GCCACGACAG CGAATACGTC TTCGACTTCG ACCGCATGCT CGCCCTGACC
GGCAACACCG GCCCCTACCT CCAGTACGCC ACGGCCCGGA TCCGCTCCAT CTTCCGCAAG
GCGGACGTGG CTCCGGCCAC GGCGACCGGC CCGATCCTGC TCGGCCACCC GGCCGAGCGC
GCCCTGGCCC TGCAGGTGCT CGGGTTCGGC TCCATCGTGG ACGAGGTGGC CGAAGGCTCC
ATGCCGCACA AGCTCGCCGC CTACCTGTTC GAGACCGCGA GCGTCTTCAC CACCTTCTAC
GAGAACTGTC CCGTCCTGAA GGACGACGTC GACCCGGCCA CGCGCGCCTC GCGGCTCGCC
CTGTGCGCGC TCACCCTGCG CGTCCTGGAG ACCGGCCTGG ACCTGCTCGG CGTCCCGGTT
CCCGAGCGCA TGTGA
 
Protein sequence
MTDPQLVLTE RVQQALAAAF GTDYADADPL IRPSQFADYQ ANVAMSLSKR LRRAPREVAQ 
EIADHLSGRA EGTQGDAFPG TVEVSGPGFL NITLSDDWIA EQTAEVMADP RSGVAQATPP
QTVVIDYSAP NAAKEMHVGH LRTTIVGDAL ARVHEHLGNK VIRQNHLGDW GTPFGMLIEH
LLDIGEATAV AQLEAGEGNA YYQAARAKFD TDPEFNQRAR TRVVTLQAEE PETMRLWHIF
MDATVRYFNK VYSQLGVTLT DDDIAGESMY NHMLARVCDE LQERGIAVVS DGALCVFPPG
FTGREDQPLP FMIRKSDGGY GYATTDMATI HYRVQDLKVD RILYVIGATQ ALHMSMLFAS
ARMAGWLPDH VRAEHVQIGS VLGSDGKMFK TRSGESIKLL DLLDEAETRA AAVLAGRDYD
DAARAEIGHA VGMGAVKYAD LSVSHDSEYV FDFDRMLALT GNTGPYLQYA TARIRSIFRK
ADVAPATATG PILLGHPAER ALALQVLGFG SIVDEVAEGS MPHKLAAYLF ETASVFTTFY
ENCPVLKDDV DPATRASRLA LCALTLRVLE TGLDLLGVPV PERM