Gene Sros_4837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4837 
Symbol 
ID8668131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5358650 
End bp5360251 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content68% 
IMG OID 
Productgluconate kinase 
Protein accessionYP_003340399 
Protein GI271966203 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGAGT CTGTCCCACT CATCCGAGCC GGCATCCGAA ACGGGCGCCA CCCGCTCATC 
CCGGCCTGGA TCGCCTGGAT GGCAGACCCG GACGTGGAGG CATCATCGAT GAGAGCATGG
GCGCAGATCA AGGAGACGCA TATCGGCGTC GTGGTTCTCC TCGGAGATCA CGCTTTCAAA
CTGAAGAAAC CGGTGAATTT CGGCTTCGTG GACTTCACCA CCCGGCAGGC GCGGGAGAGG
ATCTGCCATG AGGAGGTGGA GCTGAACCGG CGGCTGGCCC CGGATGTCTA CGAGGGAGTC
GCCGACGTGC TCGGCACGGA TGGGCAGGTC TGTGAGCATC TCGTTGTGAT GCGGCGTATG
CCCGAAGAAC GCCGCCTCGC CGCCATGATC GATTCAGGGA AACCGGTCGA GGAACACCTG
CGGCAGATCG CCCGGATGGT CGCGAGCATG CACGGGCGCT CCCGCCACAG CCCCCAGATC
GACCAGCAGG GAAGCGGGCA GGCGCTACGT TCACGGTGGA GCGCGAGCTT CGATCAGGTC
CGAGCACTTC CCGAGCCGGT CCTCGGCCCC GAGGTCGTCG GGGAGATCGA ACGGCTCACG
CTGCGCTTCC TCGACGGTCG CGGTCCCCTG TTCACCGCAC GTATCGATGA GGGCCGCATC
GTGGACGGCC ATGGTGACCT GCTCGCAGAG GACATCTTCT GCCTGGACGA CGGGCCACGG
ATTTTGGACT GCCTGGAGTT CGACGAACGG CTGCGCTTTG TCGACGGCCT TGACGACGTG
GCTTTCCTGG CCATGGACTT GGAACGGCTG GGGGCGCCGC GGCTGGCGGA GGTGTTCTTG
CACCAGTACA CCGAGTTCAC CGGCGACCCC GCGCCGCCCT CGCTGTGGCA TCACTATGTC
GCCTACCGGG CTTTCGTCCG CGCCAAGGTC GCCTGCTTGC GGCGCGGGCA GGGAGACTCC
GGTGCCGCCT GGGAGGCTCG CCGGTTCGCC GATCTCACAC TGCGCCACCT GCAGGCAGGC
ACGGTACCTC TGATTCTCGT CGGCGGTGCC CCGGGAGCCG GCAAATCGAC CCTGGCCGCT
GCCCTCGCCG ACCGTCTCGG TTACACGGTG CTGAACAGCG ACCGCGTCCG CAAGGAAATG
GCCGGCATCT CGCCTGACCA GTCCGCCTCC GCTCCCTTCG GCGAAGGCAT CTACGACCCT
GAACACACCG AACGCACCTA CGACGAGTTG CTGTCGCGAG CCGGGAAGCT GCTTGAGCGC
GGAGAGCCGG TCATCCTCGA CGCCTCATGG GGCGGCGCCG GGCACCGGGC AGCGGCCGAT
CGCGTCGCCC AGCGCACCTC CAGCGACCTG GTGGCCTTGC GTTGCACCGC TCTGCCGCAG
GTCGCCGCCG AGCGCCTCGC CCGCCGTACC GGTGCCGTCT CAGACGCCGA TCAGGCGATC
GGCGCGGCCG TGGCGGCGAG GATGGCCCCT TGGCCCGACG CGGTCGAGAT CGACACCAGC
GCATCTCCGG AACAGGCGCT GGAGCGGGCG TTGGCGGTGA TCAACGCTTC TCCGGCCCCT
GTTTCCTGGC GGTTCCGGCG CCCTCAGATG GAACCCGACT GA
 
Protein sequence
MGESVPLIRA GIRNGRHPLI PAWIAWMADP DVEASSMRAW AQIKETHIGV VVLLGDHAFK 
LKKPVNFGFV DFTTRQARER ICHEEVELNR RLAPDVYEGV ADVLGTDGQV CEHLVVMRRM
PEERRLAAMI DSGKPVEEHL RQIARMVASM HGRSRHSPQI DQQGSGQALR SRWSASFDQV
RALPEPVLGP EVVGEIERLT LRFLDGRGPL FTARIDEGRI VDGHGDLLAE DIFCLDDGPR
ILDCLEFDER LRFVDGLDDV AFLAMDLERL GAPRLAEVFL HQYTEFTGDP APPSLWHHYV
AYRAFVRAKV ACLRRGQGDS GAAWEARRFA DLTLRHLQAG TVPLILVGGA PGAGKSTLAA
ALADRLGYTV LNSDRVRKEM AGISPDQSAS APFGEGIYDP EHTERTYDEL LSRAGKLLER
GEPVILDASW GGAGHRAAAD RVAQRTSSDL VALRCTALPQ VAAERLARRT GAVSDADQAI
GAAVAARMAP WPDAVEIDTS ASPEQALERA LAVINASPAP VSWRFRRPQM EPD