Gene Sros_8557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8557 
Symbol 
ID8671891 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9440933 
End bp9442063 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content77% 
IMG OID 
Productglucose kinase 
Protein accessionYP_003343942 
Protein GI271969746 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGAT TAGTGCTTGC TGTCGACATC GGCGGGACGA AGTTCGCCGT CGCCCTGGTG 
GATTCCGACG GGAACGTGCG GACTGCCCGC CGCGCGGCGA CGCCGCCGGG TGGCGACGCG
CGGACCCTGT GGAAGGCGCT CGGCGAGCTG GTCGACTCCC TGCTGGACGG CGCCGCGGCC
GACGGCCTGA TCAACGGCGA CGCCGCTGCC GGCGGCGCGG TCGCCGGTGT CGGAATCGGC
TGCGGCGGCC CGATGACCTG GCCGGAGGGG GCCGTCTCCC CGCTGAACAT GCCGGGCTGG
CGAGGCTTCC CGCTGCGCGC GAGGCTCGCC GAGCGGTTCC CCGGCGTGCC GGTCCGCATC
CACAACGACG CCGTCTGCCT GGCCGTCGCC GAGCACTGGC GGGGGGCCGG GCGGGGCAGC
GCCAACATGC TCGGCATGGT CGTGTCCACG GGGGTGGGCG GCGGGCTGAT CCTGGGCGAC
CGGCTGATCG ACGGCGGCAG CGGCAACGCC GGGCACATCG GGCACATCGT GGTCGACCCC
GGCGGGCCCC CCTGCGGATG CGGCGGCCGG GGCTGCCTGG AGGCGGTCGC CCGCGGTCCG
GGCCTGGCCG CCTGGGCGGT CGAGCAGGGC TGGAACCCGG GCGCCGCCGG CCCGCCCGCC
GCCGCGACCG CACCGCCCGG CGAAGGGCCA CGGACCTCCG GCGGTACGGC CGCGACCTCC
GGTGGGGGGA ACGGCGCCCT CAACGGCGAG CCCGGGGATC CGGGCGCCGG GTCCGCCTAT
GCCGGGTCCG GCTATGTGGA GGCGGCGGTG GCCAGCGGGC GGCAGCTCGC CCTGGACGCG
GAGGCGGGCG ACGAGATCGC CCTCGCCGCC ATGAGCCGTG CCGGCCGGGC CCTGGGCCTG
GCCATCGCCT CGGCCACGAA CCTCTGTGAC CTGGACGTCG TCACCATCGG CGGCGGCCTT
TCCCAGGCCG GTCCGCTGCT GTTCGATCCG CTGGAGGCCA CCCTCCGGGA CCACACCCGG
ATGGAGTTCG CCCGGCGGGT CCGGGTCGTC CCGGCCTCCC TCGGCCAGGA CGCCGGCCTG
GTCGGCGCCG CCGCCCTGAT CCTCGCCACC GACCGCTACT GGACCCACTG A
 
Protein sequence
MSGLVLAVDI GGTKFAVALV DSDGNVRTAR RAATPPGGDA RTLWKALGEL VDSLLDGAAA 
DGLINGDAAA GGAVAGVGIG CGGPMTWPEG AVSPLNMPGW RGFPLRARLA ERFPGVPVRI
HNDAVCLAVA EHWRGAGRGS ANMLGMVVST GVGGGLILGD RLIDGGSGNA GHIGHIVVDP
GGPPCGCGGR GCLEAVARGP GLAAWAVEQG WNPGAAGPPA AATAPPGEGP RTSGGTAATS
GGGNGALNGE PGDPGAGSAY AGSGYVEAAV ASGRQLALDA EAGDEIALAA MSRAGRALGL
AIASATNLCD LDVVTIGGGL SQAGPLLFDP LEATLRDHTR MEFARRVRVV PASLGQDAGL
VGAAALILAT DRYWTH