Gene Sros_8238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8238 
Symbol 
ID8671566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp9091770 
End bp9093248 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content76% 
IMG OID 
Productcarbohydrate kinase, FGGY 
Protein accessionYP_003343630 
Protein GI271969434 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.758383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTCCA AGAAATTCAC AGGTGAACCC GCGTGGGCCG GGATGGACAT CGGCACCCAG 
GGAGTGCGGG TGACCGTGGT CGACGGCCGT GGCACGCCGC TGGCGACGGG GTCCGCCCCG
CTGCGCAGCC GGCGCGGCGA GGGACGCGGA CACGAGCAGG ACCCCGAGGA CTGGTGGCGG
GCCGCGGTCA CCGCGTGCCG GCAGGCCACC GCGGGCCTCG CCCCCGGGCG GATCCGGGCG
CTGGCGGTGT GCAGCACCTC CGGCACCATC GTGCTCGCCG ACGCCGCGGC CAGGCCCGTC
TCGCCGGCAC TCATGTACGA CGACGACCGG GGCCGCCCCT ACGTGGAACG GGTGACCGCC
GCCGGGCAGG AGCTCTGGGA CGACCTGGCC GTGCGGATCC AGGCCTCGTG GGCGCTGCCC
AAGCTGCTCT GGCTCGCCGA GAACGGCGGG CTGCCCGCCG GCGCGCGACT GCTGCACTCG
GCCGACCTGG TGACCGCGCG CCTGGCCGGC GAGCTGACCG CGACCGACTC CAGCCACGCG
CTGAAGACCG GCTACGACTC GCTCAGGGAG CGCTGGCCCC TGGAGGTGAT GTCCAGGCTC
GGCCTGCCCT CCGACCTGTT CGGCCCCGTC ACGCGGCCGG GCAGCCCGAT CGGCGAGGTG
GGGGCCGAGG CCGCCGCCCT CACCGGGCTG CCCGAGGGAT GCGGGATCGT CGCGGGCATG
ACCGACGGCT GCGCGGGCCA GATCGCCGCC GGCGCGCTGA CACCCGGCAG CGGCGTATCC
GTGCTGGGCA CCACGCTGGT GCTCAAGGGG GTGAGCGAGA AGCTCCTGCG CGACCCCGCC
GGCGCCGTCT ACAGCCACCG CCACCCCGGC GGCGGCTGGC TGCCCGGCGG CGCGTCAAAC
GTCGGCGCCG GCGTGCTGGC CTCGGCGTTC CCCGGCCGCG ACCTGGCCGC CCTCGACGAG
GCCGCCGCCC GGTACGAGCC CGCGGGGGCC GTCGTCTACC CGCTCACCAC CCGGGGTGAG
CGCTTCCCCT TCTACCGGCC GGACGCCGAG CAGGTCGTGG TCGGCGAGAC CGCGACGGAG
GCCGAGCACT ACGCGGCGCT GCTGCAAGGC GTCGCGTACG GGGAACGGCT GGGCCTGGCC
TCCCTCGCGC TGCTCGGCGC GCCCGTCTCC GAGCACCTCG CGCTGGTCGG CGGGGCCACG
CGCAGCCGCT ACTGGTGCCA GCTCCGCGCC GACGTCCTGG GCATGCCGGT CCGGATCCCC
CGCCACCCCG AGGCCTCGGT CGGCATGGCG GTGCTCGCCG CCTCGTCCGG GCGCTCGCTC
GACGACACCG CCCGGGAGAT GGTCTCCCGG GGCGAACAGC TCGATCCCCG CCCGGACCGC
GTGGCACGGT TCGAGCAGAG CTTCCGCCTG CTCGTCGACG CGCTCGCCGA GCGGGGCTAC
CTCACCACCG AACTCGCAGC GAAGGCGGTC ACATCATGA
 
Protein sequence
MVSKKFTGEP AWAGMDIGTQ GVRVTVVDGR GTPLATGSAP LRSRRGEGRG HEQDPEDWWR 
AAVTACRQAT AGLAPGRIRA LAVCSTSGTI VLADAAARPV SPALMYDDDR GRPYVERVTA
AGQELWDDLA VRIQASWALP KLLWLAENGG LPAGARLLHS ADLVTARLAG ELTATDSSHA
LKTGYDSLRE RWPLEVMSRL GLPSDLFGPV TRPGSPIGEV GAEAAALTGL PEGCGIVAGM
TDGCAGQIAA GALTPGSGVS VLGTTLVLKG VSEKLLRDPA GAVYSHRHPG GGWLPGGASN
VGAGVLASAF PGRDLAALDE AAARYEPAGA VVYPLTTRGE RFPFYRPDAE QVVVGETATE
AEHYAALLQG VAYGERLGLA SLALLGAPVS EHLALVGGAT RSRYWCQLRA DVLGMPVRIP
RHPEASVGMA VLAASSGRSL DDTAREMVSR GEQLDPRPDR VARFEQSFRL LVDALAERGY
LTTELAAKAV TS