Gene Sros_6037 details

Gene Information

Locus tag: Sros_6037
Symbol:
ID: 8669331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6615463
End bp: 6616650
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Phosphoglycerate kinase
Protein accession: YP_003341513
Protein GI: 271967317
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.873832
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGATC TGTCTGAGCT CGATGTGAAG GGTCGGCGCG TCCTCGTCCG CGCCGACCTC 
AACGTCCCCC TCGACGGGGA CGTCATCACC GACGACGGAC GCATCCGGGC CTCGGTGCCG
ACGATCAGGG AGCTCGCCGA CAAGGGCGCG CGCGTCGTCG TCTGCGCCCA CCTGGGGCGT
CCGAAGGGCG AGCCGAACCC GAAATACTCG CTGGCCCCCG TGGCGAGGCG GCTGGGCGAG
CTGCTGGGCG CCGAGGTCGC CTTCGCGACC GACGTGGTCG GGGAGTCGGC CCAGGGCGTC
GTGGACGCCC TCCAGGACGG CCAGGTGGCC CTGCTGGAGA ACCTGCGGTT CGAGCCGGGC
GAGGAGTCCA AGGACGACGC CCGGCGGGCG GCCTTCGCCG AGAAGCTGGC CGCCCTGGCG
GAGGTCTACG TCGGTGACGG CTTCGGCGCC GTGCACCGCA AGCACGCCAG TGTCTACGAC
GTGCCGCTGC TGCTGTCGCA CGCGGCGGGC AGGCTGGTCA CGGCCGAGGT CGAGGTGCTC
AAGAAGCTGA CCGACGACCT CGCCAGGCCG TACGCCGTGG TGCTGGGCGG AGCCAAGGTC
TCCGACAAGC TCGGCGTGAT CGGCAACCTG CTCACCAAGG TCGACCGGCT GCTCATCGGC
GGCGGCATGG CCTACACCTT CCTGGCCGCC CAGGGCTACG AGGTGGGCCA GTCGCTGCTG
CAGAAGGACC AGCTCGACCA GGTGCGCGGC TTCCTCAACG AGGCGGCCAA GCGCGGCGTG
GAGCTCGTCC TGCCGGTCGA CGTGCTGGCG GCCACCGAGT TCGCCGAGGA CGCCGAGTAC
GAGGTGGTCG ACGCCACCGC GATCCCGGCC GATCGGCAGG GGCTCGACAT CGGCCCGCGC
AGCCGCGAGC TGTTCGCGAG CAAGCTGGCC GACGCCAGGA CCGTGTTCTG GAACGGCCCG
ATGGGCGTCT TCGAGTTCGA GGCGTTCTCC GGCGGAACCC GGGCCGTCGC CGAGGCGTTG
GTCCAGTCGG AGGCCTTCAC CGTCGTCGGC GGCGGTGACT CGGCCGCGGC CGTGCGCAAG
CTCGGCCTCC CCGAGGACGG GTTCTCGCAC ATCTCCACCG GTGGCGGCGC CAGCCTCGAA
TACCTGGAGG GCAAGACCCT GCCCGGACTC GTCGCGCTGG AGGCATAG
 
Protein sequence
MKDLSELDVK GRRVLVRADL NVPLDGDVIT DDGRIRASVP TIRELADKGA RVVVCAHLGR 
PKGEPNPKYS LAPVARRLGE LLGAEVAFAT DVVGESAQGV VDALQDGQVA LLENLRFEPG
EESKDDARRA AFAEKLAALA EVYVGDGFGA VHRKHASVYD VPLLLSHAAG RLVTAEVEVL
KKLTDDLARP YAVVLGGAKV SDKLGVIGNL LTKVDRLLIG GGMAYTFLAA QGYEVGQSLL
QKDQLDQVRG FLNEAAKRGV ELVLPVDVLA ATEFAEDAEY EVVDATAIPA DRQGLDIGPR
SRELFASKLA DARTVFWNGP MGVFEFEAFS GGTRAVAEAL VQSEAFTVVG GGDSAAAVRK
LGLPEDGFSH ISTGGGASLE YLEGKTLPGL VALEA