Gene Sros_7942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7942 
Symbol 
ID8671267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8755621 
End bp8758095 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content74% 
IMG OID 
Productprotein kinase 
Protein accessionYP_003343341 
Protein GI271969145 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGG ACACCGCGCA CGTATCCCTT ACCGGCGGCA ACCTTCCTCT CGAACCCAAC 
GCCTTCGTCG GTCGTGAGGG CGATGTGGAG GAGCTTGTCG AGCTGCTGGG CGTCGCCCGG
TTGGTGACGC TGTGCGGTGC CGGCGGCATC GGCAAGAGCA GGCTGGCGGT CCGGGTGGCG
TCCCAGGTGG CGGCGGGGTT CCCCGGCGGC GTCTGGCTGG TGGAGCTGGC GGAGGCGGTC
CGCGGCGACC TGGTCGAGCC CCGCGTGGCC GCCGTGCTCG GGGTGAAGGC CGAGCCGCCA
CGCCCGCTGT CCGACACGCT GATCGACGCG CTCGGTGACC GGAACCTGCT GATGATCATC
GACAACTGCG AGTCGCTCAT CGAGGAGTCG GCCCGGTTCT GCCGGGCGGT GCTCACCGTC
TGCCCGGCCG TACGGATGCT CACGACCAGC CGGGAGCCGC TGCGGGTGGC GGGGGAGACC
GTGTGGCGGG TGCCGCCGCT CTCGCTGCCC CGGACCGGGG TCCACGAGCT CGCCGGCAGC
GAGGCGGTGC GGCTCTTCGT CGACCGCGCC GGCGCGGCCT CGCGCGGTTT CGCCGTCACC
GAGCAGAACG CCGGCGAGAT CGCCAGCCTG TGCGAGGCGC TCGACGGGAT GCCCCTGGCC
ATCGAGCTGG CCGCGGCGCT GTGCAGGGTG CTGACGGTGG AGCAGATCCA CGCCCGGATC
AGGGACCGGT TCCGTCTCCT GTCCGCCGGC GACAGGATGG CCCCGGCCAG GCACCAGACG
CTCCGCGCGA CCGTCGACTG GAGCTACCAG CAGCTCAACG AGCCCGAGCG GATCCTGCTG
CGGAGGCTGG CGGTGTTCAC CGGGGGATGG ACGCTGGACA TGGCCGAGCA GGTCTGCGCG
GGCGACGGCC TGTGGGCGGA CGACGTCCTG CGCGTGCTCT GCGACCTGGT CGACAAGTCC
CTGGTGCTGG CGGACGCGGA GGTCGCGGGC GAGACGCGTT ACCGGATGCT GGAGACGATC
CGCCAGTACG CCGCCGAGCA GCTCGACACC TCCGGCGAGG AGGCCGCGCT GAGGCGGCGG
CACCGGGACC ACATGATCGA GGTCGCCGGC CGCATCCACC AGACGATCGT CCGGCGCCCC
CGGCCGACCT GGGACGAGAT CTGCCCGATG CTGGTCCTGC TGGACGAGCT GCAGGCCAAC
GTGTGGGCGG CGGTGCGCTG GTCGGCGGAG GCGGACGACC CCGAGAGCGC GCTGCGCCTG
CTGGTCCTGC TCCGCTGGCC GACCATCGCC GGCGGCAGGT TCACCCCCGC CGACGAGTGG
CTGGACCGCC TGCTCGCGGC GGATCCGGCG GAGCTGCCGC CGAGCGTGCG CGGCGACGCC
CTGGTGCTGC GCGGTCAGGT GGCCTTCGGG CAGGGCGACC CCGACACCGC GCGGGTCGCC
TGTACGGCCG CGGCCGAGCT GTGCCGGGCG ACGGGCCTGC ACGGACCGCT GAGCGGGGCG
CTCGCCATCC TCGCCTGGGT GGCCATCTAC CAGCGCCGGC CGGAGAGGGC CAGGGCCTTC
CTGGACGAGG CGCTGGAGGC CGTGCGCAGG GTCGACGACC CCTGGCACGA GATGGTGATC
CGCTCCATGG AGGGGACGTT CGCGCTGTCG GAGGGGCGGG CCAAGGAGTC GCAGCGCTCC
TTCGAGGCCG CGCTGGCCAT CGCGCACGAG CTCGACAACC ACTGGGCGGC CCCGGTCGCG
CTCATCGGCC TGGCCCAGGT CGCCTGGCTC CGGGGCGACC TGGCGGAGGC CCGCACCTAC
TACGAGCAGG CACTCGACCT GCTCCAGGAC ATCACCGCCC ACTGGCAGAC CATCACCTGC
CTGGCGGGCC TGGGCCGGAT CGCCCTGGCC CAGGGCGACC TCGCCACCGC GCGCGTCAGG
CTGATCGAGA GCCTGCGTCT GTGCCAGGGG ACGGGCCAGC GGCTCGGTGT GGCCCGGCGG
ATCGAGACGC TCGCGCAGCT CGCGCTGGCC GAGGAGGACG ACCGCCGGGC GGTGCGCCTG
GCGGGGGCCT CGGCCGCGAT CCGGAGCGCG ATGAGCGGCG CCGTCCTGCC CTCCGGTTTC
GGCGCGCGGA TGGAGGAGCT GCTCCAGCCC GCCCGTGCCC GGCTCGGAGA CGCGCTGGTC
GCCCAGCTCT GGGCGCTCGG CAAGGAGATG TCCCAGGACC AGGCGGTCCG CTACGCGCTG
CAGGGTGACG ATCAGTCCGA CGCGCCCCTG ATGGTCGGCA GGCATCCGGT CGCCGCCCCG
CTCACCACGC TCACCGGCAG GGAGTGGGAG ATCGCCGGGC TGATCGCCAG GGGCCTGAGC
AACAAGGCCA TCGCCGACGA GCTGGTGATC AGTCCCGCGA CCGCGGCCCG GCACGTGGCC
AACATCCTGG CCAAGCTCGG CTTCTCCTCA CGGACCCAGG TCGCCACCTG GGTCATCGAG
CAGAACCGCG GCTGA
 
Protein sequence
MTQDTAHVSL TGGNLPLEPN AFVGREGDVE ELVELLGVAR LVTLCGAGGI GKSRLAVRVA 
SQVAAGFPGG VWLVELAEAV RGDLVEPRVA AVLGVKAEPP RPLSDTLIDA LGDRNLLMII
DNCESLIEES ARFCRAVLTV CPAVRMLTTS REPLRVAGET VWRVPPLSLP RTGVHELAGS
EAVRLFVDRA GAASRGFAVT EQNAGEIASL CEALDGMPLA IELAAALCRV LTVEQIHARI
RDRFRLLSAG DRMAPARHQT LRATVDWSYQ QLNEPERILL RRLAVFTGGW TLDMAEQVCA
GDGLWADDVL RVLCDLVDKS LVLADAEVAG ETRYRMLETI RQYAAEQLDT SGEEAALRRR
HRDHMIEVAG RIHQTIVRRP RPTWDEICPM LVLLDELQAN VWAAVRWSAE ADDPESALRL
LVLLRWPTIA GGRFTPADEW LDRLLAADPA ELPPSVRGDA LVLRGQVAFG QGDPDTARVA
CTAAAELCRA TGLHGPLSGA LAILAWVAIY QRRPERARAF LDEALEAVRR VDDPWHEMVI
RSMEGTFALS EGRAKESQRS FEAALAIAHE LDNHWAAPVA LIGLAQVAWL RGDLAEARTY
YEQALDLLQD ITAHWQTITC LAGLGRIALA QGDLATARVR LIESLRLCQG TGQRLGVARR
IETLAQLALA EEDDRRAVRL AGASAAIRSA MSGAVLPSGF GARMEELLQP ARARLGDALV
AQLWALGKEM SQDQAVRYAL QGDDQSDAPL MVGRHPVAAP LTTLTGREWE IAGLIARGLS
NKAIADELVI SPATAARHVA NILAKLGFSS RTQVATWVIE QNRG