Gene Sros_1652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1652 
Symbol 
ID8664929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1767946 
End bp1769265 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content74% 
IMG OID 
ProductHistidine kinase 
Protein accessionYP_003337386 
Protein GI271963190 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.527879 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.680022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCG AGACGGCAGT GGGCGGACCG GTGGTGGGCA TGCCGCCGCG GCGGCGCGTT 
CCGCTGGGGC CCAACGGGCC GCTGGGCATG CTCGTCGACC CGATGACCTG GCGTGCCGTG
CCGTACATGC TGGTGAGCGT CTTCTACGGC GCGGTCTGCT CCGCCTTCAT GGCGTCCGCC
ATCCCGATCG CGCTCAGCCT GGTGATCGTG TGGGCCGGAC TGCCGCTGCT CGCGCTGACC
ATGTGCGCCT GGCGCGCCGC GGCGATGCTG GAGCGCAGGC TGGTCCGGCT GGCCTTCGGC
GTCACGATCC GCGACCCCTA CCGGCCCTCC CGGGGCGACA ACCTGTTCCT GCGCTGGAAG
GACATGTTCG TCGACCCGGC GACCTGGAAG GACCTCCTCT ACCTCCTGCT GTTGCTGCCG
ATCGGAGTCG TGGAGTTCGT CGTCTCGGTG GCCCTGTGGT GCCTGGGGTT CGGCATGATC
ATCGTGCCCA CCGTCCTGCT GTTCGGCGGA GCCCCGGTGA CGATCACCGA CGGGCTGCTG
GTCGACAGCG TGCCGGAGGC GCTGCTGTGC GTGCCCGTCG GAGTGGGGGT CCTCCTCGTC
GCGCTCTACG CGACGCGGGG GATGGCCTGG CTGCACGCGC TGCTCGCCGT CGCGCTGCTG
GGGGCGGGGG AGAAGAACCT GCTGGCCGCC CGGGCGGCCC ACCTGCGGGC CAGCCGGGCC
CGCGCGGTCG ACGCGGCCGA GGCGGAGCGG CGCAGGATCG AGCGCGACCT GCACGACGGC
GCCCAGCAGC GGCTGCTCTC CGTCGCCATG GACCTGGGCC GGGCCCAGGC CAAGATGGAC
TCCGACCCCC AGGGCGCCCG GGAGCTCCTC GCCCAGGCCC ACGCCGGCGC CAAGGCGGCG
ATCGCCGAGC TGCGCGACCT CGCCAGGGGC ATCCACCCGG CGATCCTCAC CGACCGCGGA
CTGGACGCGG CGCTCTCCTC GCTCGCGGCC CGAGCCCCCG TGCGGGTGGA CCTGTCGGTG
GAGGTCTCCC ACCGCCCCCC GCCCGCGGTG GAGAGCATCG CGTACTTCGT CGTGGCCGAG
TCCCTGACCA ACATGGTCAA GCACGCCGAG GCGACCGAGG TCTCCATCCG GGTCAGCCGC
GAAGGCCAGC GGGTGGTCGT CGAGGTGCAC GACAACGGGG TCGGGGCCGC GGTGCCGCGC
GCCGGAGGGG GCCTCGCGGG GCTGGCGGAC CGGGCCGCGA CCATCGACGG CACCCTGACC
GTGGACAGCC CGCTCGGCGG TCCTACGCTG ATCCGCGCCG AACTGCCCTG CCAATGGTGA
 
Protein sequence
MTAETAVGGP VVGMPPRRRV PLGPNGPLGM LVDPMTWRAV PYMLVSVFYG AVCSAFMASA 
IPIALSLVIV WAGLPLLALT MCAWRAAAML ERRLVRLAFG VTIRDPYRPS RGDNLFLRWK
DMFVDPATWK DLLYLLLLLP IGVVEFVVSV ALWCLGFGMI IVPTVLLFGG APVTITDGLL
VDSVPEALLC VPVGVGVLLV ALYATRGMAW LHALLAVALL GAGEKNLLAA RAAHLRASRA
RAVDAAEAER RRIERDLHDG AQQRLLSVAM DLGRAQAKMD SDPQGARELL AQAHAGAKAA
IAELRDLARG IHPAILTDRG LDAALSSLAA RAPVRVDLSV EVSHRPPPAV ESIAYFVVAE
SLTNMVKHAE ATEVSIRVSR EGQRVVVEVH DNGVGAAVPR AGGGLAGLAD RAATIDGTLT
VDSPLGGPTL IRAELPCQW