Gene Sros_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0101 
Symbol 
ID8663365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp102379 
End bp103797 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content76% 
IMG OID 
ProductSignal transduction histidine kinase-like protein 
Protein accessionYP_003335900 
Protein GI271961704 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.128582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCATC TATCGGTCAG GGTCCGGGCG ACCCTCGCCG CCACCGCCAT CGTCGCGGTC 
GCGCTGGGCG TCGCGGCGGT GGTGCTCGTC GGCGTGCTCA AGGGCAGCCT CGTGGACAGC
GCCTCCGCCG AGGCGACCCG GCGCGCCTAC GGCACCGCCG GCATGATCAC CGCCAGCCCC
GGGCTCATCG CCGGCAGGGT CGCCGAGCCG CTGGACCCCG ACGTCCAGGT CATCGAGAAG
GCCGAGCTCT CCGAGGCGGA GTGGCGAACG GTCTTGGCCC AGCCCGCCCA GCCCGCCCTG
CCGATCACCT CCGGACAGGA CACCCCCGGA ACGGGAGCGA TCCGATCGGA CACCACCCGG
TCGACCGCCA CCCGAGGAGA GACAGCCCGA TCGGACGCCG CCAGGCCGGA CAGCACTGGA
CCGGCCGCCG TACGCGCCGA GCGGTGGGCG CCCGCCGCCT CGTTCACGGT GGCCACGATG
CCCGTGTCCA CCGTCGACGG CGTGGTGCTC GTCCAGGCCA GGGCCTCCCT GGAACCCGCC
GGCGCCGCCC TGCAGACCCT GCAGGGGCTG CTGATCCCCG GCATCCCCGG GCTGCTCCTG
CTGGTCGCGG CCCTGACCTG GCTGGCGGTC GGCCGCGCGC TCGCACCGGT CTCGGCCATC
CGCACCGAGA TGGCCGACAT CACCGCCAGT GATCTGCACC GCCGGGTCCC GGTGCCGCGG
TCCCGCGACG AGATCGCCCG CCTCGCCGAG ACGATGAACC GCACGCTCGA CCGCCTGGAA
CTCGCCGTCG ACCGGCACAA GCGCTTCGTC GCCGACGCCG CCCACGAGCT GCGCAGCCCG
CTGGCCATCC TGAGGACCCG CCTGGAGCTC GCCCCGCCCG GACCGCTGGC GGCCGAGGCG
CTGACGGACG TGGAGCGGAT CCAGGCGCTC ACCTCCGACC TGCTGCTGCT GGCCCGCCTG
GACGCCGGTG AGCCCGCCTG TCACGGGGAG GTGGACCTCG GACAGGTCGC CGCCGAGGAG
GCGACCCGGG CCCGGCCCAG GCCGGAGATC CGCGTGGAGC TGGAGGTGGC CGCCGACGTG
GTGGTCCGCG GATCGGCCGA GGAGCTGCGC CGCCTGGTCG CCAACCTGGT GGACAACGCC
GTACGGCACG CGGACTCGAC GGTCACCGTC CGCCTGGCCC GGGACGGGGG CGGGGCCGTA
CTCGACGTGC GCGACGACGG GCCGGGGATC CCGGCCGAGC ACCGTGAGGC GGTCTTCGAC
CGGTTCACCC GGCTGGACGA GGCCCGGGGC CGGGACGCGG GCGGGTCGGG GCTCGGCCTC
GCCATCGCCC GGGACATCGC GGTACGGCAC GGCGGCGGCC TGAGTGTCGT CGGGGGAGGT
CCGGGAGCGC GGCTGCGGAC CCGTCTTCCC GCGCCGTGA
 
Protein sequence
MSHLSVRVRA TLAATAIVAV ALGVAAVVLV GVLKGSLVDS ASAEATRRAY GTAGMITASP 
GLIAGRVAEP LDPDVQVIEK AELSEAEWRT VLAQPAQPAL PITSGQDTPG TGAIRSDTTR
STATRGETAR SDAARPDSTG PAAVRAERWA PAASFTVATM PVSTVDGVVL VQARASLEPA
GAALQTLQGL LIPGIPGLLL LVAALTWLAV GRALAPVSAI RTEMADITAS DLHRRVPVPR
SRDEIARLAE TMNRTLDRLE LAVDRHKRFV ADAAHELRSP LAILRTRLEL APPGPLAAEA
LTDVERIQAL TSDLLLLARL DAGEPACHGE VDLGQVAAEE ATRARPRPEI RVELEVAADV
VVRGSAEELR RLVANLVDNA VRHADSTVTV RLARDGGGAV LDVRDDGPGI PAEHREAVFD
RFTRLDEARG RDAGGSGLGL AIARDIAVRH GGGLSVVGGG PGARLRTRLP AP