Gene Sros_1334 details

Gene Information

Locus tag: Sros_1334
Symbol:
ID: 8664609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1380464
End bp: 1381741
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Signal transduction histidine kinase-like protein
Protein accession: YP_003337072
Protein GI: 271962876
COG category:
COG ID:
TIGRFAM ID:
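
The gene and protein lengths listed above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database tool.

```python
# Coordinates copied from the record above.
START_BP = 1380464
END_BP = 1381741


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, "bp")                  # 1278 bp
    print(protein_length(length_bp), "aa")  # 425 aa
```

Under these assumptions the results agree with the listed Gene Length (1278 bp) and Protein Length (425 aa).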


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.134624
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.173507
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCGTC GACTGCTCTC CTCCACTCTG CTCGTCGCGG TCATCGGGGT GCTTCTGCTC 
GGCATCCCCC TGGGTGTGGC GGTCAACCGG CTGATCGAGG AAGAGGCGAC CCAGGAGCTC
TCCTCCCAGG CCAAGAGCCT GCTGGGGGAA GTGGAGTACG CCCGGATCCA GGATCAGCCC
ATCGATCCCG AACAGCTCAA GCGCAAATAT CCCGACCGCT ACATCCAGAT CTATGCGAAG
GGGACGCCGC CGCAGGTGAC GACGGTGGGC GACGAGCCCC CCGAGAACCA CAAGATGACC
CAGGACGCCC AGTCGGAGAA CGGGGTGTAC GTCGCGGTCA GCCGAGACAA GACGGAGGTG
GAGCGCGAGG TCCGGGCGTG GCTCCTGCTC ATCCTGGCGC TCGCCGCGGC GGCGCTCGCG
GTCGCGGTCG GGCTCGCCAT CGTGCAGTCG CGCCGTCTGA CGCTCCCGCT CAGCGACCTG
GCGATGATCG CTGAGCGGCT CGGCTCGGGG GACGCCAGGC CGAGCAAGCA CCGTTACGGC
ATCCAGGAGC TCGACCGGGT GGCCGAGGTG CTGGACCGCA GCGCGACCCG GATCTCCGAC
CTGCTGGCCA GGGAGCGGGA GTTCGCGACC GACGCCTCGC ACCAGCTCCG CACGCCGCTC
ACCGGGCTGA CCATGCGGCT GGAGGAGATC GTGGCGGCCG CGCACGAGCC CGGCATCGTC
AAGGAGGAGG GCGAGGCCGC CATCGTGCAG GCCGAGCGGC TCACCGCCGT GATCGACGAG
CTGCTGGCCG CCGCCAGACG GCAGCGGCAC GCCCAGACCG AGGTGGTCGA GCTCGACGAC
CTGCTGGACC AGCAGTTCAT CGAATGGGGT CCGGTGTTCC GCCGGGGCGG GCGGCAGCTC
AAGCTGTCCG GCACGCGCGG TCTCCAGGCG GTGGGCACCA GCGGCGGCAT CAGCCAGGTG
ATCTCCACTC TCCTGGAGAA CTCGCTGGAG CACGGCGACG GCACGGTGAC GGTGACCACC
AGCGACAAGG ACAGGTCCGT CCTCGTCGAG GTCGCGGACG AGGGAGAGGG CATCCCCGAA
GACCTCGCCC CCCGGGTCTT CGAGCGCAAC GTGAGCGGCG CGGGGGGCAC CGGGCTGGGG
CTGACCCTCG CGCGGGCGCT GGCCGCCGCC GACGGCGGAC GCCTGGAGTT GGTACGGCCG
CGCCCCGCGG CCTTCGCGCT GTTCCTCCGG CAGGTCGGCG ACCCCGGCAG AAAACGGGTG
GTCAGCGGGC CGGCATGA
 
Protein sequence
MRRRLLSSTL LVAVIGVLLL GIPLGVAVNR LIEEEATQEL SSQAKSLLGE VEYARIQDQP 
IDPEQLKRKY PDRYIQIYAK GTPPQVTTVG DEPPENHKMT QDAQSENGVY VAVSRDKTEV
EREVRAWLLL ILALAAAALA VAVGLAIVQS RRLTLPLSDL AMIAERLGSG DARPSKHRYG
IQELDRVAEV LDRSATRISD LLAREREFAT DASHQLRTPL TGLTMRLEEI VAAAHEPGIV
KEEGEAAIVQ AERLTAVIDE LLAAARRQRH AQTEVVELDD LLDQQFIEWG PVFRRGGRQL
KLSGTRGLQA VGTSGGISQV ISTLLENSLE HGDGTVTVTT SDKDRSVLVE VADEGEGIPE
DLAPRVFERN VSGAGGTGLG LTLARALAAA DGGRLELVRP RPAAFALFLR QVGDPGRKRV
VSGPA
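
The gene sequence, protein sequence and GC content above can be cross-checked with a short script. This is a minimal sketch, not the method used to build the record. It assumes the sequence blocks are pasted in as strings (the 10-base spacing is stripped) and that codons are read with the standard assignments, which translation table 11 shares with the standard code for internal codons. All names are hypothetical.

```python
# Codon table in NCBI order (first, second, third base each cycling T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First and last 18 bases of the gene sequence above.
    print(translate("ATGCGTCGTC GACTGCTC"))  # MRRRLL
    print(translate("GTCAGCGGGC CGGCATGA"))  # VSGPA
```

Passing the full gene sequence to translate() and comparing the result with the cleaned protein sequence, and passing it to gc_percent() for comparison with the listed GC content, gives a quick consistency check on the record.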