Gene Sros_1821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1821 
Symbol 
ID8665099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1943801 
End bp1946950 
Gene Length3150 bp 
Protein Length1049 aa 
Translation table11 
GC content73% 
IMG OID 
ProductSignal transduction histidine kinase-like protein 
Protein accessionYP_003337553 
Protein GI271963357 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.284128 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGACAG CGTCAATCCC CAGCGAACGC GAATCCGGGG CGGCATCCCC CCCGGCGTTC 
GACGCCCAGG GGGTGCCGCC GGAAGGCGGC CAGGCTCCGG ACCTCGACGG TCGTGGACCC
GAGACGAACG GCAGCAAGCT CGCGCTCAAG AACTGGCGCG TGCGGACGCG GCTGATCGCG
CTCATCGTCA TCCCCACCGC CGCCGCGATC ATCCTGGGCG GCCTGCGGGT GACCACGTCG
ATCAGCACCG CCGCCGAATA CGAGCGGGTG CGGACCAGCG CCGAGCTGGT CGCCGAGCTG
AGCGACCTGG CGCACAACCT GGAGGCCGAG CGCGACCTGT CGGCCCGGTT CGTCGCCCAG
GGCCGCGGCA GCACCGGCAA GGCCAGGCTG CAGGAGCAGT ACCGGGCCGT CGACCAGGTG
GCCAAGAAGG CCAGGGACCG CATCGACCTC ATCATCGGCA GCAATGCCGA CGAGGGCTTC
GGCGAGCGGG GCAAGACCGA GCTCGCCCAG ATGCGCAGCC GCATCGACGA GCTTGACAGC
GTCCGCAAGA CCGCGGTCGG CACCCAGCTC CCCGCCCAGC CCACGATCGC CATGTACTCC
CGGACGATCG CCGACCTGCT GGCCCTGCAC GACGAGATCA TCCAGGGTGT CGCCGACCAG
GAGCTCGCCG GCAGCGCGAC CGCGTTCGGC GCGCTGTCCA GGGCCAAGGA ACAGGCCTCC
AGGGAACGGG CGAACCTCGC TATCGCCCTC GCCGACAGGA CCTTCACCTC CGAAGGCCTG
AACGCCATGC TGGCCGCCCG CGCGCAGCGC GACAGCGAGC TCGCGGCCTT CCGCTCCGAC
GCCTCGGTCA CCCAGCGCCA GCTCTACGAC GACACCGTGA GCAGCCAGAA GAAGGACCGC
GCGGAGTCCA TGCGCGCCCG CGCCCTCGTG CTGGCCGTCG AGGGCGCACC GCTCGTCCGC
ATCGACGTCT CCAGGACCGG CGCCGGCGAC CAGACGACCT GGTTCGACGC CTCCTCCGAC
ACCATCGAGC GGATGCGTGC GGTGGAGAAG CGGATCGCCG ACACCCTGAT CACGCAGAGC
CGCGTCCTCC AGGAGTCCGA GCAGCAGGGC GCGCTGATCG CCGGTGGGCT GAGCGTGCTG
CTGCTCATCC TCGTCCTCGT CATCACCGCG ATCATGGCGC GGTCGCTGGT CAGGCCGCTG
CGCACGCTGC GCACCGAGGC CCTGTCCATC GCCGGCCAGC GCCTCCCGGA CACCGTGCAG
AGCATGCGCG AGAGCGGCGA GGCCGCGGCC GAGGACATCG CCCCGATCGG GGTGGCCTCC
GACGACGAGA TCGGCCAGGT CGCCCGCGCC TTCGACGAGG TGCACCGCGA GGCCGTACGG
CTGGCCGGCC AGGAGGCGAC GCTGCGGAGC AACGTCAACG CGATGTTCGT CAACCTCTCC
CGGCGCAGCC AGACCCTGGT CGAACGCCAG CTGTCCCTCA TCGAGAGCCT GGAGCAGGGC
GAGCAGGACG AGAGCCGTCT CGGCAGCCTG TTCCGCCTCG ACCACCTGGC CACCCGCATG
CGCCGCAACA GCGAGAACCT CCTGGTCCTC GCCGGCCAGG AGCCCGCGCG CCGGTGGAGC
CAGCCGATCC CCCTGATCGA CGTGGTCCGC GCCTCGCTCT CCGAGGTCGA GAACTACGAG
CGGGTGGACC TGCGGCTCTC CGCCGGTGTG GCCGTGGTCG GCACCTCCGT CAACGACGTC
GTGCACCTGA TCGCCGAGCT GGTGGAGAAC GCCATCTCCT TCTCCCCCCG GGAGACCAAG
GTCGTCGTGT CCAGCAACCG CATCGACGGC GGCGGCGTGA TGGTCTCGGT CACCGACATC
GGCATCGGCA TGACTCCCGA GGAGCTCGGG CAGGCGAACT GGCGGCTGGC CAACCCGCCG
GTGGTGGACG TCTCGGTCTC CCGCCGCATG GGCCTGTTCG TGGTCGGCCG GCTGGCCCTG
CGGCACGGCA TCCGCGTGCA GCTCCGCCAG CAGGACAGCG GCGGCCTGAC CGCCATGGTG
CTGCTCCCCG AGGCCCTGCT CGCCGCCGCC GGCGCCCACC CGGGCGGCAC GGCCGTGCCG
CAGGGCGGCG ACTGGGCCGG GTCGATGAGC CCCATGGACC GGGCGCCCGT GCTGGCCAGC
CCCACCGCGC TCGACCCCGC GCAGCAGGCG TTCGCCTCGT TCGACGCCGC CCACCCCTTC
ACCTCCTTCG ACATGGGGCA GCAGTTCGGC TCCTTCGACG CCGGGCAGTC CTCACCGGGC
GGCGGCTACT TCGGCCAGGC GCCGGTCGAC ACCCCGTGGC CCGGCCACGT GCCGCCACCG
GGAGCCGACT CCGGCTGGCC GAACACCTCC CAGACGGACA CCGGCGTGTG GCCGAACGCG
CCGATGCGCG GCGGCGACTC CGGGGCCTGG CCGAACCCGC CCGCCCGTGA AGGCGACTCC
GGGATGTGGC CGAGCGCGCC GATGAGCGGC GGCGACTCCG GGATATGGCC CACCCCGCCC
TCCCGCGAGG GCGACGGCGG AGCCTGGCCG AACCCGCCCG CCCGCGAGGG CGGGGCCGGG
GGATGGCCGT CCACCGCCGA CTCGGGGCCG TTCGAACGGC GCACCTTCGA GCCGGCCGAC
AGCACCGGTC CGCTGCCCGT GGTCCGCGAC TCCTCGCCCA TGGAAGAGGC GAAGGAGGAG
TTCCTGCCGA TCTTCGCCGC GGTCGAGTCC GACTGGTTCA GGAAGGTCGA ACCCGCGGCG
CCCGTCCAGG ACCTGACCGA GGAGCTCAAG GACGCGGTCT CCCCCCAGCC CGCGCCCGCC
TCCGACGCCT GGTCCTCGCC CGCGGACGCG GGCTGGCAGG CCGCCCAGGC GGCGAGCGAA
CCCTCGCTCG GCGGGATCAC CGGTTCCGGG CTCCCCAAGC GGGTGCCCAA GGCGAACCTG
GTGCCCGGTA CGGCCGCACC CGACCCGGGT GCGGCCCCCC AGACCCCCGT ACTCCGGCCG
ACCGTCTCCC CCGAGGCGGT GCGCAACAGG CTGGCGAGCT TCCAGCAGGG AGTACGGCAG
GGCCGCGCGG CGGCCAGGGG CGAGGCCGGC GACGGGCAGC CGTATCCCGA CTTCGGTCGG
GACGTTGAAG GAAACAAGGA GGACCGGTGA
 
Protein sequence
MRTASIPSER ESGAASPPAF DAQGVPPEGG QAPDLDGRGP ETNGSKLALK NWRVRTRLIA 
LIVIPTAAAI ILGGLRVTTS ISTAAEYERV RTSAELVAEL SDLAHNLEAE RDLSARFVAQ
GRGSTGKARL QEQYRAVDQV AKKARDRIDL IIGSNADEGF GERGKTELAQ MRSRIDELDS
VRKTAVGTQL PAQPTIAMYS RTIADLLALH DEIIQGVADQ ELAGSATAFG ALSRAKEQAS
RERANLAIAL ADRTFTSEGL NAMLAARAQR DSELAAFRSD ASVTQRQLYD DTVSSQKKDR
AESMRARALV LAVEGAPLVR IDVSRTGAGD QTTWFDASSD TIERMRAVEK RIADTLITQS
RVLQESEQQG ALIAGGLSVL LLILVLVITA IMARSLVRPL RTLRTEALSI AGQRLPDTVQ
SMRESGEAAA EDIAPIGVAS DDEIGQVARA FDEVHREAVR LAGQEATLRS NVNAMFVNLS
RRSQTLVERQ LSLIESLEQG EQDESRLGSL FRLDHLATRM RRNSENLLVL AGQEPARRWS
QPIPLIDVVR ASLSEVENYE RVDLRLSAGV AVVGTSVNDV VHLIAELVEN AISFSPRETK
VVVSSNRIDG GGVMVSVTDI GIGMTPEELG QANWRLANPP VVDVSVSRRM GLFVVGRLAL
RHGIRVQLRQ QDSGGLTAMV LLPEALLAAA GAHPGGTAVP QGGDWAGSMS PMDRAPVLAS
PTALDPAQQA FASFDAAHPF TSFDMGQQFG SFDAGQSSPG GGYFGQAPVD TPWPGHVPPP
GADSGWPNTS QTDTGVWPNA PMRGGDSGAW PNPPAREGDS GMWPSAPMSG GDSGIWPTPP
SREGDGGAWP NPPAREGGAG GWPSTADSGP FERRTFEPAD STGPLPVVRD SSPMEEAKEE
FLPIFAAVES DWFRKVEPAA PVQDLTEELK DAVSPQPAPA SDAWSSPADA GWQAAQAASE
PSLGGITGSG LPKRVPKANL VPGTAAPDPG AAPQTPVLRP TVSPEAVRNR LASFQQGVRQ
GRAAARGEAG DGQPYPDFGR DVEGNKEDR