Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_1821 |
Symbol | |
ID | 8665099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 1943801 |
End bp | 1946950 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Signal transduction histidine kinase-like protein |
Protein accession | YP_003337553 |
Protein GI | 271963357 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.284128 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGACAG CGTCAATCCC CAGCGAACGC GAATCCGGGG CGGCATCCCC CCCGGCGTTC GACGCCCAGG GGGTGCCGCC GGAAGGCGGC CAGGCTCCGG ACCTCGACGG TCGTGGACCC GAGACGAACG GCAGCAAGCT CGCGCTCAAG AACTGGCGCG TGCGGACGCG GCTGATCGCG CTCATCGTCA TCCCCACCGC CGCCGCGATC ATCCTGGGCG GCCTGCGGGT GACCACGTCG ATCAGCACCG CCGCCGAATA CGAGCGGGTG CGGACCAGCG CCGAGCTGGT CGCCGAGCTG AGCGACCTGG CGCACAACCT GGAGGCCGAG CGCGACCTGT CGGCCCGGTT CGTCGCCCAG GGCCGCGGCA GCACCGGCAA GGCCAGGCTG CAGGAGCAGT ACCGGGCCGT CGACCAGGTG GCCAAGAAGG CCAGGGACCG CATCGACCTC ATCATCGGCA GCAATGCCGA CGAGGGCTTC GGCGAGCGGG GCAAGACCGA GCTCGCCCAG ATGCGCAGCC GCATCGACGA GCTTGACAGC GTCCGCAAGA CCGCGGTCGG CACCCAGCTC CCCGCCCAGC CCACGATCGC CATGTACTCC CGGACGATCG CCGACCTGCT GGCCCTGCAC GACGAGATCA TCCAGGGTGT CGCCGACCAG GAGCTCGCCG GCAGCGCGAC CGCGTTCGGC GCGCTGTCCA GGGCCAAGGA ACAGGCCTCC AGGGAACGGG CGAACCTCGC TATCGCCCTC GCCGACAGGA CCTTCACCTC CGAAGGCCTG AACGCCATGC TGGCCGCCCG CGCGCAGCGC GACAGCGAGC TCGCGGCCTT CCGCTCCGAC GCCTCGGTCA CCCAGCGCCA GCTCTACGAC GACACCGTGA GCAGCCAGAA GAAGGACCGC GCGGAGTCCA TGCGCGCCCG CGCCCTCGTG CTGGCCGTCG AGGGCGCACC GCTCGTCCGC ATCGACGTCT CCAGGACCGG CGCCGGCGAC CAGACGACCT GGTTCGACGC CTCCTCCGAC ACCATCGAGC GGATGCGTGC GGTGGAGAAG CGGATCGCCG ACACCCTGAT CACGCAGAGC CGCGTCCTCC AGGAGTCCGA GCAGCAGGGC GCGCTGATCG CCGGTGGGCT GAGCGTGCTG CTGCTCATCC TCGTCCTCGT CATCACCGCG ATCATGGCGC GGTCGCTGGT CAGGCCGCTG CGCACGCTGC GCACCGAGGC CCTGTCCATC GCCGGCCAGC GCCTCCCGGA CACCGTGCAG AGCATGCGCG AGAGCGGCGA GGCCGCGGCC GAGGACATCG CCCCGATCGG GGTGGCCTCC GACGACGAGA TCGGCCAGGT CGCCCGCGCC TTCGACGAGG TGCACCGCGA GGCCGTACGG CTGGCCGGCC AGGAGGCGAC GCTGCGGAGC AACGTCAACG CGATGTTCGT CAACCTCTCC CGGCGCAGCC AGACCCTGGT CGAACGCCAG CTGTCCCTCA TCGAGAGCCT GGAGCAGGGC GAGCAGGACG AGAGCCGTCT CGGCAGCCTG TTCCGCCTCG ACCACCTGGC CACCCGCATG CGCCGCAACA GCGAGAACCT CCTGGTCCTC GCCGGCCAGG AGCCCGCGCG CCGGTGGAGC CAGCCGATCC CCCTGATCGA CGTGGTCCGC GCCTCGCTCT CCGAGGTCGA GAACTACGAG CGGGTGGACC TGCGGCTCTC CGCCGGTGTG GCCGTGGTCG GCACCTCCGT CAACGACGTC GTGCACCTGA TCGCCGAGCT GGTGGAGAAC GCCATCTCCT TCTCCCCCCG GGAGACCAAG GTCGTCGTGT CCAGCAACCG CATCGACGGC GGCGGCGTGA TGGTCTCGGT CACCGACATC GGCATCGGCA TGACTCCCGA GGAGCTCGGG CAGGCGAACT GGCGGCTGGC CAACCCGCCG GTGGTGGACG TCTCGGTCTC CCGCCGCATG GGCCTGTTCG TGGTCGGCCG GCTGGCCCTG CGGCACGGCA TCCGCGTGCA GCTCCGCCAG CAGGACAGCG GCGGCCTGAC CGCCATGGTG CTGCTCCCCG AGGCCCTGCT CGCCGCCGCC GGCGCCCACC CGGGCGGCAC GGCCGTGCCG CAGGGCGGCG ACTGGGCCGG GTCGATGAGC CCCATGGACC GGGCGCCCGT GCTGGCCAGC CCCACCGCGC TCGACCCCGC GCAGCAGGCG TTCGCCTCGT TCGACGCCGC CCACCCCTTC ACCTCCTTCG ACATGGGGCA GCAGTTCGGC TCCTTCGACG CCGGGCAGTC CTCACCGGGC GGCGGCTACT TCGGCCAGGC GCCGGTCGAC ACCCCGTGGC CCGGCCACGT GCCGCCACCG GGAGCCGACT CCGGCTGGCC GAACACCTCC CAGACGGACA CCGGCGTGTG GCCGAACGCG CCGATGCGCG GCGGCGACTC CGGGGCCTGG CCGAACCCGC CCGCCCGTGA AGGCGACTCC GGGATGTGGC CGAGCGCGCC GATGAGCGGC GGCGACTCCG GGATATGGCC CACCCCGCCC TCCCGCGAGG GCGACGGCGG AGCCTGGCCG AACCCGCCCG CCCGCGAGGG CGGGGCCGGG GGATGGCCGT CCACCGCCGA CTCGGGGCCG TTCGAACGGC GCACCTTCGA GCCGGCCGAC AGCACCGGTC CGCTGCCCGT GGTCCGCGAC TCCTCGCCCA TGGAAGAGGC GAAGGAGGAG TTCCTGCCGA TCTTCGCCGC GGTCGAGTCC GACTGGTTCA GGAAGGTCGA ACCCGCGGCG CCCGTCCAGG ACCTGACCGA GGAGCTCAAG GACGCGGTCT CCCCCCAGCC CGCGCCCGCC TCCGACGCCT GGTCCTCGCC CGCGGACGCG GGCTGGCAGG CCGCCCAGGC GGCGAGCGAA CCCTCGCTCG GCGGGATCAC CGGTTCCGGG CTCCCCAAGC GGGTGCCCAA GGCGAACCTG GTGCCCGGTA CGGCCGCACC CGACCCGGGT GCGGCCCCCC AGACCCCCGT ACTCCGGCCG ACCGTCTCCC CCGAGGCGGT GCGCAACAGG CTGGCGAGCT TCCAGCAGGG AGTACGGCAG GGCCGCGCGG CGGCCAGGGG CGAGGCCGGC GACGGGCAGC CGTATCCCGA CTTCGGTCGG GACGTTGAAG GAAACAAGGA GGACCGGTGA
|
Protein sequence | MRTASIPSER ESGAASPPAF DAQGVPPEGG QAPDLDGRGP ETNGSKLALK NWRVRTRLIA LIVIPTAAAI ILGGLRVTTS ISTAAEYERV RTSAELVAEL SDLAHNLEAE RDLSARFVAQ GRGSTGKARL QEQYRAVDQV AKKARDRIDL IIGSNADEGF GERGKTELAQ MRSRIDELDS VRKTAVGTQL PAQPTIAMYS RTIADLLALH DEIIQGVADQ ELAGSATAFG ALSRAKEQAS RERANLAIAL ADRTFTSEGL NAMLAARAQR DSELAAFRSD ASVTQRQLYD DTVSSQKKDR AESMRARALV LAVEGAPLVR IDVSRTGAGD QTTWFDASSD TIERMRAVEK RIADTLITQS RVLQESEQQG ALIAGGLSVL LLILVLVITA IMARSLVRPL RTLRTEALSI AGQRLPDTVQ SMRESGEAAA EDIAPIGVAS DDEIGQVARA FDEVHREAVR LAGQEATLRS NVNAMFVNLS RRSQTLVERQ LSLIESLEQG EQDESRLGSL FRLDHLATRM RRNSENLLVL AGQEPARRWS QPIPLIDVVR ASLSEVENYE RVDLRLSAGV AVVGTSVNDV VHLIAELVEN AISFSPRETK VVVSSNRIDG GGVMVSVTDI GIGMTPEELG QANWRLANPP VVDVSVSRRM GLFVVGRLAL RHGIRVQLRQ QDSGGLTAMV LLPEALLAAA GAHPGGTAVP QGGDWAGSMS PMDRAPVLAS PTALDPAQQA FASFDAAHPF TSFDMGQQFG SFDAGQSSPG GGYFGQAPVD TPWPGHVPPP GADSGWPNTS QTDTGVWPNA PMRGGDSGAW PNPPAREGDS GMWPSAPMSG GDSGIWPTPP SREGDGGAWP NPPAREGGAG GWPSTADSGP FERRTFEPAD STGPLPVVRD SSPMEEAKEE FLPIFAAVES DWFRKVEPAA PVQDLTEELK DAVSPQPAPA SDAWSSPADA GWQAAQAASE PSLGGITGSG LPKRVPKANL VPGTAAPDPG AAPQTPVLRP TVSPEAVRNR LASFQQGVRQ GRAAARGEAG DGQPYPDFGR DVEGNKEDR
|
| |