Gene Sros_4838 details

Gene Information

Locus tag: Sros_4838
Symbol:
ID: 8668132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5360404
End bp: 5362104
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: Signal transduction histidine kinase-like protein
Protein accession: YP_003340400
Protein GI: 271966204
COG category:
COG ID:
TIGRFAM ID:
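
Note on consistency: the listed lengths follow from the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the last codon is taken to be a stop codon. Both readings are assumptions, since this record does not state them. A minimal sketch of the arithmetic (the function names are illustrative, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(5360404, 5362104)
print(length, "bp")                      # 1701 bp
print(protein_length_aa(length), "aa")   # 566 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.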


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAGG TCGAGCCCGG CGCATTGCTG CCGAGCATGC GGCTGGATGA GCTCCTGTCG 
GAGCTGCAGA TGCGGCTGGA GGCGGTGCTG GCCACCCGGG ACCGGGTGCA CGCACTGTTG
AACGCGGTCG TGGTGGTGGG CAGCGATCTG AATCTGGAGA CGGTGCTGCG CCGGATCGTG
GAGACCGCCA CCATGTTGGT CGATGCCACC TACGGGGCGC TGGGAGCGGT GGGGGAGCAC
AACACGCTGG TGCAGTTCAT CCCGGTGGGG CTGAGCGAGC AGGAGATCGC CCGGATCGAG
CACTGGCCGC ACGGCCTGGG CCTGCTGGGT TTGCTGATCA AGGAGCCGCG GCCGCTGCGG
CTGGCCCACA TCAGCGATCA CCCTGCGTCA TACGGGTTCC CGCCGGGGCA TCCGCCGATG
GGGGCATTTC TGGGGGTGCC GATCCGGGTG CGGGAGGAGG CCTTCGGCAA CCTCTACCTG
ACCGAGAAGC GCGGCGGAGG GGAGTTCGAC GCCGAGGATG AGGCGATCGT CACCGCGCTG
GCCGCGGCGG CGGGCGTGGC CATCGAAAAC GCCCGATTGT ATGCCGACAG CCGCCGCCGG
GAGCGCTGGC TGCAGGCCTC AGCGGAGGTC ACCACCAGCT TGCTGTCGGG GGCCGAGCCG
GGACAGGTGC TCACGTTGAT CGCCCGGCGT GCGCGGGAGA TGGCCGGCGC CGACGTGGTG
GCGGTGCTGT TGCCCGATGA CAGCGGGCGC ATCCTGCAGG CGGTGATCGC CGATGGGCTG
GCCTGTGAGG AGGTGGCCTG TGCGCAGGCG CCGGTCGCCG ACTCTTTGGC GGGCCGAGCG
TTCACCAGCG GTGAGCCGTT GATGGTGGCT GATCCGGCCG AGGCCGAGGT GCCGATCGCG
ATCGCCGACT ACGTCTCGCT GGGACCGGTG GCCGTGGTGC CGATCGGCGC GCCGGGCAGC
GTGCGCGGCG TGCTGTCGCT GGGCAAGCGC TCGGGCCGGC TGCCGTTCAG CCAGGCGGAG
TTGCATACCC TGCACGCCTT CGCCGGGCAG GCCGCGATCG CGTTGGAGCT GGCCGAGAGC
CGGATGGACG CCGAGCGGCT GGGGCTGCTG GAAGATCGCG ACCGGATCGC CAAGGATCTG
CATGACGTGG TGATCCAAAG ATTGTTCGCC GTGGCGATGA CGCTGATGAG CACGGTGCGG
CTGGTCGACA GACCGGAGGC CTCGGCTCGG CTGCAAACCT CGATCGATGA GCTGGATGCG
ACCATCCGGC AGATCCGCTC GACCATTTTC GCCCTGCAGA TCTCTTCGGA AGACGGTGCG
GAGGGGCTGC GCGCGCAGAT CACAGGACTG GTAGAGGGCG CCCGAGGCCA CCTGGGCTTC
ATGCCGGCTC TGACCATGGA AGGCCGCCTC GACGCCATGG TGCCGGACCA GGTCGCCGAG
CAGTTGCTGG CCGTGCTGAG GGAGGCGCTG TCCAACGTCG TGCGTCACGC CCGGGCCTCC
AAGGTCGAGG TGGCGGTCGA GGCGGGTGAG GACCGGCTCG TCCTCACCGT CATCGACGAC
GGGCTGGGGG TGCCGGAGGG CGGCCGGCGC AGCGGGCTGC GCAATCTCCA GGACCGGGCC
GAACGCCTCG ACGGCTCTTT CACGGTCGAA TCTCGCCCAG GGGGCGGCAC CTGCCTGATG
TGGAGTGTGC CGCTGACCTA A
 
Protein sequence
MAQVEPGALL PSMRLDELLS ELQMRLEAVL ATRDRVHALL NAVVVVGSDL NLETVLRRIV 
ETATMLVDAT YGALGAVGEH NTLVQFIPVG LSEQEIARIE HWPHGLGLLG LLIKEPRPLR
LAHISDHPAS YGFPPGHPPM GAFLGVPIRV REEAFGNLYL TEKRGGGEFD AEDEAIVTAL
AAAAGVAIEN ARLYADSRRR ERWLQASAEV TTSLLSGAEP GQVLTLIARR AREMAGADVV
AVLLPDDSGR ILQAVIADGL ACEEVACAQA PVADSLAGRA FTSGEPLMVA DPAEAEVPIA
IADYVSLGPV AVVPIGAPGS VRGVLSLGKR SGRLPFSQAE LHTLHAFAGQ AAIALELAES
RMDAERLGLL EDRDRIAKDL HDVVIQRLFA VAMTLMSTVR LVDRPEASAR LQTSIDELDA
TIRQIRSTIF ALQISSEDGA EGLRAQITGL VEGARGHLGF MPALTMEGRL DAMVPDQVAE
QLLAVLREAL SNVVRHARAS KVEVAVEAGE DRLVLTVIDD GLGVPEGGRR SGLRNLQDRA
ERLDGSFTVE SRPGGGTCLM WSVPLT
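
Note on checking the two sequences against each other: the protein sequence can be reproduced from the gene sequence by reading it codon by codon. The sketch below is one way to do that in plain Python. It assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code (the two tables differ in which start codons are permitted), and the names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 18 bases and last 15 bases of the gene sequence above.
print(translate("ATGGCACAGG TCGAGCCC"))  # MAQVEP
print(translate("GTGCCGCTGACCTAA"))      # VPLT
```

The two fragments match the start (MAQVEP) and the end (VPLT) of the protein sequence listed above. Passing the full gene sequence to `translate` would perform the same check over the whole record.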