Gene Sros_1413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1413 
Symbol 
ID8664688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1473423 
End bp1475162 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content72% 
IMG OID 
ProductSignal transduction histidine kinase-like protein 
Protein accessionYP_003337150 
Protein GI271962954 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.119595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0381206 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTCCT CGCCGAAGAA GCGCCGCCGT ACCCCGCGCC AGGTGGTGCG GGGCGTCCGC 
CGGCGGGCCC GTCGCGCCGC GGGAGGCGTG CGCCGGGTCT GGCGGCGCTC CCTGCAGCTC
CGCGTCGTCA CCAGCACGAT GGTCATCTCC ATCGCGGTGG TGGCCGTGCT CGGCGTCTTC
CTCATGCAGC AGATCATCGC GACCACGCTC CAGGACCGGG AGCGCGCGGC CAGGAGCGAG
GCGCAGGCCG ACCGGAACGC GGTGCTCGGC TACCTGAACC GGCCGTCCGG CGACGTCTCC
AAGCCGTCCG TCGATTCGGG CGACGGGCTC CAGCGGGGTG GCGACAGCCC GCTCAGCCAG
GCGGTCTACG CCCTCGCCGG GCGCGCGGGG GCCGTCCCCC GCTACTCGGT GATCGCCCGC
AACGACAACC AGCCGGGGGA GTTCTTCGTC ACCCCGAACA TCGATCCGCG CAGCATCCCG
TCCGGGCTGC GCGAGGAGGT GTCCAAAAAG GAGGCGGGCG AGGACAGCAC CCACTACGGG
GACCTCTACT ACGAGGGCAG GCGCGAGCCG GTGCGGGGCA TGGTGATCGG CACCCGCCTC
GACACCTCCG GGACCATGCT GGACAGCGGA TACGAGATCT ACCACCTCAT CCCGCTGGAC
AAGGAGGAGG AGACCCTCAA CTCGGTCCTG CGGATGCTCG TCGCCGTCGG GGCCGCCCTG
GTGCTGCTGC TGGCGGCCAT CGCCTCGCTG GTCACCCGCC AGGTGGTCAC GCCGGTACGG
CTGACGCGCC AGGCCGCCGA GCGGCTGGCC GCCGGACGCC TGGACGAGCG GCTGAAGGTG
CGCGGCGAGG ACGACCTGGC CCGCCTGGCC ACCTCCTTCA ACGACATGGC CGCCAACCTG
GCGTTGAAGA TCCACCAGTT GGAGGAGCTC TCCCACGTCC AGCGGCAGTT CGTCTCCGAC
GTCTCGCACG AGCTGCGCAC CCCGCTGACC ACCGTGCGGA TGGCCGCCGA CCTGCTCTAC
GACGCCCGCG AGGACTTCGA CCCGATGGCC GCCCGCTCGG CCGAGCTGAT GCAGAACCAG
CTCAACCGGT TCGAGTCGAT GCTCGCCGAC CTGCTGGAGA TCAGCCGCTA CGACGCGGGC
GCGGCCGAGC TGGACGTCGA TCCGGTGGAC GTCAGGGACG TGGTGCTGCG CGCCGTCGCC
GACTCCGAGG CACTGGCCGA GCGCCACTCG ACCCGGTTCG ACCTGCGCCT GCCGGGTGAG
CCCTGCATGG CGGAGATGGA CAGCCGCCGG GTCGAGCGGA TCCTGCGCAA CCTGCTGTTC
AACGCGATCG AGCACGGCGA GGGCCGCGAC ATCGTCGTCT CGGTGGGGGC CGACCGCGAC
GCGGTGGCGG TGGCCGTACG GGACCACGGG GTGGGTCTCA AGCCGGGTGA GGAGAACATG
GTCTTCGACC GGTTCTGGCG GGCCGACCCG TCACGCGCGC GGACGATCGG CGGCACCGGC
CTGGGCCTGG CGATCTCCCG CGAGGACGCC GTGCTGCACG GCGGCTGGCT CCAGGCCTGG
GGGGCGCAGG GCGAGGGGTC GCAGTTCCGG CTCTCCCTGC CCCGGGTGGC CGGGGCGCCG
CTGCGGGGGT CACCGCTGTC GCTGGTCCCG CCGGAGGTGG AGATGCGGCG GACATGGCGG
GGGCACATGA CCCCGGTGCT CTCACCGGCG GTCGCCGACG GGGGAAACGA TGCGGACTAG
 
Protein sequence
MPSSPKKRRR TPRQVVRGVR RRARRAAGGV RRVWRRSLQL RVVTSTMVIS IAVVAVLGVF 
LMQQIIATTL QDRERAARSE AQADRNAVLG YLNRPSGDVS KPSVDSGDGL QRGGDSPLSQ
AVYALAGRAG AVPRYSVIAR NDNQPGEFFV TPNIDPRSIP SGLREEVSKK EAGEDSTHYG
DLYYEGRREP VRGMVIGTRL DTSGTMLDSG YEIYHLIPLD KEEETLNSVL RMLVAVGAAL
VLLLAAIASL VTRQVVTPVR LTRQAAERLA AGRLDERLKV RGEDDLARLA TSFNDMAANL
ALKIHQLEEL SHVQRQFVSD VSHELRTPLT TVRMAADLLY DAREDFDPMA ARSAELMQNQ
LNRFESMLAD LLEISRYDAG AAELDVDPVD VRDVVLRAVA DSEALAERHS TRFDLRLPGE
PCMAEMDSRR VERILRNLLF NAIEHGEGRD IVVSVGADRD AVAVAVRDHG VGLKPGEENM
VFDRFWRADP SRARTIGGTG LGLAISREDA VLHGGWLQAW GAQGEGSQFR LSLPRVAGAP
LRGSPLSLVP PEVEMRRTWR GHMTPVLSPA VADGGNDAD