Gene Sros_5904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5904 
Symbol 
ID8669198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6473315 
End bp6474514 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content75% 
IMG OID 
ProductHistidine kinase 
Protein accessionYP_003341382 
Protein GI271967186 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0106883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0686301 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCAACA CCCCAGGCGG CCTGCGGGCC TGGCTCCACG GACACCCGCT GATCTCCGAC 
GCCGCCCTGG CGGTGGCGCT TGCGGCCGCC GCCGTGCCGT GCGCCTACGT CACCTCCCTG
TCCGGCTTCG ACGCCCCGGC CGGCGCCGCC GGGCCCGACG CGCTGGGCCT CACCCTGATC
GTGGCCGGCT GCCTGTCCCC GGCGCTGCGC CGGCGTCTGC CGTTCACGAT GCTCTTCCTG
GTCGGCGTCG TGGAGATGAC GCTGGCGTCA CTGGACCGGA GCGGCTCCCT GCTCTGGGTC
GCCGCGCTGG TGCTGGTCTA CACGATCGCC GCCCGTCGCG GCCTGGCGCT CAGCCTGTGC
GCGCTGGTGC TGAGCCTCAC CTACCGCGCC GTCTTCGCGG TGACGGCCGC CGACCCCGGC
GACCGGACGG CGCACCTGTT CGTGGCCCTG CTGACCATGA CGCTGTGGAT CGCCGGGCGC
GGTGTCCGGC TGCGCCGGGC CTACCTCGCC GAGCTGCGCG ACCGGGCCGG GCGGATGGAG
CGGGCCCGGG AGGCCGACAC GCGGGCGGCC AGGGCCGAGG AGCGCTCCCG CATCGCCCGC
GAGCTGCACG ACGTGGTCGC CCACCACGTG AGCGTGATGA CCGTCCAGGC CTCCGCGGCC
CGCCGGGTGC TGGCCACCAA CCCCGACGGC GCCCGTGAGG CGCTGTCGGC GATCGAGGAG
ATGGGCCGGA CCGCGATGGC CGAGATGCGC AACATCGTGG GCGTGCTCAG GACCGACGCG
GCGCCCGCCG AGCGCAACCC CCAGCCGGGG GTGCAGGAGA TCCCCACCCT GGTCGACCAG
ATGCGCGAGG CGGGCCTGCG GACGCAGCTG TGGATCGAGG GCCGGGAGGG CTCGCTGCCG
CCCGGCGTCG ACCTGGCCGT CTACCGGCTG GTCCAGGAGG CGCTGACCAA CAGCCTGCGG
CACGCGGGAC CGCAGGCCCG CGCCTGGGTG ACCGTACGGC AGGAGCCGGG CGAGCTGGCT
GTCCGGGTCG AGGACGACGG TCAGGGCTCC GGCGCCGCCG GACCGGCCGA CGACCGGACC
GGGCACGGGC TGGTCGGCAT CCGCGAGCGT GTGGCCCTCT ATGGTGGGAT CCTGAGGATC
GGCCCGCGTC CGGAGGGCGG GTTCGAGGTC AATGCCCGGT TTCCCCTCAA GGACGTGTGA
 
Protein sequence
MRNTPGGLRA WLHGHPLISD AALAVALAAA AVPCAYVTSL SGFDAPAGAA GPDALGLTLI 
VAGCLSPALR RRLPFTMLFL VGVVEMTLAS LDRSGSLLWV AALVLVYTIA ARRGLALSLC
ALVLSLTYRA VFAVTAADPG DRTAHLFVAL LTMTLWIAGR GVRLRRAYLA ELRDRAGRME
RAREADTRAA RAEERSRIAR ELHDVVAHHV SVMTVQASAA RRVLATNPDG AREALSAIEE
MGRTAMAEMR NIVGVLRTDA APAERNPQPG VQEIPTLVDQ MREAGLRTQL WIEGREGSLP
PGVDLAVYRL VQEALTNSLR HAGPQARAWV TVRQEPGELA VRVEDDGQGS GAAGPADDRT
GHGLVGIRER VALYGGILRI GPRPEGGFEV NARFPLKDV