Gene Sros_1447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1447 
Symbol 
ID8664722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1513819 
End bp1515894 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content76% 
IMG OID 
ProductHistidine kinase 
Protein accessionYP_003337184 
Protein GI271962988 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.705239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.184804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGACAG CGGCGCGCAT CGCGTGGCCG TGCTGCGCTG TGACCCTGCT CCTGGTGGCC 
CTGTCCGTCG TCCTGGCCGT GCTCGACGGG ACCGATCCCC GGACGCTCAG CCACCTGCTG
TTCGTGGTGG CCTCCGCGCT GGCGGGCGGC CTGATCGGCT CGCGCCGGCC GGAGCATGCG
GTGGGCCGGC TGCTGGCGCT GAGCGGGTTC TGCTTCGCGC TCATGGAGGC CTGCGGCCAG
TACGCCATCT TCGGCCTGGT CACCCGCCCC GGACTGCCGC TCGCCCAGGC CCTGGCGTGG
CCGCAGACCT GGCTCTGGGT GCCCGCGAAC CTGTCGCTCG CCCTCATCCC GCTGATCTTC
CCCAGCGGGC GGCTCTCCTC CCCCCGGCTG CGTCCTCCGG TCCGCGCGGT GGCCGTCGCC
GCGGTGCTGA CCATGGCGGT GAGCGCGCTG CGGCCCGGGG AGAACCACCA GGTCGGCACG
GACACCGGCG TGGTCAACCC GCTCGGGGTC GCGGGGCTGG CCCGGGTGGC GCCGGCGATG
GAGACCGCGA TGACGGCTCT CATGGCGGTG GTCTTCGTCG CCGGAGGGAT CGACCTGCTC
GTCAGGGCCC TGCGGAGCGG CGAGCCGGAA CGCCGCCAGA TCAAGTGGCT GGCCTACGTG
GTCGGGCTCC TGGTGCTCGT CGTCGCGGCC CGGCTGGCGG CCGGGCTGAC CGACGGGTTC
CCGGACACGA TCTGGCCGGT CACCAGCTCC CTGTGGGAGC TGCCCGGCGC CCTGGGCACG
GCCCTCGTGC CCGCCGCCAT CTGCGTCGCG ATCCTGCGCC ACCGGCTCTT CGACATCGAC
CTGGTGATCA ACCGCACGCT GGTCTACGCG CTGCTGTCGG GCTGCGTGAC CGGCGGCTAC
GTGGCCGTGG TGGGCTATCT CGGCGCGATC TTCCCCGGCG GGGACCTGCC GGTCTCCGTG
CTGGCCGCCG GGCTCGTGGC GCTCGTCTTC GCGCCGCTGC GGGAACGCCT GCAGAGCTGG
GTCAACCTGC TGACGTACGG CGAGCGGGAC GACCCCTACG CGGCGCTCAC CCGCCTGGGC
CGCCGGCTGG AGAACACCGG CGAGCCCGGC ACCGTGCTGG CCGGGGTCGC GCGGTCGGTG
GCCGAGGCCC TGCGCCTGCC GTACGCGGCG GTGGAGACGG CCGCGGGCTC GCGGCACGCC
TTCGGCGCGG CCGTCGGCGA TCCGGTACGG CTGCCGCTGA CGCACAGCGG GGAACGGGTC
GGAGACCTCA TCCTGTCGCC CCGGCCGGGA GAGAGCGGGT TCGGCCCCCG CGACATGCGC
GTCCTCACCG ACCTGGCCAG GCAGGTGGCG GTGGCCGTGC ACGCCGTACG GCTCTCCGCC
GACCTGCGGC GCTCGCGCGA GCGGCTGGTG ATGGCGCGCG AGGAGGAACG CCGTCGCCTC
CGGCGCGACC TGCACGACGG GCTCGGCCCC ACGCTTGCCG CGCTGACCAT GCGGGCCGAG
GCCGTGCACG ACCTGGTCGA GGAGGAGCAC GCGCGGCGGC TGCTGGCCGA CATCGTCGGC
GACGCCGAGG CGGCGGTGAC CGACGTGCGG ACCCTGGTGG ACGGGCTGCG CCCGCCCGCG
CTCGACTCCC TCGGGCTGCT CGGCGCGCTC CGCGCCCACG CCACCCGGCA GCCGCCCGGC
CTGCGCGTCA CCGTGCACGC TCCGGACGGC CTGCCGCCGC TGCCGGCGGC CACGGAGGTG
GCCGCCTACC GCATCGCCGC CGAGGCGCTG GCCAACGTGC GCAGGCACGC GGCCGCGACC
GGCGCGGAAC TGCGGGTCGA GGTGGCCCGC GGCACGCTCA GGCTTGAGGT GTCCGACGAC
GGCGGCGGGA TCGGGCCGTC CGGGGGCGGC GGGACGCGGC CGGACGGGCC GGGCTCCGGC
CGTACCGGGG TCGGGCTGGC GTCGATGCGC GAGCGGGCCC TGGAACTGGG CGGAACGTGC
ACGGTCGAGG AACGCCCGGA GGGCGGGACC CTGGTCAGGG TCACGCTGCC GGCGCACGAG
GGAGAGGACG CCTGTGGTCC GGATCTTGCT GGTTGA
 
Protein sequence
MTTAARIAWP CCAVTLLLVA LSVVLAVLDG TDPRTLSHLL FVVASALAGG LIGSRRPEHA 
VGRLLALSGF CFALMEACGQ YAIFGLVTRP GLPLAQALAW PQTWLWVPAN LSLALIPLIF
PSGRLSSPRL RPPVRAVAVA AVLTMAVSAL RPGENHQVGT DTGVVNPLGV AGLARVAPAM
ETAMTALMAV VFVAGGIDLL VRALRSGEPE RRQIKWLAYV VGLLVLVVAA RLAAGLTDGF
PDTIWPVTSS LWELPGALGT ALVPAAICVA ILRHRLFDID LVINRTLVYA LLSGCVTGGY
VAVVGYLGAI FPGGDLPVSV LAAGLVALVF APLRERLQSW VNLLTYGERD DPYAALTRLG
RRLENTGEPG TVLAGVARSV AEALRLPYAA VETAAGSRHA FGAAVGDPVR LPLTHSGERV
GDLILSPRPG ESGFGPRDMR VLTDLARQVA VAVHAVRLSA DLRRSRERLV MAREEERRRL
RRDLHDGLGP TLAALTMRAE AVHDLVEEEH ARRLLADIVG DAEAAVTDVR TLVDGLRPPA
LDSLGLLGAL RAHATRQPPG LRVTVHAPDG LPPLPAATEV AAYRIAAEAL ANVRRHAAAT
GAELRVEVAR GTLRLEVSDD GGGIGPSGGG GTRPDGPGSG RTGVGLASMR ERALELGGTC
TVEERPEGGT LVRVTLPAHE GEDACGPDLA G