Gene Sros_4673 details

Gene Information

Locus tag: Sros_4673
Symbol:
ID: 8667967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5197494
End bp: 5198693
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Signal transduction histidine kinase-like protein
Protein accession: YP_003340269
Protein GI: 271966073
COG category:
COG ID:
TIGRFAM ID:
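
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5197494
END_BP = 5198693
REPORTED_GENE_LENGTH = 1200      # bp
REPORTED_PROTEIN_LENGTH = 399    # aa


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = protein_length_from_cds(gene_length)

# Compare the derived values with the ones reported in the record.
print("gene length consistent:", gene_length == REPORTED_GENE_LENGTH)
print("protein length consistent:", protein_length == REPORTED_PROTEIN_LENGTH)
```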


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.113165
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.377169
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGTCGG GCGTGAATGA GCCAGGGGTG GATGAGTGGG GCGTCGATGA GTCGGGCGTG 
AAAGGAAAGA GCCGGGGCGC GGCTCTGCGT TACCTCTGCG CCACTGTTCC TGCGCTGTTC
GTCAGCGGGA TCTCGATCCT GTCAGCCGTG AACCGCGAGG TCGCCCCCGA CCTCGCCCAG
ATTGCGCTGG ACCTGGCCCT GTTCGGGGCC GGTCTGGCGC TGATGCGATG GCGACACCGC
TTCCCCTGGC AGGTCGCGCT GGCGACCGCA CTCCTGACCC TCTACTCGAC CACCGCGGCC
GGCCCCGCCT ACGTCGCCTA CGTCTCCCTG TGCACGCATC GCCGTTGGCG CCAGATCGTC
CCGGTCGCTC TCGCCACGTG GCTGTGCCCG GCGGCGCAGA TCCTGTGGTC GGACGCAGAC
AAGCTCAGGG TCGTCTCCGT CACGAGCGTC ACCATGGTGA CCGGCAGCGT GATCGTCGGC
GGACTGACCG TCTTTGGCCT CTACCTGCGC GCGCGGCGCG ACCTGGCCGC CTCGCGGCGA
CGGGCCGCGC TCGAGGCGCA GGCGCACCGC GTCGAGCAGG CCAAACTAGC CGAACGGGTC
AAGATCGCCC ACGAGATGCA TGACGTGCTG GCACACCGGA TCTCTCTGCT GGCCATGCTC
GCGGGCGGCC TGTCGCACCG CACCGACCTC ACCGCCGAGC AGACCCGCGA AACGGCCCAG
GCGATTCAGG AGAACGCACA CCAGTCGCTC AACGAACTGC GCGCCGTACT CGGCACGCTG
CGGCGCGACG GCGGCGTCGA GGACCCGCAG CCGAACCTGG CCGACCTCGA CGCCCTGTTC
GACGAAGTAC GCGTGGCCGG GCAGCAGGTC GAGGTGGCCG ACACTGTCGA CGGGCGCGAG
CTGCTGCCGG CGCAGACAGG GCGGCACGCG TACCGGATCG TGCAGGAGGC GCTGACCAAC
GCGCGCAAGC ACGCGCCGGG CACCCGAGTG AGAGCCGAGC TCGGCGGACG GCCTGGCCAA
GGGTTACGGA TCCGGATGAG CAACCCGGCT CCATACGCCG GATCGTCCAG CCCCGGCTCC
GGCGGGCGGC TGGGCCTGGT CGGGCTGGCC GAGCGTGCCC GGATGGCCGG GGGCACCCTG
AGCCATGCGG TCCAGGACGG ACGCTTCGTC CTGGACGTGC GACTGCCGTG GGAGGCTTGA
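
The gene sequence is listed in blocks of ten bases, so it has to be stripped of spaces and line breaks before any computation. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content reported for this gene. The helper names are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(text.split()).upper()


def gc_fraction(text):
    """Fraction of G and C bases in a (possibly grouped) nucleotide string."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, used here only as sample input.
first_line = "ATGCTGTCGG GCGTGAATGA GCCAGGGGTG GATGAGTGGG GCGTCGATGA GTCGGGCGTG"

print("bases on first line:", len(clean_sequence(first_line)))
print("GC percent of first line:", round(100 * gc_fraction(first_line)))
```

Passing the whole listing instead of `first_line` gives the value for the full gene.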
 
Protein sequence
MLSGVNEPGV DEWGVDESGV KGKSRGAALR YLCATVPALF VSGISILSAV NREVAPDLAQ 
IALDLALFGA GLALMRWRHR FPWQVALATA LLTLYSTTAA GPAYVAYVSL CTHRRWRQIV
PVALATWLCP AAQILWSDAD KLRVVSVTSV TMVTGSVIVG GLTVFGLYLR ARRDLAASRR
RAALEAQAHR VEQAKLAERV KIAHEMHDVL AHRISLLAML AGGLSHRTDL TAEQTRETAQ
AIQENAHQSL NELRAVLGTL RRDGGVEDPQ PNLADLDALF DEVRVAGQQV EVADTVDGRE
LLPAQTGRHA YRIVQEALTN ARKHAPGTRV RAELGGRPGQ GLRIRMSNPA PYAGSSSPGS
GGRLGLVGLA ERARMAGGTL SHAVQDGRFV LDVRLPWEA
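
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. Alternative start codons are not handled, which does not matter here because this gene begins with ATG. The `translate` helper is hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text):
    """Translate a grouped nucleotide listing, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# Start of the gene sequence listing, and the end of its last line.
print(translate("ATGCTGTCGG GCGTGAATGA G"))
print(translate("CTGCCGTG GGAGGCTTGA"))
```

The two fragments correspond to the beginning and end of the listed protein, so their translations can be checked against the first and last residues of the protein sequence.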