Gene Sros_3869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3869 
Symbol 
ID8667159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4306338 
End bp4307960 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content74% 
IMG OID 
Productsignal transduction histidine kinase regulating citrate/malate metabolism 
Protein accessionYP_003339529 
Protein GI271965333 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0780654 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0130314 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCAAG TCACCATTGC GTACATTTTG GTCGTGGACA TGAGGACGCT GCCGGGGCGG 
ATGCGCCGCT GGAGCCTGGC CCGGCAGATG CTGGCCTTGC AGATCGCCGT CGTCGGGGTC
ACCGTGACGG GAGGCGCGGC GCTGGCCTTC CTCCAGGCCC GCGACCTGCT CGTGGAGGAG
GCGGCGGGGA AGTCCAGGGC GGTGACCGTC AGCGTGGCCG ACTCCCCCTC GGTCCTGGCC
TCCCTCGGCG ACTCCGTCGC GCTCCAGCCG TACGCCGAAC GGGTCCGGCG GGAGACCGGG
GTGGACTTCG TCACGATCAT GAGCACCGGC GGGCTCCGCT ACACCCATCC CAACCCGGCC
CAGATCGGCC GCCGGTTCCT GGGCACCACC GCGCCCGCCC TGGAGGGCCG GACCTTCACC
GAGACCTACA CCGGAACCCT CGGCTCCTCG GTGCGCGCCG TCGCCCCGGT GCGCGACGCC
GCCGGGAGGG TGCGGGCCCT GGTCAGCGCG GGCATCACGG TCGAGAGGAT CGGCGTCAGG
CTCCGCGGGC AGATCACCGG CGCGGTGGTC ATCGGCCTGC TCGGCCTGTC CACCGGCGGC
GCGGGCACCT ATCTGGTCGG CAGCCGGCTG CGCCGCCAGA CCCGCGGGAT GGGCCCCGCC
GAGCTCAGCC GGATGTACGA GTACCACGAC GCGATCCTGC ACGCCGTACG GGAGGGGCTG
CTGCTGGTCG ACGACGCGGG CAGGCTGACG CTGTGCAACG ACGGCGCCCG GGAGCTCCTC
GGCCTGCGCG GCGAGGCGGA CGGGCAGCAC GTCACCGAGC TGGGCCTGCC CTCCTCCCTC
ACCGGGCTGC TGGTCGCCGG GGAGAACCGG AGCGACGAGA TCCACCTGAC CGGAGAGCGG
ACACTCGTGG TGAACGTCGC GGTGGTCCGC TCCGGCAACC GCTCGCTCGG CACGGTGGTC
ACCCTGCGCG ACCACACCGA GCTGCAGTCC CTGACCGGGC AGCTCGACGC CGAGCGCGGC
TTCGCCGACT CCCTGCGCGC CGCCGCGCAC GAGTCGGCCA ACCGGCTGCA CACCGTCATC
ACCCTGGTCG AGCTGGGCCG CGCCGAGCAG GCGGTGGCCT TCGCGACCGC CGAGCTGAGG
GCGGCGCAGG AGCTCACCGA CCGGGTGGTG GGCGCGGTGC GCGAGCCGGT GCTCGCGGCG
CTGCTGCTGG GCAAGAGCGC CGAGGCGGCC GAGCGCGGCG TGGAGCTGGC GATCAGCCCG
GACACCGAGC TCGACGACAT CGGCCTGGAC GGCCGGGATC TGGTGACGAT CCTCGGCAAT
CTGATAGACA ACGCGGTCGA GGCGGCCGTG TCGGGCGTCC CGCCCGCGCG GGTGGACGTC
CGCCTGCGCG CCGACGGGAC GACCTTCCTG CTCGGCGTGT CCGACAGCGG CCCGGGGATG
GACGCCGCCA CCGCGCGGGA GGCCTTCCGG CGGGGCTGGA CCACCAAGGG CGACGGCCGC
GGGCTCGGGC TGGCCATGGT GGGTCAGGCG GTGCGCCGCC TGGGCGGCAC GATCGACGTC
GGCGGCGGCC GGGGAGCGGT GTTCACCGTA CGGCTGCCGC TGCGAGGGAG GGATGTTCCG
TGA
 
Protein sequence
MTQVTIAYIL VVDMRTLPGR MRRWSLARQM LALQIAVVGV TVTGGAALAF LQARDLLVEE 
AAGKSRAVTV SVADSPSVLA SLGDSVALQP YAERVRRETG VDFVTIMSTG GLRYTHPNPA
QIGRRFLGTT APALEGRTFT ETYTGTLGSS VRAVAPVRDA AGRVRALVSA GITVERIGVR
LRGQITGAVV IGLLGLSTGG AGTYLVGSRL RRQTRGMGPA ELSRMYEYHD AILHAVREGL
LLVDDAGRLT LCNDGARELL GLRGEADGQH VTELGLPSSL TGLLVAGENR SDEIHLTGER
TLVVNVAVVR SGNRSLGTVV TLRDHTELQS LTGQLDAERG FADSLRAAAH ESANRLHTVI
TLVELGRAEQ AVAFATAELR AAQELTDRVV GAVREPVLAA LLLGKSAEAA ERGVELAISP
DTELDDIGLD GRDLVTILGN LIDNAVEAAV SGVPPARVDV RLRADGTTFL LGVSDSGPGM
DAATAREAFR RGWTTKGDGR GLGLAMVGQA VRRLGGTIDV GGGRGAVFTV RLPLRGRDVP