Gene Sros_3523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3523 
Symbol 
ID8666811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3902936 
End bp3904711 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content73% 
IMG OID 
ProductHistidine kinase 
Protein accessionYP_003339202 
Protein GI271965006 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0630905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCCG CGAACATGGA CGCCCACCCC TCACCGGGGC GGGCGTCCAT CCTTTCTGAT 
GCCAGACTCG TAGGCATGGC AGAGATCGAG CCCGGTGCGT TGATACCCCA CATGCGATTG
GACGAGCTGC TCTCGGAGCT GCATGTCCGG CTGGAGGCGG TGCTGGCCAC CCGTGACCGG
GTGCACGCGC TGCTGAACGC GGTCGTGTCC GTGGGCAGCG ACCTGGACCT GGAGACCGTG
CTGCGCAGGA TCGTGGAGAC CGCCACCACC CTGGTCGACG CCACCTACGG CGCGCTGGGC
GCGATCGGCG AGGAGAACAC CCTGGTGCAG TTCATCCCGG TCGGGCTGAC CGAGGAGGAG
ATCGCCAGGA TCGAGCACTG GCCGCACGGG CTGGGACTGC TGGGCCTGCT GATCAAGGAG
CCGCGCCCGC TGCGGCTGGC TCGCATCTCC GACCATCCGG AGTCCTACGG GTTCCCGCCG
GGCCATCCGC CGATGGGGTC GTTCCTGGGC GTGCCGATCC GGGTGCGCGA GGAGGTGTTC
GGCAACCTCT ACCTCACCGA GAAGCGCGGC GGAGGGGCGT TCGACGCCGA GGACGAGGCG
ATCGTCAGCG CCCTGGCCAC CGCCGCCGGG GTGGCCATCG AGAACGCCCG GCTGTACGAG
GACAGCCGGC GGCGCGAGGT GTGGCTGCAG GCCTCGTCAG AGGTGACCAC CAGCCTGCTG
TCGGGGGCCG AACCGCAGGA GGTGCTCACC CTGATCGCGC GCCGGGCCCG GCAGATGGCC
GGCGCCGACA TCGTGGCGGT GCTGCTTCCG GACGAGACCG GACAGGTGCT GCGGGTGATC
CTCGCCGAGG GGCCGGCCGG CGACCAGGTG GCCCACGGGG AGACGCCCGT CGCGGACTCC
CTGGCGGGCA GGGCCTTCAC CAGCGGCGAG CCGCTCATGG TCGCCGATCC CGCCGAGGCG
CACAGCCCGA TCGCGATCGC CGGCTACGCC TCGCTCGGCC CGGTGGCCGC CGTGCCGATC
GGCGCGGCGG GAGGTGCGCG CGGGGTGCTG TCGCTGGGCA AGCGCTCGGG CCGCATGCCC
TTCAGCCAGT CCGAGCTGCG CACCCTGCAC GCCTTCGCCG GGCAGGCCGC GGTCGCGCTG
GAGCTGGCCG AGAGCCGGAT GGACGCCGAG CGGCTCGGCC TGCTGGAGGA GCGTGACCGG
ATCGCCAGGG ACCTGCACGA CGTGGTGATC CAGCGGCTGT TCGCCGTGGC GATGACTCTG
ATGAGCACGG TGCGGCTGGT CGACAGACCG GAGGCCTCGG GGCGGCTGCA GAACTCCATC
GACGAGCTGG ACGGCACCAT CCGGCAGATC CGCTCGACCA TCTTCGCTCT GCAGACCTCC
CAGCGGGGCG CCGACTTCGG GCTGCGCTCG CAGATCGTGG AGCTGGTGGA GGGGGCCCGC
GCGCATCTGG GCTTCATGCC GGGGCTGAGC ATGGAGGGGC AGCTGGACGG CAGGGTGCCG
GCGCCGGTGG CCGAGCATCT GCTGGCCGTG GTCCGGGAGG CGCTGTCCAA CATCGTGCGT
CATGCCAGGG CCGCCAGGGC CGAGGTGAGC GTCGAGGTGG CCGACGGCCG GCTCGTCCTC
GTCGTGACCG ACGACGGGGT CGGGATGCCC GAGGACGGCC GGCGCAGCGG GCTGCGCAAC
CTCCAGGAGC GCGCCGAACG GCTCGGCGGC TCCTTCGTGG CCGAGACGCC GCGGGGAGGC
GGCACCCGGC TGGTGTGGAG CGTGCCCCTC GGCTGA
 
Protein sequence
MTAANMDAHP SPGRASILSD ARLVGMAEIE PGALIPHMRL DELLSELHVR LEAVLATRDR 
VHALLNAVVS VGSDLDLETV LRRIVETATT LVDATYGALG AIGEENTLVQ FIPVGLTEEE
IARIEHWPHG LGLLGLLIKE PRPLRLARIS DHPESYGFPP GHPPMGSFLG VPIRVREEVF
GNLYLTEKRG GGAFDAEDEA IVSALATAAG VAIENARLYE DSRRREVWLQ ASSEVTTSLL
SGAEPQEVLT LIARRARQMA GADIVAVLLP DETGQVLRVI LAEGPAGDQV AHGETPVADS
LAGRAFTSGE PLMVADPAEA HSPIAIAGYA SLGPVAAVPI GAAGGARGVL SLGKRSGRMP
FSQSELRTLH AFAGQAAVAL ELAESRMDAE RLGLLEERDR IARDLHDVVI QRLFAVAMTL
MSTVRLVDRP EASGRLQNSI DELDGTIRQI RSTIFALQTS QRGADFGLRS QIVELVEGAR
AHLGFMPGLS MEGQLDGRVP APVAEHLLAV VREALSNIVR HARAARAEVS VEVADGRLVL
VVTDDGVGMP EDGRRSGLRN LQERAERLGG SFVAETPRGG GTRLVWSVPL G