Gene Sros_3997 details

Gene Information

Locus tag: Sros_3997
Symbol:
ID: 8667291
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4452263
End bp: 4453966
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: Signal transduction histidine kinase-like protein
Protein accession: YP_003339648
Protein GI: 271965452
COG category:
COG ID:
TIGRFAM ID:
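
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 4452263, 4453966

gene_len = gene_length_bp(start_bp, end_bp)
protein_len = protein_length_aa(gene_len)

print(gene_len, "bp")      # matches the "Gene Length" field
print(protein_len, "aa")   # matches the "Protein Length" field
```

Under these assumptions the computed values agree with the 1704 bp and 567 aa reported in the record.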


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.224789
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.328912
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGAGC CGGGACCGGA CCCCGATACC TTTTTCCCCA TGCGATCCCG TGGTCCGGTC 
CTGGACATCT GCCTGGCCGC CGCGGGAGTC GTTCTCGACC TGCTGATCTC GGGCAAGTCC
GGCCAGAGCG GCACGGCGCC CTGGTGGGTC GCGGGGGCCT TCGCGCTGCT CGCCGGGGTC
CCCCTGGGAT TCGTCCGGCG CCGCCCGTTC GCGGTCTGCC TCCACCAGGC GATCTTCCTG
GTCGTGTGCG ACCAGCTCGG CGCCCACACC TCCAACACGC TGCAGATCCT GCTCCCCGTC
GCGGTGGCGA CGCTCGCCTA CCGCGGCACC TGGCGATGGA TCGCCCCGGC CGCCGTCCTG
ACCGGCGTCG CCACGGCGAT CAACCTCGCC GACCCCGGGA TCGCCTTCAC CGCCGGCACC
TGGTACCTGC CGATCGGGAT GAGCGCCGTC CCGGTCGTCA TCGGCCGCTA CCTGCGCAGC
CCCATCGAGC CGGTGTCCGA GGTGGAGCGG CGTCCCGGCC TCGACCTGCT GCTCGCGGGC
GGCGGGGTGG CCTTCATGGT GCTCGACACC TGGACGGAGT GGGACAGCGG GCAGCTGCCG
GTCTGGGCGG CCGGCTGGTT CGCCATCGGC TGCGGGCTCA CCGCCGGCCT GGTGCGCAGC
CTGCCCGCCA CGGCGTTCGC GCTGCAGGCG CTGCTGGTGC TCCTGGCCGA CCAGAACGCG
GGGTTCGCCG CCAACTCGAT GCAGGGGCTG TTCCTGGTCA CGCTCGGCGC GTTCGCGATG
CGCGCCCCGT GGTCCTGGAC GGCCGCCGCC TACCTGATCG CCAGTGGCGT GACCGCGCTG
AACATCGTCG GAGGCGAGCT GTCACACGTC ACCCCGCCCC GGGTGGCCGC GCTGCTGGCG
ATGGTGGCGG CACCCATCGT CATCGGCCGC TACGTCGGCG TCCGCCGGGC CGCGGCGGAT
CTGGAGCTGG CCATGGCGGA GGAGGCCAGG CAGCTCACCG CCGAGCGCGC CCGGGCCGAC
CTGCTGGCCG AGCGCGAGCG CATCGCCCGC GACGTCCACG ACATCGTGGC CCACCATGTC
GGCGCCATGG TGCTGCGGGC GGGCGCCGCC CAGTACGCGG CGCCCTCCGG CCCGGTGGCC
GAGGCGCTGT CGGACATCCG GTCCACCGGC CACCAGGTGC TGGAGGACCT GCGCGGCCTG
CTCAACGTGC TGCGCGACCC GGATCTCGCC CACCTGCCCG TGGCCGACCC CGAGGAGGTG
GTGCGCGACG CGGTGGAGCG GATGAACGCG GCCGGGCTGA GGGTCGAGCT CCGCCTCGGC
CCGGAGGCCG AGCTCGCCCC CCTGGTGACC CGGGCCTCGG CCGCCCGGAT CGTCCAGGAG
GGCCTGACCA ACGTCCTCAA GCACGCCGGC CCCGGCACCT CGGCGGTCGT CGAGCTTGCG
GTGGCCGGCG GCTCGCTCGG GGTGAACGTG CTCAGCGGCG CGCCTCCCGG CCCCCGCGAC
GCGCTGCCCT CCTCCGGCCA GGGCATCGCG GGCATGCGGG AACGGGCCAG GGCGCTCGGC
GGAGACCTCT CCGCCGGTCC CGACGGGGCC GGCGGCTGGC GGCTGGCGGC GACGCTCCCC
CTGGAACGCG CCGACCGCGT GGGCGCGGAG ACCCCTTCGC GCAGGAGACT GCCCTGGTGG
CGTGAGGAAA GTGAGACCTC GTGA
 
Protein sequence
MREPGPDPDT FFPMRSRGPV LDICLAAAGV VLDLLISGKS GQSGTAPWWV AGAFALLAGV 
PLGFVRRRPF AVCLHQAIFL VVCDQLGAHT SNTLQILLPV AVATLAYRGT WRWIAPAAVL
TGVATAINLA DPGIAFTAGT WYLPIGMSAV PVVIGRYLRS PIEPVSEVER RPGLDLLLAG
GGVAFMVLDT WTEWDSGQLP VWAAGWFAIG CGLTAGLVRS LPATAFALQA LLVLLADQNA
GFAANSMQGL FLVTLGAFAM RAPWSWTAAA YLIASGVTAL NIVGGELSHV TPPRVAALLA
MVAAPIVIGR YVGVRRAAAD LELAMAEEAR QLTAERARAD LLAERERIAR DVHDIVAHHV
GAMVLRAGAA QYAAPSGPVA EALSDIRSTG HQVLEDLRGL LNVLRDPDLA HLPVADPEEV
VRDAVERMNA AGLRVELRLG PEAELAPLVT RASAARIVQE GLTNVLKHAG PGTSAVVELA
VAGGSLGVNV LSGAPPGPRD ALPSSGQGIA GMRERARALG GDLSAGPDGA GGWRLAATLP
LERADRVGAE TPSRRRLPWW REESETS
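
The gene and protein sequences above are printed in blocks of 10 characters, so the spaces have to be removed before any processing. The sketch below strips the whitespace and translates the first and last lines of the gene sequence to check them against the matching ends of the protein sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean` and `translate` are hypothetical.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGAGAGC CGGGACCGGA CCCCGATACC TTTTTCCCCA TGCGATCCCG TGGTCCGGTC"
last_line = "CGTGAGGAAA GTGAGACCTC GTGA"

print(translate(first_line))  # MREPGPDPDTFFPMRSRGPV
print(translate(last_line))   # REESETS
```

The last line can be translated on its own because it starts at nucleotide 1681, which is the first position of a codon. Both outputs match the corresponding ends of the protein sequence in the record.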