Gene Sros_4092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4092 
Symbol 
ID8667386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4550116 
End bp4552128 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content75% 
IMG OID 
ProductSuperfamily I DNA and RNA helicase-like protein 
Protein accessionYP_003339743 
Protein GI271965547 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.445905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGC TGGAAGACGA GAAGGACTAC CTCGCACGCT GCGAGACGGC ATTGCGCAGG 
ATGCTCGACG GCGCCCGCCT GAACGTGGTC GTCGGCGAGC GGGTGGCAGG CGACCGCTAC
AGCGCCGAGC GGCTCGGACG GCACCTCAAA AGCCTCGCCA AGGAGCTGGC CGAGGAGCCC
GACGGCCCGC CGTTCTTCGG CCGCCTCGAC TTCGGCTCCG GCCCCGCCGC CGGCGACCAC
CGCGCCCAGC GCTACTACAT CGGCCGCCGG CACATCTCCG GCGACGCGGG GCAGCAGCCG
ATGGTGATCG ACTGGCGCGC CCCCGTCTCG CGGGCGTTCT ACCGGGCCGG CGCCCGCGAC
CCCCAGGGCG TCGCCGTACG CCGCCGCTTC GGCTGGGCGG GCAGGACGCT GACCGGGTTC
GAAGACGAAC GGCTCGACCG CGGCGAGGAC CTGGGGGCCG CGAGCCGGAT CGTGACCGCC
GAGATCGAGC GTCCCCGCGT CGGCCCGATG CGGGACATCG TCGCCACCAT CCAGCCCGAG
CAGGACGAGC TGGTCCGCGC CGGGCTGGAG GATTCGATCT GCGTCCAGGG AGCTCCGGGG
ACGGGCAAGA CCGCCGTCGG CCTGCACCGG GCCGCCTACC TGCTCTACGC CCACCGGCAG
CGGCTCGAAC GCGGGGGCGT GCTGGTCCTC GGCCCCAACC ACGCCTTCCT CGGCTACATC
TCGGCGGTGC TGCCGGCGTT GGGCGAGGTG GACGTCGAGC AGACGACCAT GGAGCGGCTG
CTGGCCCACG CCCCGATCCG GCAGGTGGAC GGCGAGGCCG CCGCGACCGT CAAGCACGAC
GCCCGCATGG CGGACGTGCT GCGCCGGGCC CTTTACGGCC GGGTGCGCAG GCCCGCCGAG
CCGCTCACCG TCCCCGACGG CTCCTACCGC TGGCGCGTAC CGCAGGAGGA CCTCCGGCGC
ATCGTGGACG ACACCCGGCG CGAGGCCCCG CCCTACGCCG TCGGACGCGA GCGCGTGCTC
TCCCGCACCG TGGCCGCGCT GCGACGGCAG GCCGAGGCGC GCGGGCAGAC CACCGGCGCC
GCCTGGACAC GCAAGATGGG CAGGGCCGTC ACACCGTTCC TGGACGCGGT GTGGCCCGCC
GTACGACCGC ACGAGGTCGT CGCGGAGCTG CTCGGCGACC CCGCCGCGCT CGCCCGCGCC
GCGGACGGGA CGCTGACGCC GCGGGAACAG GCGGCGATCA CATGGGCCAG GCCGCCGCGG
ACGTTCAAGA GCGCCAGGTG GTCGACGGCC GACACCGTGC TCATCGACGA GGTCGCCGGC
CTGCTGGAAC GCCCGCGGAG CTACGGTCAC GTGATCGTCG ACGAGGCACA GGACCTGTCG
GCGATGCAGT GCCGGGCCGT CGCCCGCAGG AGCGAGCACG GCTCGATCAC CGTGCTGGGG
GACCTGGCGC AGGGCACGAC GCCGTGGGCC GCTCGCGACT GGCGGGAACG GCTGGCGCAC
CTGGGCAAGC CGGAAGCCCA GGTGATCGCG TTGACGACAG GTTTCCGCGT GCCCTCGGAC
GTGGTCGCGC TCGCCAACCG CCTGCTCGGG GCGCTCAAGG TGGACGTGCC GCCGACGCGC
TCGTTCCGGA CCGACGGCCG GCTCCGCGTC GAGGAGGTGT CCGATCTGCC GCACGCCACC
GTCGCCGCGG TGCGCGACGC CCTGCGCCAT GACGGCTCGA TCGCCGTCGT CGCCGCCGAC
GCCGCCGTGG AAAGGCTGGC GGCGGCACTG TGGAACGCGG GCGTCACGAT CGCCGAGGCC
GGCGAGACGG GCGGCGGCGC GCGGGTGACC GTGGTGCCCG CGACGATGGC CAAGGGCCTG
GAGTACGACC ACGTGGTGGT GGCCGAGCCG GCGGAGATCG TCGGAGCCGA GGAGCGGGGC
CTCAACCGGC TGTACGTGGT GCTCACCCGC GCCGTCTCCC GGCTGGACGT GCTGCACCGC
CGCCCCCTGC CGCCGCAGCT GACCGGCGCG TGA
 
Protein sequence
MTLLEDEKDY LARCETALRR MLDGARLNVV VGERVAGDRY SAERLGRHLK SLAKELAEEP 
DGPPFFGRLD FGSGPAAGDH RAQRYYIGRR HISGDAGQQP MVIDWRAPVS RAFYRAGARD
PQGVAVRRRF GWAGRTLTGF EDERLDRGED LGAASRIVTA EIERPRVGPM RDIVATIQPE
QDELVRAGLE DSICVQGAPG TGKTAVGLHR AAYLLYAHRQ RLERGGVLVL GPNHAFLGYI
SAVLPALGEV DVEQTTMERL LAHAPIRQVD GEAAATVKHD ARMADVLRRA LYGRVRRPAE
PLTVPDGSYR WRVPQEDLRR IVDDTRREAP PYAVGRERVL SRTVAALRRQ AEARGQTTGA
AWTRKMGRAV TPFLDAVWPA VRPHEVVAEL LGDPAALARA ADGTLTPREQ AAITWARPPR
TFKSARWSTA DTVLIDEVAG LLERPRSYGH VIVDEAQDLS AMQCRAVARR SEHGSITVLG
DLAQGTTPWA ARDWRERLAH LGKPEAQVIA LTTGFRVPSD VVALANRLLG ALKVDVPPTR
SFRTDGRLRV EEVSDLPHAT VAAVRDALRH DGSIAVVAAD AAVERLAAAL WNAGVTIAEA
GETGGGARVT VVPATMAKGL EYDHVVVAEP AEIVGAEERG LNRLYVVLTR AVSRLDVLHR
RPLPPQLTGA