Gene Sros_6788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6788 
Symbol 
ID8670097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7474527 
End bp7477649 
Gene Length3123 bp 
Protein Length1040 aa 
Translation table11 
GC content75% 
IMG OID 
ProductSNF2/helicase domain protein 
Protein accessionYP_003342240 
Protein GI271968044 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0383717 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGGTGG TCCATGGCGC CTGGGTGGAC GGGCAGCTGG GGGTCTGGGC GGAGGACACC 
TCCCGGGCCC CCGCGCCCGC CTCCCGTGCC ACGCTCCGCC CCCATCCTTT CGCCGCCCCC
GCGGCCGTCC TGGCCGCGGC CCTCGGCGCG GCCGTCCCCG GCGGAGCCCC GGTCCCGGGG
GAGACGGCTC CCGGGCCGGC GGACCCGGGG TTCCCCGCGG GGGCCGGGGA GGGCGAGCTG
ACCCTCCTGC TCCCCGGCTC GGCCGGGGGC CCGCTGCCCT CGCCCGAGTC CGGGCTGACG
AGCTCGGCGC GGAGCCCGAG GATCTCCGCC TGGCGCGTCC CCGCGCTGCT GCTCCGTCCG
GCCGACGCGC TCTCCCTGCT CGCCTCCCCG GCCGTTCTGG ACGCCGGGGC GGGCCCGTCC
GGGCTCGACT CCGACGACGC AGCGGGGCCC GGATGGGGTC CGGGCCTCTC CCTCCGCTAC
TTCGCCGTGG TCGCCGAGCA CGCGCGCGGC CTGGTACGGC GCGGCCGGAT CCTGCCCCGG
CTCGTGGTGG AGGGCGGCGG CCACGCGGCC CGCTGGCGGC CCGTGCTCAC CGGCGCCGAC
GCGACGGCGC TGCGCGACCT CGCCACCGCG ATGCCGCCGG TCTGCAGGGC GGTGGGCGAG
GAGCGCCCGT CGGCCGACGT GCTGCGCGAG GCCCTGAACG GCCTGGCGGA CGGCGCGGCG
CGGCTGTCGC TGCCCGACCG CCTCATCCTC GGCCACCGGC CGGGGCCCAA GGCGCCCCTG
CCCGACCGCT GGCTGTACGC GCTGACCGGC GAGGACGCCG CGCTTCCCGC CGCCAGGTCC
GCCGAGGCGG CGGCGCTGTC CGGCGCGCTC GACGGGTGGT CCGCCTCGGC GCACGAGCTG
GACGGGCCGG TCCGGGCCTG CTTCCGGCTC ATCGAGCCCG CCGGGGAGGA CGCCTCCTGG
AAGGTGGAGT TCGGTCTCGC CCCCCGCGTC TCCCCGGCCG GCTCCGGCCG CTCCCCCGAC
GGGCCCGGAG AGGCGGACGG GCAGGTCGGA TACCTGTCGG CCGACCAGAT CAGGGCCGGC
GAGAGGGCTC CCTGGCTGCC CGAGCGGCCC GAGGAGGTCC TCCGTGCCGA CCTGAGCCGG
GCCGTCCGCC TCCATCCCGA CCTCTACAGC GCGCTGCGCG ACCCCGAGCC GTCCGGCCTG
ACCGTCGAGA CGGCCTGGGC CTTCTCCTTC CTCCGCAACG GCGCGCCCAT GCTGAGAGCC
GCCGGTTACG GCGTGCGCCT GCCGGCCTGG GCGGGACGCC AGGGCCTGGG GCTGAAGCTG
ACCACGCGCA CGGTCGCCGG CGAGGAGGGG TTCGGCCTGG ACCGGCGGGT GAGCTTCCGG
CTGGACGTGG CCATCGGCGA CCACACGATC ACCGGGGAGG AGCTGGCCGG ACTGGCCGAG
CTGAGGATCC CCCTGGTCCA GGTCAAGGGG CAGTGGATCG AGCTGGACGA CCAGCAGCTC
AAGGCGGCTC TGAAGGTCGT CGAGCAGCGG GGCGGCGGCG AGCGGACCGT GGGCGAGGTG
CTCCGCGAGG TCGTGGACGG CGGCGACGAC GAGCTGCCGC TGGTCGCGGT GGACGCCGAC
GGCCCGCTCG GCGACCTCCT CTCCGGCGAG GCCGAGCGCC GGCTCACCCC GGTCGCGGTG
CCGCGGACCC TGGAGGGGAC GCTCCGCCCC TACCAGGAGC GCGGCCTGTC GTGGCTGAGC
TTCCTGTCGG GCCTGGGGCT GGGCGGCATC CTCGCCGACG ACATGGGGCT CGGGAAGACC
GTGTCCACCC TTTCCCTGCT CCTGTCCGAG CGCGAGGGCG GCGCCCACCC GCCGACGCTG
CTGATCTGCC CGATGTCGCT GGTCGGCAAC TGGCAGAAGG AGGCCGCCAG GTTCGCCCCC
TCGCTGCGGG TCTACGTCCA CCACGGCGGC ACGCGCAAGC GGGACGGCGA GCTGGCCGGG
GCCGTGCGGG AGGCTGACCT GGTCGTCACC ACGTACGGCA CGGCGCTGCG GGACCTGGGG
GCGCTGGCGG CCCTGGAGTG GGGCAGGGTG GTCTGCGACG AGGCGCAGGC GATCAAGAAC
AGCGCGGCGC AGCAGTCGCA GGCCGTGCGG TCGATTCCCG CCCGCACCCG GCTGGCGCTG
ACCGGCACCC CGGTGGAGAA CCACCTCTCC GAGCTCTGGT CGATCATGGA GTTCTGCAAC
CCCGGGCTGC TCGGCCCGGC CAAGCGCTTC CGCAGGCGCT ACCAGGACCC GATCGAGACG
CGGCGCGACG AGAGCGCCAC CACGGCGCTG AAGCGGGCCA CCGGGCCGTT CGTGCTGCGG
CGGCTCAAGA CCGACCGGTC GATCATCTCC GACCTGCCGG AGAAGCTGGA GATGAAAGTG
TGGTGCACGC TCACCCGGGA GCAGGCGGAG CTGTACAAGG CCGTGGTGAA CGACATGCTG
GACAGGATCG ACGGCTCCCG GGGCATCGAG CGGCGGGGCA ACGTGCTGGC CACGATGACC
CGGCTCAAGC AGATCTGCAA CCACCCCGCC CACCTGCTCA AGGACGGCTC GCGGCTGGCC
GGCCGGTCGG GGAAGCTGGC CCGCCTGGAG GAGCTGGCCG AGGAGATCGT CGAGGAGGGC
GACAAGGCCC TGGTGTTCAC CCAGTACACC GAGTTCGGCT CCCTCCTGCA GCCCTACCTG
GCCGCCCATC TGGACCGGCC GGTGCTGTGG CTGCACGGCG GGCTGCCCAA GAACAGGCGC
GAGGAGCTGG TGGAGCGCTT CCAGCGCGAC GACGAGCCGA TGCTGTTCCT GCTGTCGTTG
AAGGCGGCCG GGACCGGCCT CAACCTGACC GCCGCCAACC ACGTGATCCA CGTGGACCGG
TGGTGGAACC CGGCGGTGGA GAACCAGGCG ACCGACCGGG CCTTCCGGAT CGGCCAGACA
AGGAACGTCC AGGTCAGGAA GTTCATCTGC GTGGACACCC TGGAGGAGCG GATCGACGAG
ATGATCGAAC GGAAGAAGGC ACTCGCCGAG AGCGTGGTCG GCGCCGGCGA GGACTGGATC
ACGAACCTGT CCACCGACCA GCTGCGCGAG CTGTTCCGCC TCGGCCCCGG GGCGGTGAGC
TGA
 
Protein sequence
MLVVHGAWVD GQLGVWAEDT SRAPAPASRA TLRPHPFAAP AAVLAAALGA AVPGGAPVPG 
ETAPGPADPG FPAGAGEGEL TLLLPGSAGG PLPSPESGLT SSARSPRISA WRVPALLLRP
ADALSLLASP AVLDAGAGPS GLDSDDAAGP GWGPGLSLRY FAVVAEHARG LVRRGRILPR
LVVEGGGHAA RWRPVLTGAD ATALRDLATA MPPVCRAVGE ERPSADVLRE ALNGLADGAA
RLSLPDRLIL GHRPGPKAPL PDRWLYALTG EDAALPAARS AEAAALSGAL DGWSASAHEL
DGPVRACFRL IEPAGEDASW KVEFGLAPRV SPAGSGRSPD GPGEADGQVG YLSADQIRAG
ERAPWLPERP EEVLRADLSR AVRLHPDLYS ALRDPEPSGL TVETAWAFSF LRNGAPMLRA
AGYGVRLPAW AGRQGLGLKL TTRTVAGEEG FGLDRRVSFR LDVAIGDHTI TGEELAGLAE
LRIPLVQVKG QWIELDDQQL KAALKVVEQR GGGERTVGEV LREVVDGGDD ELPLVAVDAD
GPLGDLLSGE AERRLTPVAV PRTLEGTLRP YQERGLSWLS FLSGLGLGGI LADDMGLGKT
VSTLSLLLSE REGGAHPPTL LICPMSLVGN WQKEAARFAP SLRVYVHHGG TRKRDGELAG
AVREADLVVT TYGTALRDLG ALAALEWGRV VCDEAQAIKN SAAQQSQAVR SIPARTRLAL
TGTPVENHLS ELWSIMEFCN PGLLGPAKRF RRRYQDPIET RRDESATTAL KRATGPFVLR
RLKTDRSIIS DLPEKLEMKV WCTLTREQAE LYKAVVNDML DRIDGSRGIE RRGNVLATMT
RLKQICNHPA HLLKDGSRLA GRSGKLARLE ELAEEIVEEG DKALVFTQYT EFGSLLQPYL
AAHLDRPVLW LHGGLPKNRR EELVERFQRD DEPMLFLLSL KAAGTGLNLT AANHVIHVDR
WWNPAVENQA TDRAFRIGQT RNVQVRKFIC VDTLEERIDE MIERKKALAE SVVGAGEDWI
TNLSTDQLRE LFRLGPGAVS