Gene Sros_1792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1792 
Symbol 
ID8665070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1910002 
End bp1912476 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content73% 
IMG OID 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003337525 
Protein GI271963329 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCACA GGGACGGCAC CCCCGACGGC GACGTGATCA CGGTCGTCGG GGCCAGGACC 
CACAACCTGC GGAACGTCAG CGTCACTTTC CCCAAAAACA GGATCGTCGC CTTCACCGGT
GTCAGCGGCA GCGGGAAGAC CTCGCTCGCC ATCGACACGG TGCACGCCGA GGCCCAGCTG
AGGTATCTGA ACGGCGTCTC GCCGTTCTTC CGCCAGTTCA TCTCGCCCAG AAACCGCCCG
CAGGTCGATC GGATCTCCGG ACTGGGCGCG ACCCTGGCGG TGGACCAGCG CCGGCTGAAC
CGCAACCCGC GCTCCACCCT CGGCAGCGTC ACCGGGGTCG ACGACTTCCT CGGCCTGCTG
TTCGCCCGTA TGCCGGCGCT GGGGCCCGAA GCCCGGGAGT TCCCCGAGCT ACGCGGGCTG
ACCAGTGCTT ATTTCGACAG GTACAGCATG GAAGGCCGGT GCAAGGCGTG CGGAGGCAGG
GGCGTCGAGG TGCGCCCGGC CGAGGATCGG ATCGTGACCC GGCCGGACCT GCCGTTGCTC
GCGGGCGCCG GGACGTGGTT CGACCGGGCC AACTCCGGGG AGTTCTGGGC GCTGCCGGCC
CTCGCCGAGC GGTACGGCGC GGACCTCGCT CTGCCCTGGC GCGAACTGCC GGAGGAGTTC
CGGCGGGCGG TGCTCCACGG CACGGGTGAC GGGCCGATCA CCTACAAGAT CACCACCAAG
CAGCGGAAGG CGGGCGCCGA TGTCACCGTC GAGCGCAGCG TGCCGTTGAG GGGAGCGGTC
TCCGAGGTCT CCAGGCTTCA CGACGCCGCG GGCACCGACG CGGCCAGAGA GCACTACGCC
CGGTTCATGC GCCGGACCGG GTGCCCCCGC TGCGGCGGCA CCGGCTTCGG CGAGGTCGGC
CTGACGGTCA GGCTCGCCGG CCTGGCCTAT CCCGAGGTGA TCAGACTCCC CGTCGAGGAT
CTGCGCGGGT GGGTGGACAC GATCGAACCG ACGCTCACCC CGACGCAGCG GAAGGTGGCC
GGCGACATCC TGCCCGGCCT GGCCCGCCGC GTGCGGCTGA TGGTCGAGCT GGGGCTCGGG
CACCTCCAGA TCACCCGCAG CGCCCCGTCG ATGTCGGGCG GCGAGCTCCA GCGGGCGCGG
ATCACCGCCC AGCTGAACAC GGACCTGACC GGGATCGTCT TCGTCCTCGA CGAGCCGGGG
GCGGGCCTGC ACCCCGCCGA CAAGCACCCG CTGCGCGAGA TCCTCCACGA CCTGCGCGAC
GCGGGCAACA CCGTCCTGCT CGTGGAGCAC GACCCCGAGC TCATCGCCCT GGCCGACTGG
GTGGTGGACC TCGGCCCCGG AGCCGGGCGC GAGGGCGGCA GGCTCGTCGC GTCCGCTCCG
CCGTGGGAGC TGAGCCGCGA CGGGCGGTCG ATCACCGGCG CCTATCTCGG CGGACGCGGT
CCCCGTGTCC GCCGGACCCG GCGCTCCGAC CCCGCCACGA CGCCCCGGCT CGGCCTGATC
GGCGTCCAGG CGCACAACGT CCGCCTGGAC CGGGTGGACA TCCCCCTCCA CGCGCTCACC
TGCATCACCG GGGTCAGCGG CAGCGGCAAG AGCAGCCTGC TGCACGAGGC GCTCGGCGCG
AGCCTCGACG CCGTGCTCCG CGGCGAGCGC CCGCGGGCCG TCGCCCGGAT CGAAGGCGCC
GGGCTGCTCG ACTGGGTGAC GGTCGTGGAC CAGAACCCCA TCGGCCGCAC ACCGCGTTCC
AGCCCCGCCA CCTACACCAA GGCGTTCGAC ACGATCCGCA GGCTCTACGC CTCCACTCCG
GCGGCCAGGA GCCGCGGGCT CGGCGCGGGA GCGTTCTCCT TCAACTCCCG CGGAGGCCGG
TGTGAGGCGT GTGCCGGCTA CGGCAGGCGC CAGGTGGACA TGCACTTCAT GCCGGACATG
TGGGTCGGGT GCGACGTCTG CGAAGGACGC CGCTTCACCC CCGAGCTGCT CGCCGTCACC
TACCGCGGCA AGGCCGTCGA CGAGGTGCTG GACATGACCG TGGACGAGGC GGCCGAGTTC
TTCGGCGAAC GCGCCGACCT CGCCGCGACG CTGCGCGCCG CCCAGCAGGC CGGTCTGGGC
TATCTCCGGC TCGGCCAGAG CGCGACCGAG CTCTCCGGAG GCGAGGCGCA GCGCCTGAAG
CTGGCCAACG CGATCATGCT GGGCGGCGGG AGCCGCGGCC GGGGCCTGGT GATCCTCGAC
GAGCCGGTCA CCGGCCTGCA CCCCTCCGAC GTGCAGCGCG TGGTCGACGC GTTCGACACG
CTGCTCGCGT GCGGCAACTC GGTCGTCATC GCCGAACACG ACCTGCACGT GGCCGCCTGC
GCCGACTGGA TCGTCGACAT GGGACCGGGC GCCGGCGACA GGGGCGGGCG GATCGTCAAC
GAGGGCCCTC CGGCGACGAT CGCGGCGGGG CCCGGTGTGA CCGCGGGATA CCTGCGGCCC
CTCCTCGCGG GCTGA
 
Protein sequence
MSHRDGTPDG DVITVVGART HNLRNVSVTF PKNRIVAFTG VSGSGKTSLA IDTVHAEAQL 
RYLNGVSPFF RQFISPRNRP QVDRISGLGA TLAVDQRRLN RNPRSTLGSV TGVDDFLGLL
FARMPALGPE AREFPELRGL TSAYFDRYSM EGRCKACGGR GVEVRPAEDR IVTRPDLPLL
AGAGTWFDRA NSGEFWALPA LAERYGADLA LPWRELPEEF RRAVLHGTGD GPITYKITTK
QRKAGADVTV ERSVPLRGAV SEVSRLHDAA GTDAAREHYA RFMRRTGCPR CGGTGFGEVG
LTVRLAGLAY PEVIRLPVED LRGWVDTIEP TLTPTQRKVA GDILPGLARR VRLMVELGLG
HLQITRSAPS MSGGELQRAR ITAQLNTDLT GIVFVLDEPG AGLHPADKHP LREILHDLRD
AGNTVLLVEH DPELIALADW VVDLGPGAGR EGGRLVASAP PWELSRDGRS ITGAYLGGRG
PRVRRTRRSD PATTPRLGLI GVQAHNVRLD RVDIPLHALT CITGVSGSGK SSLLHEALGA
SLDAVLRGER PRAVARIEGA GLLDWVTVVD QNPIGRTPRS SPATYTKAFD TIRRLYASTP
AARSRGLGAG AFSFNSRGGR CEACAGYGRR QVDMHFMPDM WVGCDVCEGR RFTPELLAVT
YRGKAVDEVL DMTVDEAAEF FGERADLAAT LRAAQQAGLG YLRLGQSATE LSGGEAQRLK
LANAIMLGGG SRGRGLVILD EPVTGLHPSD VQRVVDAFDT LLACGNSVVI AEHDLHVAAC
ADWIVDMGPG AGDRGGRIVN EGPPATIAAG PGVTAGYLRP LLAG