Gene Sros_7437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7437 
Symbol 
ID8670758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8213018 
End bp8216323 
Gene Length3306 bp 
Protein Length1101 aa 
Translation table11 
GC content73% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003342863 
Protein GI271968667 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTAG CTCGTGGGTT GGTCGTGGCG ATGGTGGCGG GCGCGGTCGT GGCAGCGCCC 
GCCGATCACT TGGGCGCTGT CGCCGATGTG CGGCAACAGG GCGCGGACGC CGTCACACGA
CAGCGGAGCG CGGAACCCGC GGAGAAGCCG GCTCTCGCTC TCGCCCGCCG GACGGGGCAG
CCGGTCGAGG TGGACGCGAT GCGCACGGAG ACGCGCCAGG TCTTCGCCAG ACCGGACGGC
ACCTTCGTGC TGGAACAGCA CGCGCGCCCG GTGCGCGTGC GCCAGGCGGG CCGGTGGGTG
GCCGCGGACG CGACGCTCCG GCTCCGGCCC GACGGCGCGG TCGCCCCTGT CGCGACGGCG
GTCGACCTGA CGTTCTCGGG CGGTGGGAGC GCGCCGCTCG CCCGGATGGC GCGCGGCGGC
AAGGCGCTGG AACTGGAGTG GCCGGGCGCG CTGCCGAAGC CGGCGCTGAG CGGTGACACC
GCGACCTATC CCGAGGTGCT GCCGGGAGTG GATCTCCAGG TGAAGGCGGA CGTCGACGGT
TTCTCCCACC TCCTGGTGGT CAAGTCGCGG GCGGCGGCCG AGAACAAGGC CCTGGCCGAG
CTGGGCTTCC CGCTGTCGGC CGACGGGCTG TCGATGAAGT CCACCAGGGG CGGCAACCTC
GAGGCGGTCG ACGCGAACGG CGGAACGATC TTCACTGCCT CGCCGCCGAT GATGTGGGAC
TCGTCGGGGG CCTCGACCGC GCGCACGCTC GCCGCGGGGG AGGGCTCCCC GGCGGACGCC
CGTCACGAGG TGATGGGCCT CGAACTCGCC GGCGGGAAGC TGCTGCTCAG GCCGGACCGG
GACATGATCG AGGACCCGAG GACCAGGTTC CCGGTCTACC TCGACCCGTT CTTCTCGGCG
GCCCGCGGCG CGTGGACGTC CGTCTGGAGC AACAATCCAA GCTCCAACTT CCTGAACGCC
AACGACGTCG CCAGGGCGGG ACACGTCCCC GGGCAGACCA ACCGGTCGTT CTTCCAGATG
AACACCGGCA CGGCCATCCA CGGCAAGCAG ATCATCAAGG CGACGCTGCG GACATACGAG
ACGTGGTCCT ACTCGTGCAG CGCGCGGAAG GTCGAGGCCT GGTCGACCAA TCCGATCAGC
AAGAGCACCA CCTGGAACAA GCAGCCCACG TGGGCCAAAC TCCTCGACAC CGTGAACGTC
GCGAAGGGCT GGGGGCCGGC CTGCCTGCCC GGCGGTGTCG AGTTCGACGT CACCAGCCAG
GCCGTGGACG CCGCGGCGAA GAAGTGGCCC AACATGACCA TCGGGCTCAG GGCCACGAAC
GAGTCGGACA ACTACAGCTG GAAGAAGTTC AAGAACAACC CCAGCCTGGT GATCGAGTAC
AACTCGCTGC CCGCCGCGCC CGTCGCCGCC GACGCGTGGT CGGACCCGGG AGGCGGCTGT
GTCGCCGGTG AGGGGCGGCC GACCATCGGC AGCACCACAC CGAAGCTGTG GGCCAAGCTG
CGTGACGCCG ACAACTCGGT CAGGGGCCGG TTCGAGTGGT GGAACGCCGC CGGCACGAAG
GTGGGCGAGC GTCTCACCGA GCCGCAGGGA ACGGGCAGCG CCTTCTACGC GGACGTCCCG
CAGGGCGCCT TCGGCGACGG CTCCGTGATC CGCTGGCGGG TCCGCGGCGA GGACGGCAGG
GCCAGTAGCG CGTGGAGCCC CTGGTGCGAG GCGACCGTCG ACGCCACCGC CCCCGGCAGG
GAACCGGGGG TGACCTCGCC GGAGTACGCC GAGGGAGGCT GGAACGACGG CGTCGGCCGG
GCCGGCTCCT TCACCTTCAC CGCCAACGGC GTCGCCGACG TCGTCGGCTA CGTCTACGGG
CTGGACACCG CGCCCAAGGT GGAGGTCGCG GCCGGCCTGC AGGACGGGTC GGCGGCGATC
CGGCTCACCC CCCGCCACGA CGGCCCCAAC GTGCTCTCGG TGCGCAGCAA GGACCGCGCG
GGACACCAGG GCCCGATCCG CACCTACGTG TTCAACGTCA ACGCGGGAAC GGGCCCTGAC
GGGCACTGGG CGCTGGACGA CGGGCAGGGC ACCGGGGCCG CCGACCGCAC GGGCGCCCAC
CAGGCCACCC TGTACGGCCC GGCCGGCTGG ACCACCGGCA GGACCGGCCT GGGACTGCGG
CTCGACGGCG CCGGCGGCCA CGCGCGGACC ACCGGCCCCG TCGTGTCCAC CCTGAGCAAC
TTCACGGTCG CCGCCTGGGT GCGGTTGACC GGTGTCAACG CCACCGCGAC CGCGGTCAGC
CAGGACGGCG GGCGGACCGG CGGATTCTCC CTGCAGTACT CCAAGGCCGA CAACCGCTGG
GCGCTGGGCC GTACCGGCGC GGACGCCGAC GGAGCGCCGG CCGTGCGCGC GCTGTCGGAG
GCCGCGCCGA GGCTCGGCGA GTGGACCCAC CTCACCGGCG TCTACGACGC AGCCGCGGAG
ACGCTGTCCC TGTACGTCAA CGGGCGCCTG GAGTCGACGG TCCCGTTCAC CCAGCCGTGG
GACGCGACCG GGCCCCTGGC GATCGGCAGG GCGAAGGCCG ACGGAGGGGC GGCGGAGTTC
TGGCCGGGCG ACATCGACGA CGTCCGCGTC CACGGCCGGG CGATGTTCGG CGACGAGGTC
GCCGACCTGG TGAACAGCGC CGCGACGTTG GTCGGGCACT GGAAACTGGA CGAGGGGGCC
GGTGCCGCCG CCGCCGACTC CTCCGGACGG GCCTCGGCGG CGGCGCTGGG CGGCGGCGCG
TCCTGGACCG AAGGCTGGCT GGACGGCGCG CTCGCCCTCG ACGGCACGGG CGGTTACGCG
CAGGCCGCCG CTCCCGCGGT CGACACCAGA GCCGGATTCA CCGTCGCCGC CTGGACCCAG
CTCGACTACC TGCCCACCCG CGACGCGTCC GCCGTCTCCC AGGCCGGGAA CAGGGCCAGT
GGCTTCCAGC TCGGCTATGA CAGGGAGCAG GGGCGGTGGG TCCTCGGCAT GGCCGCCGCC
GACACCGACA CCACCGCGCT GGTGCGGACC CGGTCCGACG CCCTCCCGGT GCCGCTGGAG
TGGGCCCACG TGGCCGGGGT CTACGACCCG CTCCAGGGCG AGCTGCGGGT CTACGTCAAC
GGACGGCTGT CGGACAGCAC CGTCACCGAT CACGTGAGCG CGTGGAACGC CGTCGGCCCC
CTGCAGCTGG GCCGCGCCAA GAACGCGGGT GTCTTCACCG GTTACTGGCC GGGGACCGTC
GACGACGTCC GCGCCTATGA CGGGGTGCTG AGCGCCGAGC AGATCTCCCA GCTGGCCGCC
CAGTAG
 
Protein sequence
MRLARGLVVA MVAGAVVAAP ADHLGAVADV RQQGADAVTR QRSAEPAEKP ALALARRTGQ 
PVEVDAMRTE TRQVFARPDG TFVLEQHARP VRVRQAGRWV AADATLRLRP DGAVAPVATA
VDLTFSGGGS APLARMARGG KALELEWPGA LPKPALSGDT ATYPEVLPGV DLQVKADVDG
FSHLLVVKSR AAAENKALAE LGFPLSADGL SMKSTRGGNL EAVDANGGTI FTASPPMMWD
SSGASTARTL AAGEGSPADA RHEVMGLELA GGKLLLRPDR DMIEDPRTRF PVYLDPFFSA
ARGAWTSVWS NNPSSNFLNA NDVARAGHVP GQTNRSFFQM NTGTAIHGKQ IIKATLRTYE
TWSYSCSARK VEAWSTNPIS KSTTWNKQPT WAKLLDTVNV AKGWGPACLP GGVEFDVTSQ
AVDAAAKKWP NMTIGLRATN ESDNYSWKKF KNNPSLVIEY NSLPAAPVAA DAWSDPGGGC
VAGEGRPTIG STTPKLWAKL RDADNSVRGR FEWWNAAGTK VGERLTEPQG TGSAFYADVP
QGAFGDGSVI RWRVRGEDGR ASSAWSPWCE ATVDATAPGR EPGVTSPEYA EGGWNDGVGR
AGSFTFTANG VADVVGYVYG LDTAPKVEVA AGLQDGSAAI RLTPRHDGPN VLSVRSKDRA
GHQGPIRTYV FNVNAGTGPD GHWALDDGQG TGAADRTGAH QATLYGPAGW TTGRTGLGLR
LDGAGGHART TGPVVSTLSN FTVAAWVRLT GVNATATAVS QDGGRTGGFS LQYSKADNRW
ALGRTGADAD GAPAVRALSE AAPRLGEWTH LTGVYDAAAE TLSLYVNGRL ESTVPFTQPW
DATGPLAIGR AKADGGAAEF WPGDIDDVRV HGRAMFGDEV ADLVNSAATL VGHWKLDEGA
GAAAADSSGR ASAAALGGGA SWTEGWLDGA LALDGTGGYA QAAAPAVDTR AGFTVAAWTQ
LDYLPTRDAS AVSQAGNRAS GFQLGYDREQ GRWVLGMAAA DTDTTALVRT RSDALPVPLE
WAHVAGVYDP LQGELRVYVN GRLSDSTVTD HVSAWNAVGP LQLGRAKNAG VFTGYWPGTV
DDVRAYDGVL SAEQISQLAA Q