Gene Sros_1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1953 
Symbol 
ID8665235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2095218 
End bp2097278 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content71% 
IMG OID 
Productglutathione import ATP-binding protein GsiA 
Protein accessionYP_003337684 
Protein GI271963488 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATCC TTGAGGTGAA CGACCTCAAC GTCACCTTCC GCGGCGGCAT CAAGGCCGTC 
CGGGGCGTGA GCTACCAGCT CAGCCGGGGC GAGGTCCTCG GCATCGTCGG CGAGTCCGGC
TCCGGCAAGT CGGTGACCTC GCTGGCGGTG ATGGGCCTGC TCCCGGCCGG CGCCGAGGTC
AGCGGCTCGG TCCGGCTGCA CGGCCGGGAA CTGCTCGACA TGCCGGAGGA CGAGCTCGTC
CGGCTCCGCG GCAAGACCAT CGGGATGATC TTCCAGGACC CGCTGTCGGC CTTCACCCCC
GTCTACACCA TCGGCGACCA GATCGCCGAG GCCGTGCGGA TCCACCAGAA GATCGGCAAG
GACAAGGCGG CCAGGCGGGC GGTCGAGCTG CTGGACCTGG TCGGCATCCC GCACCCGGAC
GTGCGGGCCA AGGCGTTCCC GCACGAGTTC TCCGGCGGCA TGCGCCAGCG CGCCATGATC
GCCATGGCCA TCGCCAACGA CCCCGACGTG CTGATCTGCG ACGAGCCGAC CACCGCGCTC
GACGTCACCA TCCAGGCCCA GGTCCTGGAG GTGCTCAAGA CCGCGCAGCG GGAGACCGGC
GCCGGCATCG TGATGATCAC GCATGACCTC GGGGTCGTCG CCGGCATCGC CGACCGGGTG
CTGGTCATGT ACGCCGGCAA GCCGGTCGAG CTCGGCACCG TGGACGAGAT CTACTACCGT
CCGCGCCAGC CGTACACGAT GGGCCTGCTC GCCTCGATCC CGCGCATGGA CCGCCCGACC
GCGCGGCTCA TCCCGATCGA CGGCAACCCG CCCTCGCCCG CCGCGCTGCC GCCGGGCTGC
CCGTTCGCGC CCCGCTGCCC GATGAGGGTC AGCGCGTGCG ACGAGGCGGA GCCCGAGCTG
GAGCGGATCG GCCCCGGCAC CCGGATGTCG GCCTGCATCC GCTCGCACGA GATCGAGCTC
AAGGGCCTCG ACGGTGCCAC GATCTACCCG GTGCCCGAGG CGCCCGCCGA GACGGCCGAG
CCCCGGCCGC GCGCCGAGCG CGACACCGTG CTCAGCGTCG AGAACATGAT CCGGCACTAC
CCGCTCATGA AGGGCGCGGT GTTCAAGCGC CGGGTCGGCA CCGTGCACGC GGTGGACGGC
ATCAGCTTCG ACGTCGCCGA GGGCGAGACG CTCGCCCTGG TCGGCGAGTC GGGCTGCGGC
AAGACCACCA CCCTCCAGCA GATCATGCAG CTGGAGGCGC CGCAGAGCGG CACGGTCGTC
GTGCTCGGCA AGGACAGCGC CACGCTGGCC AAGGCCGAGC GCAAGGCGCT CCGCCGGGAC
CTGCAGATCG TCTTCCAGGA CCCGATGGCC GCGCTCGACC CGCGCATGCC GGTCGGCGAC
ATCCTCGCCG AGCCGCTGCG CGCGCACGGT CACAAGGACG TCAAGGGCAG GATCGCCGAG
CTGCTCAGCC TGGTGGGCCT GGACCCCAGC CACGCCCAGC GCTACCCGCA GCACTTCTCC
GGCGGACAGC GCCAGCGCAT CGGCATCGCC CGCGCGCTGG CCCTGGAGCC CCGGCTGATC
GTGCTCGACG AGCCGGTGTC GGCACTCGAC GTGTCCATCC AGGCGGGTGT GATCAACCTG
CTGGAGGACC TGAAGGTCAG GCTCGGCCTG TCCTATCTGT TCGTCGCGCA CGACCTGTCG
GTGGTCCGGC ACCTCGCGGA CCGGATCGCC GTCATGTATC TCGGCAGGAT CGCCGAGATC
GGCACGGTCG ACGAGGTGTA CGGCAGGCCC GCCCACCCCT ACACGCGGGC GCTGCTGTCG
GCGATCCCGC TGCCCGACCC CGAGCTGGAG CGCTCACGCG AGCGGATCCT GCTCGAAGGC
GACCTGCCCA GCCCGGCCGA CCCCCCGTCG GGCTGCCGCT TCCGCACCCG CTGTCCCAAG
CGCGCCCTGC TGGGCGCCGA GGATGCACGC AGGTGCGAGG AGGAAGAGCC CTCTGTTGTA
CGGCTCGCGT CCGCCGTCGA TCACGGCGCC GCCTGCCACT ACCCCGAAGA GGCCGAGGTC
GTCGTGACCT CGCGTCACTG A
 
Protein sequence
MPILEVNDLN VTFRGGIKAV RGVSYQLSRG EVLGIVGESG SGKSVTSLAV MGLLPAGAEV 
SGSVRLHGRE LLDMPEDELV RLRGKTIGMI FQDPLSAFTP VYTIGDQIAE AVRIHQKIGK
DKAARRAVEL LDLVGIPHPD VRAKAFPHEF SGGMRQRAMI AMAIANDPDV LICDEPTTAL
DVTIQAQVLE VLKTAQRETG AGIVMITHDL GVVAGIADRV LVMYAGKPVE LGTVDEIYYR
PRQPYTMGLL ASIPRMDRPT ARLIPIDGNP PSPAALPPGC PFAPRCPMRV SACDEAEPEL
ERIGPGTRMS ACIRSHEIEL KGLDGATIYP VPEAPAETAE PRPRAERDTV LSVENMIRHY
PLMKGAVFKR RVGTVHAVDG ISFDVAEGET LALVGESGCG KTTTLQQIMQ LEAPQSGTVV
VLGKDSATLA KAERKALRRD LQIVFQDPMA ALDPRMPVGD ILAEPLRAHG HKDVKGRIAE
LLSLVGLDPS HAQRYPQHFS GGQRQRIGIA RALALEPRLI VLDEPVSALD VSIQAGVINL
LEDLKVRLGL SYLFVAHDLS VVRHLADRIA VMYLGRIAEI GTVDEVYGRP AHPYTRALLS
AIPLPDPELE RSRERILLEG DLPSPADPPS GCRFRTRCPK RALLGAEDAR RCEEEEPSVV
RLASAVDHGA ACHYPEEAEV VVTSRH