Gene Sros_2036 details


Gene Information

Locus tag: Sros_2036
Symbol:
ID: 8665318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2185713
End bp: 2189045
Gene Length: 3333 bp
Protein Length: 1110 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: ATPase-like protein
Protein accession: YP_003337764
Protein GI: 271963568
COG category:
COG ID:
TIGRFAM ID:
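
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, when has_stop is
    True, that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2185713, 2189045)
print(length)                  # 3333
print(protein_length(length))  # 1110
```

Both results match the Gene Length (3333 bp) and Protein Length (1110 aa) fields reported above.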


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.856154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGATTCG GGATCCTGGG CACCACGCGG GTGTGGCGTG ACGACGGGAG CGAGGTGGCC 
ACCGGCGGAC CGGCCCGGCG GGCGCTGCTG ACCCTGCTGC TCGCCCGCCC CGGCGAGGTG
GTGACGGCCG ACCGCCTCGT CGACGACCTC TACGGCGGGC GTCCGCCGGC CGGGGCGGCG
CACGCCCTGC AGTCCCAGGT ATCCCGGCTC CGCGGAGACC TGGGGCGGGA GGCCGCGATC
GAGCTCCTCC CCGCGGGCTA CCGGCTCGCG GTGGATCCGG ACGACGTCGA CGCCCACCGC
TTCGAGCGGC TCTCCGTGGA GGGGCGGCGG TCCCTGGAGG CCGGGGACCC CGGGCAGGCG
TCCGCGCTCC TGCGCGCGGC CCTCGGCCTG TGGCGGGGCC CCGCGCTCGC CGACGCCGCG
GACGCCGGCT CCGTCCAGGC CCAGGCCCTG CGGCTGGAGG AGCGGAGGGC CGGCGCCCTG
GAGGACCGGA TCGAGGCGGA CCTGCGGCGG GGCGGGCATC GGGCCGTCAT ACCCGAGCTG
CGGGAGCTGG TCGGCCGGCA CCCCCTGCGG GAGAGGCTGC ACGGCCTCCT CATGCGGGCA
CTGCGGGCCG ACGGCAGGCA GGCCGAGGCC CTCGTCGCCT TCGAGGAGGC ACGGCGGCTC
CTGGCCGAGG AGCTCGGCGC GGACCCCTCG GCCGAACTCG CCGCGCTCCA CCGGACCCTG
CTCCGGGGCG AGCCGCCGCC CGCGGCTTCC GCCCCCCATC GGGCCCCGTC ACGGGGCGGG
CGGCCGTTCG AGACCCCCGC GCCCCACCGG GCCGCGCCCC GGGGCGGGCG GGCGTTCGAG
ACCCCCGCGC TCCCCGCGCA GCTGACGAGC TTCGTCGGCC GCGCGGAGGA CGTGGCCAAG
GTCACCGAGC TGCTCGGCAC GGCGCGCCTG GTCACCCTGA TCGGTCCCGG CGGCGCGGGC
AAGACCCGGC TGTCGATCGA GGTCGCCGCC CGCCGGCCCG ACGTGTGCCT CGTCGAGCTC
GCGCCGCTGC GCGACGGGCC GGAGCCGGCG CCGGCCGTGC TCGGCGCGCT CGGCCTGCGC
GAGAGCGGGC TCCTGGCCAT GCCGGCCGGG ACGACGCCCA TCTCCCGGCT GATCGCCGCG
CTGGCCGACC GGCCGCCGCT GCTCGTCCTC GACAACTGCG AGCACATCGT GGAGGCGGTC
GCGGCGCTGG CCGGACGGTT GCTCGCGGGC TGCCCGGAGC TGCGCATCCT GGCGACGAGC
CGGGAACCGC TCGCCATCAC CGGCGAGCAC CTGTGGCCGG TCCGCCCGCT CGCCCCCGCC
CCGGCCGCGC GCCTCTTCGC CGACCGGGCG GCCGCCGTAC GGTCGGGCTT CGTCGTGGAC
GGCGCCGACG CCGAGGTCGT ACGGCGGATC TGCCGGGCGC TCGACGGCCT GCCGCTGGCG
ATCGAGCTCG CGGCGGCGAG AATGCGCACG CACGACCTCG CCGAGCTCGC CTCCCGCCTG
GACGACCGGT TCCGCCTGCT GTCGCGGGGC AGCCGTACGG CCGAGGCCAG GCACCAGACG
TTGCGCGCGG TCGTGGCGTG GAGCTGGGAT CTGCTGTCGG AGGCCGAGCA GACGATGGCC
AGGCGCCTGA CGGTCTTCTC CGGCGGCGCG ACCGCCGAGG CGGCGGCGCG GGTCTGCGGG
GGACCCGACG CCGGGGACGT GCTGGACTGG CTCGCCGACA AGTCGCTGCT GGAGGTCGGC
GACGGCCGCT ACCGCATGCT CGACACGATC CACGCCTTCT GCGCCGAGCG GCTGGACGCG
GCCGGGGAGT CCGGCGCGCT GCGGCGCGCG CACGCGGAGC ACTTCCTGGC ACTGGCCCAG
GCCGCGGATC CCCGCCTGCG GCGTTCCGGG CAGCTGGAGT GGCTGGGGAT CCTGAGCGCG
GACCACGAGA ACCTCCTGGC CGCGCTGCGC TGGGCCGTGG AGGCGGGAGA GGTCGAGCTC
GCGCTCAGAC TGCTCGCCTC GTTGTCGTCA TATTTCTGGA TGCGCGGCAT GCGCACCTCG
GCGACGGCGC AGGCGGTCGC GCTCCTGGAC ATGATCGGTG ACAACCCTCC TCCGGACCTC
GGCGACGAGT ACGTGCTCTG CGCGTTGACG GCGGCGGCCA GCGACGCCGG CCGGGAGGCG
TGGGAGCGCC ACCGGGCCAC GGCCGAGTCG ATCGTCGTCG ACCCGGACCG GCCACGCCGT
CACCCGGTCA TCACCCTCCT CTGGCCGATG ATCAACGCCG GTGCAGGAGA CCCGCGCGCC
GCGCTCTCGG TGATCGCTCG CGGGCGGGTC GGGTCCGACC CGTGGGAGAG GGCCGTGGTG
CATCTGCTGT GGGGATACCC GCAGCTCGCC GCCGGGGACC TGGCGCAGGC CGAGCACGAG
TTCGCCTCGG CCGCCGACGC GTTCCGCTCG CTCGGCGACC GGTGGGGGAC GTCGCTGGCT
CTGGACGCGC TGGCCGGTCT GGCGGGCCTG TACGGCGACC CCTCGAAGGC GATCGTCCTG
ATCGACGAGG CCCTCACCCT CACCGAGCAG CTCGGCGCGG TCGAGGACCT CCCCGACCTG
CTCTGCAACC GGGGCGACCA CCGCGTCCGC ATCGCACTGG CCGGGCGGGC CGCGCCGGAT
GGGCGGGACG GGCCGGACGG GCCGGACGCG GCGGGCACCG GTCTCGCGGA GGCGCGCGCC
GACTACGAGC GGGCGGCCGA GATCGCGCGC CACGCGGGCA GCCCCACCTA CCTGGCCGGG
GCGCTGCGCG GCCTCGGCGA CATCGCGCGG TTGGAGGGCG ACCCTGCCGG GGCGCGCCGG
CTGTACGAGC AGGCGATGGA ACGGTTCGAG ACGCACTGGG TGAAGAGCGC GGGCAACCGG
ACGGCCACCC TCTTCGGCCT CGGCAAGGTC GCCGAGGCCG CCGGCGACCT GGACGGGGCC
CGTTCCCTGC ACCGGCAGGC CGTCGAGGTT GCGACGGCGG CGGGATCGAT CGTCGAGAGC
GCCCGCGCCG TCGAGGCCCT GGCGGGGCTC GCGCTGCTGG AGGGCGACGC GCCGGCCGCG
GCCCTGCTGC TCGGCGCGGC CGCCGGCCTG CGGGGGATCC TCACCGAGGA CGATCCCGAG
GTCTCCCGGA CGGCCGCCGC CACGAGGGCG GCCCTGGGCG CGGGGACGTA CGAGGCCGCC
CACCGCCGGG CCGCGCGGCT GCCCCAGGAG GACGCGCTCC GCCTGGCCGG TGTGCCCGAG
GCCGTCATCC AGGCGTCGCC GATCAACGCC GTCGCCGGCT ACCAGGCCCG GAACTCGGGT
CCGTGCGACC AGCCGCAAAG TAGCTGTGGG TAA
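
The gene sequence is printed in blocks of ten bases, so it has to be joined into one string before it can be measured or compared with the GC content field. The following is a minimal sketch of that step, assuming the blocks are separated only by spaces and line breaks. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here as input.

```python
def join_blocks(text):
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGCGATTCG GGATCCTGGG CACCACGCGG GTGTGGCGTG ACGACGGGAG CGAGGTGGCC"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to the full sequence text should give a length that can be compared with the Gene Length field and a GC fraction that can be compared with the reported GC content.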
 
Protein sequence
MRFGILGTTR VWRDDGSEVA TGGPARRALL TLLLARPGEV VTADRLVDDL YGGRPPAGAA 
HALQSQVSRL RGDLGREAAI ELLPAGYRLA VDPDDVDAHR FERLSVEGRR SLEAGDPGQA
SALLRAALGL WRGPALADAA DAGSVQAQAL RLEERRAGAL EDRIEADLRR GGHRAVIPEL
RELVGRHPLR ERLHGLLMRA LRADGRQAEA LVAFEEARRL LAEELGADPS AELAALHRTL
LRGEPPPAAS APHRAPSRGG RPFETPAPHR AAPRGGRAFE TPALPAQLTS FVGRAEDVAK
VTELLGTARL VTLIGPGGAG KTRLSIEVAA RRPDVCLVEL APLRDGPEPA PAVLGALGLR
ESGLLAMPAG TTPISRLIAA LADRPPLLVL DNCEHIVEAV AALAGRLLAG CPELRILATS
REPLAITGEH LWPVRPLAPA PAARLFADRA AAVRSGFVVD GADAEVVRRI CRALDGLPLA
IELAAARMRT HDLAELASRL DDRFRLLSRG SRTAEARHQT LRAVVAWSWD LLSEAEQTMA
RRLTVFSGGA TAEAAARVCG GPDAGDVLDW LADKSLLEVG DGRYRMLDTI HAFCAERLDA
AGESGALRRA HAEHFLALAQ AADPRLRRSG QLEWLGILSA DHENLLAALR WAVEAGEVEL
ALRLLASLSS YFWMRGMRTS ATAQAVALLD MIGDNPPPDL GDEYVLCALT AAASDAGREA
WERHRATAES IVVDPDRPRR HPVITLLWPM INAGAGDPRA ALSVIARGRV GSDPWERAVV
HLLWGYPQLA AGDLAQAEHE FASAADAFRS LGDRWGTSLA LDALAGLAGL YGDPSKAIVL
IDEALTLTEQ LGAVEDLPDL LCNRGDHRVR IALAGRAAPD GRDGPDGPDA AGTGLAEARA
DYERAAEIAR HAGSPTYLAG ALRGLGDIAR LEGDPAGARR LYEQAMERFE THWVKSAGNR
TATLFGLGKV AEAAGDLDGA RSLHRQAVEV ATAAGSIVES ARAVEALAGL ALLEGDAPAA
ALLLGAAAGL RGILTEDDPE VSRTAAATRA ALGAGTYEAA HRRAARLPQE DALRLAGVPE
AVIQASPINA VAGYQARNSG PCDQPQSSCG