Gene Sros_4643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4643 
Symbol 
ID8667937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5162953 
End bp5166099 
Gene Length3147 bp 
Protein Length1048 aa 
Translation table11 
GC content72% 
IMG OID 
ProductATPase-like protein 
Protein accessionYP_003340243 
Protein GI271966047 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.391035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.731274 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATTG AGGTGCTGGG ACCGGTCCGG GCCTATGCCG ACGACGGTGC GCCGATCGAT 
GTCGGCGGGA CGAGGGTCCG TGCGCTGCTG GCCCGACTGG CCCTGGCCGA GGGCGAGATG
GTGTCGGTCG ATGCACTCGT TGACGGCCTG TGGGGGGAGC GTCCGCGCGG GGGCACCGTC
AATGCGCTGC ATGGGCTGGT GCATCGGCTG CGCAAGGCTC TGGGCGGGGC CGGGGTCGTG
GAGAGGGCGG CCGGCGGGTA CCGCCTGCAC GTCCGCGCCG GGGACGTCGA CGCCTGTCGG
TTCGAGGAGA AGGCGAGGCG GGGCGGTCGC GAACTGGCCG CAGGCGCCGC GCAGCGAGCG
GACGCGCTGC TGGACGAGGC GCTGGCGCTG TGGCGGGGAG ACGCGCTGGC CGACGTGCGC
GATGCCCCCT TCGCCGGTAC GGCCGGCGCG CGTCTGGAGG AGCTGCGCGC CGCGGCCGTT
GAAGACCGGT TCGAGGCGGA GCTGAGACTG GGCCGCCACG ACGAGATCCT GGCCGATATG
GCGGCGGTGG CCGCCGGCCA CCCGTTGCGC GAACGGCTGG CCGGATTGCG CATGCGAGCC
TTGCACGCGG CAGGCCGCCG GTCCGATGCG CTGGCCGTGT TCGAACAGGT CCGCGGCACG
CTCGCCGAGG AACTGGGCGT CGACCCGTCC GAGGAACTGC GTAGAACACA CCTGGCCGTG
TTGCGGGGCG AGCTGGAGAT CCCCGAGGCG GAACAGGCGC GGCCGGAGGC GGTGCCGGGA
CGCCTGCCGG CCCAGTTGAC CAGCTTCGTC GGCCGGGCGG AGGAGCTGAG ACTGCTCGCC
GGGTTGCTGG AGACCTCGCG GCTGGTCACC GTCGTAGGAC CCGGGGGAGT AGGCAAGACC
CGCCTGGCCG TGGAGGCGGT GAGCCGGCAT CGGGCCCATC GGCGCGGCCG GGTCTGGCTC
GTTTCCCTGG CCGGGGTGAC CACGGCGGAC GGGCTGCCCG GCGCGGTGCT GGGCACGCTC
AGCGTCGCCG ATGTCCGGCC GTCCGGTACG CCGCTGGAGC GGGTGGTCAA TCTGCTCGCC
GGTGGTGAAG GCGTGCTGGT GCTCGACAAC TGCGAACAGA TCTCCGGACC CGTCGCGGAG
TTCGCCGGGC AGCTGCTGGA GCGCCAGCCG TACTTGACCA TCCTGGCCAC GAGCCGGGAA
CCGCTGGAAG TCATGGGTGA GACGCTGTGC CGTCTGGGCC CGCTCGGACT GCCGCCCGCG
CACGCGGATT CCGCTCAGGC CGGAGAGTCG GCCGCGGTGC GGCTGTTCCT CGATCGCGCG
GCAGCCGTAC AGCCCGGCTT CACGCTGGAC GCGTCGACCG CGGCCTCGGT CGCGGACATC
GTGCGACGGC TGGACGGGCT TCCGCTGGCC CTGGAGTTGG CCGCGGCGCG GCTGCGGACC
ATGAGCGCCG ACCAGGTCGC CCGGCGGCTG GATGACCGCT TCCGGCTGCT CAGCACCGGT
AACCGGGCCG CCCAGCCGCG GCAGCAGACA CTCCATGCGG TCATCGAGTG GAGCTGGGAC
CTCCTCACCG ATCAAGAACG GATGCTGGCC CGCCGAATTT CGATCTTTCT GGCGAGAACC
GGGGTCGCCG CGATCGAGGT GGTCTGTTCG GATGAGGCGC TCCCCGCCGG CGAGGTCATC
TATCTGCTGG ACTCCCTGGT CGACAAGTCC ATCGTGGAGC GGGCCGGCGA CGGCTACCGG
ATGCTGGAGA CCGTCCGAGC CCATTCGGCC GACAAGCTTC GCCTGGCGGG GGAAGCCGAA
GCCGTCCTGC GCAGGCTCGT GCGGCACTTC GCCGACCTGG CCGAGGAACA CGAGCCGCTG
CTGCGCTCGG ACAAGCAGGT GGAGTCGCTG CGGCTGTTCC AGGCCGAGTA CGACAACCTG
ATGTTCGCCC TGCAAACGGC CATCGACACC GGTGACGCCG ATGCGGCGGC CCGCCTCCTC
GGCCCGCTGT ACTGGTACTG GGTCATGCTC CGCTACGACG CTCGGGCCGA TGCCTACGTC
GCCAAGGTCG CCGAGTTCGG CGACGCGCTG CCCGCGGACG CCCGGGCCGC GTTCACCGCG
ATCCACCTGG TGGCCGGCGG GGGCGGGCCG GTCACCGACC CCGAGCGGCT GCGCGCGCTC
ATCGACGACT GCGCGCGCAC GGGCGCGCTG CGGCGTTATC CGATGCTGCT GACGACAGTG
CTGGTGATGG CGGCGATACT CGGGCTGGAC GAGCTGGCCG ACCAGGAGAT CGCCCGGGTG
CGCAGCGGCT CGGACCGCTG GGCGATCGCC TGCACCTTCA TGATCGAAGC CATGCGGTAT
CGCGAACGGG GCGACCGGGA AAGCTCCGCG ACCGCGATGA CAGCGGCGCT GCACGCGTTC
GAGGAGGCGG GCGATCAGTG GTGGACGGCG AAGACGCTGT ACGGCCTGGC GCAGATCCAC
GCCATCGGCG GCGAACACGA CGAGGCGATC GCCGCCTACG AGCACAGCAT CGCGCTCGCC
ACCGGCCTCG GCTCGCAGGA CGAGGTCTCG ACCCGGCTCG GGCTCGCCAC CGAACGCATG
CGCGCCGGCG ACCTGACCGG CGCCCGGCAC GACATCGAAA CCGCCGAACG AGCGGTCTGG
GAGCGCGGTC AGCCCGTGCT GGAGATCGAG GTCCTGGGCA GCCTGGCCGA GCTGTACCGC
CGCTCCGGCG AGATCGAACG GGCCGACCGG GAACTCGACC GGATGGAGAC ACTCGCCCGC
CGGCTGCGCC TTACGGCGGA GACGATCGAG AACATGCTGG TACCCGCCAG GATGGCGAAC
CTCCTCACCG CCGGGGACGC CGCACCCGCG CGCGAGCTGC TGCCCCGTGC CGTGCGGGCG
GCGCGGGCGC ACATGGGCAC CCCTCGGGCC GCCCAGCTCC TGGCCCGGCT GCTGTTCCTG
GAGGACGATC CGGCCGGCGC GGCCACCGCG CTCGGCCTGA GCCAGGCCAT CCGCGGCACC
TTCGACCACG GCGACATCGA GCTGCGCTCC CTTGCGGAGG TGCTCGCGGA ACGGCTCGGC
CGTACCGACT ACGACACCGC CTACCAACGG GGCGCCGGCA TGACACCGCA CGAGGCCACC
GACCGGCTGA CAGAGCTGCG CGTATAA
 
Protein sequence
MRIEVLGPVR AYADDGAPID VGGTRVRALL ARLALAEGEM VSVDALVDGL WGERPRGGTV 
NALHGLVHRL RKALGGAGVV ERAAGGYRLH VRAGDVDACR FEEKARRGGR ELAAGAAQRA
DALLDEALAL WRGDALADVR DAPFAGTAGA RLEELRAAAV EDRFEAELRL GRHDEILADM
AAVAAGHPLR ERLAGLRMRA LHAAGRRSDA LAVFEQVRGT LAEELGVDPS EELRRTHLAV
LRGELEIPEA EQARPEAVPG RLPAQLTSFV GRAEELRLLA GLLETSRLVT VVGPGGVGKT
RLAVEAVSRH RAHRRGRVWL VSLAGVTTAD GLPGAVLGTL SVADVRPSGT PLERVVNLLA
GGEGVLVLDN CEQISGPVAE FAGQLLERQP YLTILATSRE PLEVMGETLC RLGPLGLPPA
HADSAQAGES AAVRLFLDRA AAVQPGFTLD ASTAASVADI VRRLDGLPLA LELAAARLRT
MSADQVARRL DDRFRLLSTG NRAAQPRQQT LHAVIEWSWD LLTDQERMLA RRISIFLART
GVAAIEVVCS DEALPAGEVI YLLDSLVDKS IVERAGDGYR MLETVRAHSA DKLRLAGEAE
AVLRRLVRHF ADLAEEHEPL LRSDKQVESL RLFQAEYDNL MFALQTAIDT GDADAAARLL
GPLYWYWVML RYDARADAYV AKVAEFGDAL PADARAAFTA IHLVAGGGGP VTDPERLRAL
IDDCARTGAL RRYPMLLTTV LVMAAILGLD ELADQEIARV RSGSDRWAIA CTFMIEAMRY
RERGDRESSA TAMTAALHAF EEAGDQWWTA KTLYGLAQIH AIGGEHDEAI AAYEHSIALA
TGLGSQDEVS TRLGLATERM RAGDLTGARH DIETAERAVW ERGQPVLEIE VLGSLAELYR
RSGEIERADR ELDRMETLAR RLRLTAETIE NMLVPARMAN LLTAGDAAPA RELLPRAVRA
ARAHMGTPRA AQLLARLLFL EDDPAGAATA LGLSQAIRGT FDHGDIELRS LAEVLAERLG
RTDYDTAYQR GAGMTPHEAT DRLTELRV