Gene Sros_5141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5141 
Symbol 
ID8668435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5652472 
End bp5655690 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content75% 
IMG OID 
ProductATPase-like protein 
Protein accessionYP_003340662 
Protein GI271966466 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.101031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACCTG CGATCGAGTC GGGCGGCGAC AGTGGACGGG TGCGCGTCGG CATCCTCGGT 
CCCCTGGTGC TGGACACCGC TACCGGCCCG ACCCTGGTCG GGGGCGCACG GCTGCGGGCG
TTGCTGGCCC GGCTGGTGCT GGACGCCGGG CGTGCCGTCC GGCCGGCGAC CTTGGTCGAG
GCGCTGTGGG GCGAGGCGGC ACCGGCCGGC CACCTGCACG CGCTGCAATC GCTGGTTTCC
CGGTTGCGGC GCGTCCTGGG CGACCCCGGG CTGCTCACCT CGGGCCCGGC CGGGTACCTG
CTGGCCGTCG AGCCGGACGC GGTCGACGCC GTCCGGTTCG AGCGGCTGGC CCGCGCGGGC
CGGCGCTCGC ACGCGCAGTC CCGGCCCGCC GAGGCGGCCG CCACCTTGCG CGAGGCCCTG
GGCCTGTGGC GCGGCCCCGC CTTGGCGGAC GTGCGCGAGG CACCGTTCGC CGCCGCCGAG
GCCGAACGGC TCGAGCGGGC CCGGTTGGCC GCCCTGGAGG ACCGGGTCGA AGCCGAGCTC
GCGCTGGGTA CCGACCTCGA CCTGGTCGCC GAGCTCGAAT CGCTGACCGC CGCGCACCCG
TTGCGCGAAC GGCTGCACGC CCAGCTGATC CGAGCCCTGG CGCTGAACGG CCGCGGCGCC
GAGGCGCTGG CCGCCTACCA GCGGATCCGC GGCCTGCTGG CGGACAGTTT CGGCAGCGAT
CCCGGACCGC AGCTCCAGGA GGCCCACCTG GCGGTGTTGC GCGGTGAACT CCCCCGCTCC
CGCCGATCGC ACGGCAACCT GGACGTCCCG CTCACCAGCT TCGTGGGCCG CGACGACGAC
GTCCGCCGCG TGGTGGAACT GCTCGGTCGA GTTCGGCTGG TCACCCTGGT CGGCCCCGGA
GGGGCGGGCA AAACCCGGCT GGCCAACGCC ATCGGCCGGC AGCTCACGCC GTCCGGCGGG
GTGTGGTCCG TCCCGCTGGC CCCGGTCGGC GCGGACGACG TGCCCCGCGT GGTGCTCGAC
CTGTCGCGGG TGCGCGAAGA GGGCGTGCCG CGGCCTGTCA CCCCGGAGGA GGTCCTGGAC
CACCTGGCCG AGACACTCGC GGACGACGAC CTGGTGCTGG TGCTGGACAA CTGCGAGCAT
GTGATCGAGG CCGCCGCGAC GCTCGCCGGG GCCCTTCTCG GCCGATGCCC GAGGCTGCGG
GTGCTGGCCA CCAGTCGGGA ACCCCTGCGG ATCGACGGCG AGACCTTGCA CCCGGTGCTC
CCGCTGGAGG TGCCCGAACC GGGGTCGACG GTCGAGCGGG CCCGCGCCTG CGCGGCGGTC
CGGCTGTTCC ACGACCGGGC CGCCGCGGTG CGGCCGGGCT TCACTCCGGA AGGCAGCGCG
CTGACGGCGG CGATCGAGAT CTGTCGCCGC CTGGACGGCC TGCCGCTGGC GATCGAGCTG
GCCGCGGCAC GGTTGCGCGC CCTGCCGGTC GAGGTGGTCG CGGCACGGCT GGACGACCGA
TTCCGGCTGC TCACCGGGGG CAGCCGCACC GCATTGCCGC GACACCGGAC CTTGGGCGCC
GCGGTGGCCT GGAGCTGGGA CCTGCTCGAC ACCGACGAGC GAGTGCTGCT GGAACGGCTG
TCGGTCGTGC CCGGCACCTT CACCGAGGAC GCGGCGGAGG CGATCGGCGG GTTGGGCGAC
GTCCGGGAGC TTCTGACGGC GTTGGTCGAC AAGTCCCTGC TGCACCCGGC GGAGCCTGCC
GACCCGCTCG AGCCGCGCTA CCGCGTGCTG GAGACCATCC GGGAGTACGG CCTCGAGCAA
CTGGCCCGGC GTGACGAGGT CGACGTCGTG CGTGGGAGGC ACGCCGGGTT CTTCCTTCAG
CTGGCCGAGA CCGCGGACCC GTACCTACGC ACATCCGACC AACTGCGCTG GCTGGCCCGC
CTATCCGCAG AACGGGACAA CCTGTCGGCC GCGATCCGCT GGGCGGCCGA GTCCGGCAAC
GCCGACCTGG CGGTCCGGCT CGGCGTCGCG CTGTGCTGGT TCTGGTTCAT GCGGGACCAC
CCGCCGGAGT CGCTGGACCT GCTCGGCCGG GTGCTCCAGG CTCGCGGCCC GACCGAGCCG
CAGGCACGCG CGCTGGTGGT CGCCGCGCAC GCGCTCGCCA CCACTGAGGC CATCAGCCGA
CCGGACGAGT CGGAAGCGGC CTTCGACCGG ATCGGGAAGG CGCTGGAGCA CGTCACCCCC
GGCACCCATC CCATTCTGGA GATGGCCCGG TTGGCCCTCG CCGTAGGCTC GGGACGCGAT
CGAACCGCAC CGGACATGCT CGGTTCCCCG GACGATCGGA CGGATCCGTG GAGCCGATCG
CTGGCTCTGC TGGTTCGAGG CGTGCTGACC ATGAACACCG GGCACGCCGC CGAGGCGACG
CACTCGTTGT CGCGCGCGCT GACCGGCTTC GAGGAACTCG GCGAGCGGTG GGGCCTGGGG
ATCACGCTGA GCACCCTGAA TTCGGCGCTG CAGCGGTCCG GCGACCCGGC CGGTGCGCTG
GGGCTGGCCG AGCGGGCCGG CCGGTACTTC CGGGAGCTCG GCATGCCCGA GCACACGATG
GAGAGCGAGG TGGCGGCCGC GCTGCATCTG GCCCAAGCCG GTGACGTGGA TGGCGGGCGG
CGGCAGCTGA CGGACCTGCT CGACCAGGTC GAGCGGACCG GGTCGGCGGA GTCGCAGGCT
CAGGTGTGCC TGGGCCTGGC ACGGCTGGAG TGGCGGGCCG GGCGGCCGGG GTCGGCCCGC
GAGCACGCCC AGGCCGGCTT GACCGAGGCG CCACCCGGCC GGTCGACCCC GCACCTGGCC
GCGTTGCTGC TGGGCGTGCT CGCCCACGTG GACGTCGCGG AGGGTTCCCC GGACAAGGCG
GTACGCCGGC TCGACCATCC GGCGGTGCAC CTGACGCTGA CCTGGCATAC GCCGGTCGCG
GCCTCGATCG CGGTCGTGGT CGCCGCGATC GAGCTCTGCC GCGACCAGCC GGAGCGGGCG
GGACGGCTGC TCGGGGTCGC CACCGTGCTG CGTGGCTGGG ACGACCACGG CGACGCCGAC
GTGCTGATGA TCACACAGCG GGCCGCGGCT GCCCTGGGCG CTGACGGGTT CGCGGCCGCG
CACGCCTCGG GTGCGGCGAT GTCGCGGGCC GAGGCCGAAA GCCTGATGTC CGCGATCATC
ACCGCACCCG AGGCCGGCCG TACCGGTCAG CCGGCGTGA
 
Protein sequence
MAPAIESGGD SGRVRVGILG PLVLDTATGP TLVGGARLRA LLARLVLDAG RAVRPATLVE 
ALWGEAAPAG HLHALQSLVS RLRRVLGDPG LLTSGPAGYL LAVEPDAVDA VRFERLARAG
RRSHAQSRPA EAAATLREAL GLWRGPALAD VREAPFAAAE AERLERARLA ALEDRVEAEL
ALGTDLDLVA ELESLTAAHP LRERLHAQLI RALALNGRGA EALAAYQRIR GLLADSFGSD
PGPQLQEAHL AVLRGELPRS RRSHGNLDVP LTSFVGRDDD VRRVVELLGR VRLVTLVGPG
GAGKTRLANA IGRQLTPSGG VWSVPLAPVG ADDVPRVVLD LSRVREEGVP RPVTPEEVLD
HLAETLADDD LVLVLDNCEH VIEAAATLAG ALLGRCPRLR VLATSREPLR IDGETLHPVL
PLEVPEPGST VERARACAAV RLFHDRAAAV RPGFTPEGSA LTAAIEICRR LDGLPLAIEL
AAARLRALPV EVVAARLDDR FRLLTGGSRT ALPRHRTLGA AVAWSWDLLD TDERVLLERL
SVVPGTFTED AAEAIGGLGD VRELLTALVD KSLLHPAEPA DPLEPRYRVL ETIREYGLEQ
LARRDEVDVV RGRHAGFFLQ LAETADPYLR TSDQLRWLAR LSAERDNLSA AIRWAAESGN
ADLAVRLGVA LCWFWFMRDH PPESLDLLGR VLQARGPTEP QARALVVAAH ALATTEAISR
PDESEAAFDR IGKALEHVTP GTHPILEMAR LALAVGSGRD RTAPDMLGSP DDRTDPWSRS
LALLVRGVLT MNTGHAAEAT HSLSRALTGF EELGERWGLG ITLSTLNSAL QRSGDPAGAL
GLAERAGRYF RELGMPEHTM ESEVAAALHL AQAGDVDGGR RQLTDLLDQV ERTGSAESQA
QVCLGLARLE WRAGRPGSAR EHAQAGLTEA PPGRSTPHLA ALLLGVLAHV DVAEGSPDKA
VRRLDHPAVH LTLTWHTPVA ASIAVVVAAI ELCRDQPERA GRLLGVATVL RGWDDHGDAD
VLMITQRAAA ALGADGFAAA HASGAAMSRA EAESLMSAII TAPEAGRTGQ PA