Gene Sros_3699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3699 
Symbol 
ID8666987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4094635 
End bp4096872 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content73% 
IMG OID 
Productheavy metal-transporting ATPase 
Protein accessionYP_003339365 
Protein GI271965169 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0922687 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTCCC TCACCGACGA CAGACCCCCG AGCGCGGTCG AACTCTCGAT CGGCGGCATG 
ACCTGCGCGT CCTGCGCCAA CCGGATCGAG CGCAAGCTGA ACAAGCTCGA CGGCGTCACC
GCGACCGTCA ACTACGCCAC CGAGAAGGCC AAGGTCACCT TCCCCGAGGG CGTGGATCCC
CAGACGCTGA TCGCCGAGGT GGAGAAGGCG GGCTACACCG CCGAGCTGCC CGCCCCGCCC
AGTGCCGAGG GCGCGCCGCA GGAGCCTCAC GACGAGCTCC GCTCCCTGCG CGCCCGGCTG
ATCACCTCCG TGGTGCTCGC CGTGCCGGTG ATCGCGATGG CGATGATCCC GCCCCTGCAG
TTCACCAACT GGCAGTGGCT GTCGCTGACC CTCGCCGCGC CGGTCGTGGT CTACGCGGGC
TGGCCATTCC ACAAGGCCGC CTGGACCAAC CTGCGCCACG GCGCCGCCAC CATGGACACC
CTGGTCTCGC TCGGCACGAT CGCGGCACTG GGCTGGTCGC TGTGGGCGCT GTTCTTCGGC
AGCGCGGGCA CCCCGGGCAT GACGCACCCG TTCGCGTTCA CCATCGAGCG CACCGACGGC
TCGGGCAACA TCTACCTTGA GGCCGCCGCG GGCGTGACGG CCTTCATCCT GGCCGGCCGC
TACTTCGAGG CCCGTTCCAA GCGCCGCGCC GGGGCGGCCC TGCGCGCCCT GCTGGAGCTC
GGCGCCAAGG ACGTCGCCGT GCTACGCGAC GGCCGCGAGG TCCGGATCCC GTCCGACCAG
CTCAAGGCCG GCGACCGGTT CGTGGTCCGG CCGGGTGAGA AGATCGCCAC CGACGGCGTG
GTCGAGGAGG GCTCCTCCGC GGTCGACGCC TCCATGCTCA CCGGCGAGTC AGTGCCGGTG
GAGGTACGGC CCGGTGACAC CGTGACCGGC GCGACCGTCA ACGCCGGGGG CCGCCTGGTC
GTCCGCGCCA CCCGCGTCGG CTCCGACACC CAGCTCGCCC AGATGGCCAA GCTGGTGGAG
GACGCGCAGA CCGGCAAGGC GCAGGTCCAG CGGCTGGCCG ACCGCATCTC CGGCATCTTC
GTCCCGATCG TGATCGCCCT GGCCGTCGGC ACGCTCGGCT TCTGGCTCGG CACCGGCGGC
GGCGCCGGCG CCGCCTTCAC CGCCGCGGTG GCCGTGCTGA TCATCGCCTG CCCCTGCGCC
CTGGGCCTGG CCACCCCGAC CGCGCTGCTG GTGGGGACCG GCCGGGGCGC CCAGCTCGGC
ATCCTGATCA AGGGCCCCGA GGTGCTGGAG TCCACCCGCG CCATCGACAC CGTCGTGCTC
GACAAGACCG GCACCGTCAC CGAGGGCAAG ATGACCCTCA CCGACGTGCA CCTCGCCGAC
GGCGAGGACC ACGACGAGGT GCTGCGCCTG GCCGGCGCCC TGGAGCACGC CTCCGAGCAC
CCCATCGCCC AGGCGATCGC CCGGGGCGCT GCCGAGCGGG TGGGAGAGCT GCCCGCACCG
GAGGACTTCG CCAACGTCGA GGGGCTCGGC GTGCAGGGCA TCGTCGACGG GCACGCCGTG
CTGGTCGGCC GTCCCCGGCT GCTGGCCGAG TGGTCGCAGC ACCTGTCCGC CGAGCTGGAG
CGGGCGCTGC AGGAGGCTCA GGCCGCCGGC CGTACGGCTG TCGCGGTCGG CTGGGACGGC
AAGGCCCGCG CGGTCCTCGT CGTGGCCGAC ACCGTCAAGC CGACCTCGGC CGAGGCGATC
AGGCAGCTGC GCGCCCTGGG GCTGACCCCG GTGCTGCTCA CCGGCGACAA CGAGGCCGTC
GCCCGGTCCG TGGCCGCCGA GGTGGGCATC GACGAGGTGA TCGCCGAGGT CCTGCCCGCC
GACAAGGTCG ACGTGGTCAA GCGCCTGCAG GCCGAGGGCC GGTCGGTGGC CATGGTCGGT
GACGGCGTCA ACGACGCCGC CGCGCTCGCC CAGGCCGATC TGGGCCTGGC CATGGGCACC
GGGACGGACG CGGCCATCGA GGCCTCCGAC CTCACCCTGG TCCGCGGCGA CCTGCGGGTG
GCCGCCGACG CCATCCGCCT GTCCCGCCGC ACCCTGCGCA CCATCAAGGG CAACCTGTTC
TGGGCCTTCG CCTACAACGT GGCCGCCCTG CCCCTGGCCG CGCTCGGCCT GCTCAACCCG
ATGATCGCCG GAGCCGCCAT GGCGTTCTCC TCGGTCTTCG TGGTCAGCAA CAGCCTGCGG
TTGCGCGGCT TCAAGTAA
 
Protein sequence
MSSLTDDRPP SAVELSIGGM TCASCANRIE RKLNKLDGVT ATVNYATEKA KVTFPEGVDP 
QTLIAEVEKA GYTAELPAPP SAEGAPQEPH DELRSLRARL ITSVVLAVPV IAMAMIPPLQ
FTNWQWLSLT LAAPVVVYAG WPFHKAAWTN LRHGAATMDT LVSLGTIAAL GWSLWALFFG
SAGTPGMTHP FAFTIERTDG SGNIYLEAAA GVTAFILAGR YFEARSKRRA GAALRALLEL
GAKDVAVLRD GREVRIPSDQ LKAGDRFVVR PGEKIATDGV VEEGSSAVDA SMLTGESVPV
EVRPGDTVTG ATVNAGGRLV VRATRVGSDT QLAQMAKLVE DAQTGKAQVQ RLADRISGIF
VPIVIALAVG TLGFWLGTGG GAGAAFTAAV AVLIIACPCA LGLATPTALL VGTGRGAQLG
ILIKGPEVLE STRAIDTVVL DKTGTVTEGK MTLTDVHLAD GEDHDEVLRL AGALEHASEH
PIAQAIARGA AERVGELPAP EDFANVEGLG VQGIVDGHAV LVGRPRLLAE WSQHLSAELE
RALQEAQAAG RTAVAVGWDG KARAVLVVAD TVKPTSAEAI RQLRALGLTP VLLTGDNEAV
ARSVAAEVGI DEVIAEVLPA DKVDVVKRLQ AEGRSVAMVG DGVNDAAALA QADLGLAMGT
GTDAAIEASD LTLVRGDLRV AADAIRLSRR TLRTIKGNLF WAFAYNVAAL PLAALGLLNP
MIAGAAMAFS SVFVVSNSLR LRGFK