Gene Sros_4803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4803 
Symbol 
ID8668097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5319771 
End bp5322059 
Gene Length2289 bp 
Protein Length762 aa 
Translation table11 
GC content72% 
IMG OID 
Productheavy metal translocating P-type ATPase 
Protein accessionYP_003340369 
Protein GI271966173 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.457318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.13713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGGT TCGCCACCAT CGCCCTTCTC CTGGTCACTG TGACCGGATT ACTGGCCGGA 
CTCATGCTTC ACCTGGCCGG TGCGGAGGCG GCCGGCGACA TCGTGTGGAC GGTGGTCACC
GTCGCGGCTC TGGTGCCCGC GACCGGATGG GTGATCTCGG CGCTGCGCAG CGGCCGGTTG
GGCGTGGACG CCATCGCCGT GCTCGCCCTT GCCGGGGCGC TCGCCGTACG GGAGTATCTC
GCCGGAGCGT TGATCGGCGT GATGCTCGCG ACCGGCCGGG CGCTGGAGGA CTACGCCCTG
CACCGGGCCA GGAGTGACCT GAACGCCCTG TATGAACGCG CGCCGCATTC GGCCCGGCGC
TATGAGGACG GGGTGCCTCG CCTTGTGCCG GTCGAGCAGG TGTGCCCCGG TGACCTGCTA
TTGATCCCCA GCGGGGAGAT CGTCCCCGTC GACGGGCTGG TGACGGGCGA TCTCGCGTTG
CTGGACGAGT CTGCGCTGAC GGGGGAGTCG ATACCGGTGG AGCACGCCGT GGGCGAGGGC
GTGCGCAGCG GTGTGGTGAA CGCCGGTGCC GCCTTCGGCC TGCGGGCCAC CCGGACCGCG
GCGGACAGCA CCTATGCGGC AGTGGTGGCG CTGGCGCGGC AGACCGAAAC CGGCAGCGCC
CCCGTCGTAC GGCTCGCCGA CCGGTTCGCC GCCTGGTTCC TGCCGGCCAC GCTCCTGCTG
GCGGCCGCCG CCTGGCTGCT GTCCGGCGAG CCGGTGCGGG CCGTTGCCGT GCTGGTCGTC
GCCACCCCTT GCCCGCTGTT GCTCGCCGCA CCCGCGGCCA TCGCCTCCGG ACTGTCTCGC
ACGGCGCGAC ACGGGGTGGT GGTGAAGGGC GGCGGAGCCT TGGAGAGACT GGGCCAGGCC
CGCACGCTCA TCCTGGACAA GACCGGCACG CTGACCTCCG GACGGCCGCA GGTGGTCGAC
GTCGTGGCCG CCCCCGGCGC CCGCGCCGAT GATGTGCTGC GCCTGGCCGC CGCGGTGGAC
CAGATGTCCT CTCACGTGCT CGCCGCTGCG ATCGTCGACG CCGCCCGCGT GCACCGGGCC
GTGCTGCCGC AACCGGCTGA GGTCCGGGAG AAGCCCGGCA CCGGGACCAC GGGGGTGGTG
GAGGGGCACC GGATAGAGGT CGGCAAGGCC ACGGCCCCCT CTCGCTCGGA ATGGGAGCGG
GCCCAGCGGG CGCGGGCCGC GTTGGACGGC GCCATGACCG TCTGGGTGAC GGTGGATGGC
CAGGCCAGCG GAGTGATCCT GCTCCGCGAC GCCATACGGG CCGATGCCGC GCGGACTCTG
AGACGGCTGC GCAGCGCCGG GATCGAGCGG ATGGTCATGC TCACTGGGGA CCGGGCCGAA
GTCGCCGAAA GCGTGGGCAT CGTCCTCGGC GTCGACCAGG TGCTGGCTGA ACAGACCCCC
GCCGCCAAGG TCCGGGCGGT ACGGCAGGAG GCGGCCAGGG CGGTGACCGT GATGGTAGGC
GACGGGATCA ACGACGCCCC GGCCCTGGCG GCGGCCGACG TCGGGGTGGC GATGGGCGCC
CGGGGCTCGG CCGCCTCCAC CCAGGCAGCC GACGTAGTGC TCACCACTGA CCGCCTGGAC
CGGCTCGCTG ACGCCATGGA CGTCGCCCGC CGCTCCCGCC GCATCGCCGT ACAGAGCGCG
GCCATGGGGA TGGCGCTGTC ACTGCTGGCC ATGGCGGCCG CCGCAGCCGG AGCACTGGTG
CCTGCGGTCG GCGCCCTCCT GCAGGAAGCC ATCGACATAA CGGTGATCGT CAACGCGCTA
CGGGCGCTGC GCCCCGTGCG GGGAACTCGG ACGGTGGTCG ACCCCGCCAC CGAGGCCCTG
CTACGCCGAT TCGAGACGGA GCACTCCTCA CTGCGACCGC CCCTGGAACT GATCCGCGAG
ACAGCTGACG AACTGGGCGA AACTCCCTCT GCGGCGTCCC TGGACCGGCT GCGAGAGGTC
CACAGTTTCC TCACCGAACG CCTGCTGCCC CATGAGCGGG CCGAGGAACA GCGGCTGTAT
CCGGCGATGG GGCAGGTGCT GGGCAGTCTC GAGGCCACGA TGACCATGAG CAGGGCGCAC
GCCGAGATCG AACGCCTCGT GCGGCGTGTG GGAAATCACC TTGCCCTCAC CGAGACCGAC
GGGGTGCGCG CGGAGCAAAT GGACGACCTG CGGGCCTGCC TCTACGGTCT GCACGCCGTC
CTGGTCCTCC ACTTCGACCA GGAGGAGGAG GCCTACTTCT CCCTGGCTGC CAACTCGGCG
GCAGAATGA
 
Protein sequence
MRRFATIALL LVTVTGLLAG LMLHLAGAEA AGDIVWTVVT VAALVPATGW VISALRSGRL 
GVDAIAVLAL AGALAVREYL AGALIGVMLA TGRALEDYAL HRARSDLNAL YERAPHSARR
YEDGVPRLVP VEQVCPGDLL LIPSGEIVPV DGLVTGDLAL LDESALTGES IPVEHAVGEG
VRSGVVNAGA AFGLRATRTA ADSTYAAVVA LARQTETGSA PVVRLADRFA AWFLPATLLL
AAAAWLLSGE PVRAVAVLVV ATPCPLLLAA PAAIASGLSR TARHGVVVKG GGALERLGQA
RTLILDKTGT LTSGRPQVVD VVAAPGARAD DVLRLAAAVD QMSSHVLAAA IVDAARVHRA
VLPQPAEVRE KPGTGTTGVV EGHRIEVGKA TAPSRSEWER AQRARAALDG AMTVWVTVDG
QASGVILLRD AIRADAARTL RRLRSAGIER MVMLTGDRAE VAESVGIVLG VDQVLAEQTP
AAKVRAVRQE AARAVTVMVG DGINDAPALA AADVGVAMGA RGSAASTQAA DVVLTTDRLD
RLADAMDVAR RSRRIAVQSA AMGMALSLLA MAAAAAGALV PAVGALLQEA IDITVIVNAL
RALRPVRGTR TVVDPATEAL LRRFETEHSS LRPPLELIRE TADELGETPS AASLDRLREV
HSFLTERLLP HERAEEQRLY PAMGQVLGSL EATMTMSRAH AEIERLVRRV GNHLALTETD
GVRAEQMDDL RACLYGLHAV LVLHFDQEEE AYFSLAANSA AE