Gene Sros_2983 details


Gene Information

Locus tag: Sros_2983
Symbol:
ID: 8666270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3251978
End bp: 3254113
Gene Length: 2136 bp
Protein Length: 711 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: excinuclease ABC subunit B
Protein accession: YP_003338680
Protein GI: 271964484
COG category:
COG ID:
TIGRFAM ID:
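
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(3251978, 3254113)
print(length)                     # 2136
print(protein_length_aa(length))  # 711
```

Both results agree with the Gene Length (2136 bp) and Protein Length (711 aa) fields listed in the record.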


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.293164
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGCTCCGC CTACAGTGGG TGGAGTGAGG CCGGTAACTG ATTTGCAGCG CAAGGTGGCG 
CCCTTCGAGG TCGTCACCGA CATGACCCCT TCGGGCGACC AGCCCACCGC GATCGCCGAG
CTCGAACGGC GCGTCAAGGC GGGTGACAAG GACAACGTCC TGCTGGGTGC CACGGGCACG
GGCAAGACCG CCACGGTCGC CTGGCTGATC GAGCGGCTGC AGCGGCCGAC CCTGGTCATC
CAGCCCAACA AGACGCTCGC CGCGCAGTTC GCCAACGAGC TGCGCGAGAT GATGCCCAAC
AACGCGATCG AATACTTCGT CTCCTACTAC GACTACTACC AGCCCGAGGC TTACGTCCCG
CAGAGCGACA CCTACATCGA GAAGGACTCC TCGATCAACG ACGAGGTCGA GCGGCTGCGC
CACTCGGCGA CCAACTCGCT GCTGACCCGC CGCGACACGA TCGTGGTGGC GTCGGTGTCG
TGCATCTACG GCCTGGGCAC CCCGCAGGAG TACGTCGACC GCATGACCTC GCTCAAGGTC
GGGCAGGAGA TCGAGCGCGA CAGCCTGCTG CGCCGCCTGG TCGACATGCA GTACACCCGT
AACGACCTGG CCTTCACCCG GGGCACCTTC CGGGTGCGCG GCGACACGAT CGAGATCATC
CCGAAGTACG AGGAGCTCGC GGTCCGCATC GAGATGTTCG GCGACGAGAT CGAGAAGCTC
TCCACCATGC ACCCGCTGAC CGGCGAGGTG ATCACGGAGG ACGAGGAGCT CTACATCTTC
CCCGCCTCCC ACTACGTCGC GGGCACCGAG CGGATGGAGA AGGCCGTCCG GGGCATCGAG
GCCGAGCTGG CCCAGACCCT GGAGACGATG GAGCGGCAGG GCAAGCTCCT GGAGGCCCAG
CGGCTGCGCA TGCGCACCAC CTACGACCTG GAGATGATGC GCCAGATCGG CACCTGCTCC
GGCATCGAGA ACTACTCCCG CCACATGGAC GGCCGCGCCC CGGGCAGCGC CCCCAACACG
CTGCTCGACT ACTTTCCCGA GGACTTCCTG CTCGTCCTGG ACGAGTCGCA CCAGACCGTG
CCGCAGATCG GCGCGATGTA CGAAGGCGAC GCCTCCCGCA AGCGCACGCT CGTCGAGCAC
GGCTTCCGCC TGCCGTCGGC GATGGACAAC CGCCCGCTCA AGTGGGAGGA GTTCCTGGAG
CGGATCGACC AGACCGTCTA CCTGTCGGCC ACCCCAGGCA CCTACGAGCT GGGCCGCTCC
AAGGGCGACG TGGTGGAGCA GGTCATCCGG CCGACCGGCC TGGTCGACCC CGAGGTGATC
GTCAAGCCCA CGAAGTCCCA GATCGACGAC CTGGTCCACG AGATCAGGAC CCGGACCGAG
AAGGACGAGC GGGTCCTGGT CACCACGCTG ACCAAGAAGA TGTCCGAGGA CCTCACCGAC
TACCTCCTGG AGCTCGGCAT CCGGGTCCGC TACCTGCACA GCGAGGTCGA CACCCTGCGC
CGCATCGAGC TCCTCCGGGA GCTGCGGATG GGCGAGTTCG ACGTCCTGGT CGGCATCAAC
CTGCTCCGTG AGGGCCTGGA CCTGCCCGAG GTGTCGCTGG TGGCCATCCT CGACGCCGAC
AAGGAGGGCT TCCTCCGCTC GGAGACCTCG CTGATCCAGA CCATCGGCCG CGCGGCCCGT
AACGTCTCCG GTCAGGTGCA CATGTACGCC GACCGGATCA CCCCCTCCAT GGAGCGGGCG
ATCGAGGAGA CCAACCGGCG CCGGGCCAAG CAGACGGCCT ACAACGAGGC GAACGGCATC
GACCCGCAGC CGCTCCGCAA GAAGATCGCC GACATCCTCG ACTCGCTCAA CCGCGAGGAC
GCCGACACCG CGCAGCTCCT CGGCGGCAGC GGCCGCCAGC AGTCGCGCGG CAAGGCCCCG
GTCCCCGGCT TCGTGGTCAA GCAGGTCGGC CAGCACGCCA AGGCGATCGC GGGGGAGATG
CCCCGCGCCC AGCTGGAGGC GCTGGTCGAA TCGCTCACCG ACCAGATGCA CCAGGCCGCC
ACGGATCTCC AGTTCGAGGT CGCGGCCCGG CTCCGCGACG AGATCAAGGA GCTCAAGCGG
GAGGTCCGAG ACATGCGGGA GGCAGGAGTC TCCTGA
 
Protein sequence
MAPPTVGGVR PVTDLQRKVA PFEVVTDMTP SGDQPTAIAE LERRVKAGDK DNVLLGATGT 
GKTATVAWLI ERLQRPTLVI QPNKTLAAQF ANELREMMPN NAIEYFVSYY DYYQPEAYVP
QSDTYIEKDS SINDEVERLR HSATNSLLTR RDTIVVASVS CIYGLGTPQE YVDRMTSLKV
GQEIERDSLL RRLVDMQYTR NDLAFTRGTF RVRGDTIEII PKYEELAVRI EMFGDEIEKL
STMHPLTGEV ITEDEELYIF PASHYVAGTE RMEKAVRGIE AELAQTLETM ERQGKLLEAQ
RLRMRTTYDL EMMRQIGTCS GIENYSRHMD GRAPGSAPNT LLDYFPEDFL LVLDESHQTV
PQIGAMYEGD ASRKRTLVEH GFRLPSAMDN RPLKWEEFLE RIDQTVYLSA TPGTYELGRS
KGDVVEQVIR PTGLVDPEVI VKPTKSQIDD LVHEIRTRTE KDERVLVTTL TKKMSEDLTD
YLLELGIRVR YLHSEVDTLR RIELLRELRM GEFDVLVGIN LLREGLDLPE VSLVAILDAD
KEGFLRSETS LIQTIGRAAR NVSGQVHMYA DRITPSMERA IEETNRRRAK QTAYNEANGI
DPQPLRKKIA DILDSLNRED ADTAQLLGGS GRQQSRGKAP VPGFVVKQVG QHAKAIAGEM
PRAQLEALVE SLTDQMHQAA TDLQFEVAAR LRDEIKELKR EVRDMREAGV S
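
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below shows one way to do that, using only the first line of the gene sequence as sample input. The helper names `join_blocks` and `gc_fraction` are hypothetical. The gene begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (sample input only).
first_line = "GTGGCTCCGC CTACAGTGGG TGGAGTGAGG CCGGTAACTG ATTTGCAGCG CAAGGTGGCG"
seq = join_blocks(first_line)

print(len(seq))  # 60
print(seq[:3])   # GTG
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Applying the same two helpers to the full gene sequence should give a length equal to the Gene Length field and a GC fraction comparable to the GC content field, assuming that field was computed over the whole CDS.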