Gene Sros_6551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_6551 
Symbol 
ID8669860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7188332 
End bp7191517 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content75% 
IMG OID 
Productalpha-L-rhamnosidase 
Protein accessionYP_003342007 
Protein GI271967811 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.585079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000045666 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACAGG CCGTCGCGGC CGCCCTCGCC GTCACCGATC TCCGGATGGA CGGCCGGACC 
GAACCCCTGG GCCTGGGCGA GAGCGCGCCG AGGTTCTCCT GGCGGCTCGC CGCTCCGGGC
CGGGGCCGGG CCCAGAGCGC CTACCGGATC GTCGTGGGCC GCGAGTGCGA CCCGCTCGCC
GAGGGCGCCG AGGTCCTGTG GGACACCGGC GAGGTCGGCT CGTCCCGGAC GTTCGACATC
GCCTACGAGG GCGCGGAGCT GCGCCCGCGC ACGCGCTACC AGTGGCGTGC GCGGGCGGCC
GACGAGGCCG GCACGTGGGG CGACTGGAGC CCGGCGCACT GGTTCGAGAC CGGGCTCGGC
GACGTCACGG GGTGGACCGC CGACTGGATC GGCACGCCCG CCGTCCCCGG CTCGCAGCGG
CTCCCGTCCC TGGAGAACGT GCCCCGCATC TGGGCCGCCG GTCCGCTCGG CCCGTCCGAG
GCGCGGACCG GCGGTTTCCG CACCCGTTTC ACCGTGCCCG GCGGGGCACG TCCCCTGTCG
GCGTGGCTGG TGATCGGCGG CGCGCTCGAT CCACAGGTCC ATCTCAACGG GATGCCCGTG
GCGGGGGAGG TCCGCGACGG GGCGTTCGTG GCGGACGCCG GCGACCTGAT CGACTCCGGG
GAGAACGTCC TGGCGCTGCG CGCCGGCGCC GTGGACGGCC TGCCCGGTGG GCTGTGCGTG
CGTCTGGAGG TCGTCCTGGA AGGCCGCCCC GCCCTGCCCC AGCTCGTCCC GGTGGCCGGG
GAGAGCGCCG TGACCGTGAC CTCGGACGGC CGCTGGCGGG CGGCGGGTGA GCCCGCGCCC
GGGTTCGAAC AGCCCGGCTT CGACGACACC GGCTGGTCGC TCGCCGAGGA GGTGGGGCTG
CACGGCGACC CGCCGCGGGG ACGCGAGCCG GTCGCGGACC GCCCGAGCCC CTACCTGCGC
CGGGAGTTCG ACGTGCCCAG GCCGGTGAGC CGTGCCCGGC TGTACGCCAC CGCGCTCGGC
GTCTACGAGC TGCACCTCAA CGGCATGCGG GTGTCCGCCG ACCACCTGGC GCCCGGCTGG
ACCGACTACC GCCACCGTGT CACCTACCAG ACCTACGATG TGACGGCACT GGTCGCCGAG
GGGGGCAACG CGCTCGGCGC GGTGCTGGCC GACGGCTGGT ACGCCGGCAA CATCAGCTGG
TTCGGCTCCT TCCAGTACGG CAGGCGGCTG GCCCTGCGCG CCGAACTGGA GATCGTCCAC
GACGACGGCA CCACGACCAG GCTGCGCACC GACTCCGACT GGCGGGCCGG CAGCGGCGCG
ATCAGGTACG CCGACCTGCA GAACGGTGAG CGGCAGGATC TGGCGGCCGA GCCGGTGGGC
TGGACCGCGT GCGGGTTCGA CGACTCCGGC TGGCTGCCCG CCGTACCGGT GAGCCCTCCG
GCAGGGCGGC CGGCGGCGGC CGTCGCCCCG CCCGTCCGGG TGCACGAGGA GCTGGCGCCC
CGGGCGGTGT GGGAATCGAG CCCCGGCGTC TGGATCGCCG ACTTCGGCCA GAACGTGGTC
GGCTGGGTCC GGCTCACCGC CCGCGCCGGC CGGGATCGGC CGGTCGTGCT GCGCCACGCC
GAGGTGCTCG ACCACGAGGG GGCGCTGCAC CTGGCCAACC TGCGCTCGGC CCGCGCCACC
GACGAGTTCC TGCCGCGTGG CGGGGCGGGC GCGGAGACCT TCGAACCCCG CTTCACCTTC
CACGGCTTCC GCTATGTCGA GGTGAGCGGG CTGCGCGAGC CGCTCACCTC GGACGCGATC
CGGGCCCGGG TGGCCTACGC CGCGATGGAG CCGGCGGGCG AGTTCGCCTG CTCCGACGAG
CGGCTCAACA GGCTGCAGGG CAACATCGTC CGGGGCCAGC GGGGGAACTT CCTGTCGATC
CCGACGGACT GCCCGCAGCG GGACGAACGG CTGGGCTGGA CCGGTGACAT CTGGGCGTTC
GCCCCTACGG CGCTGTTCAA CTACGACGCC CGGGCGTTTT TGCACAGCTG GCTCACCGAC
GTGGTGGACG CGCAGGCCGA GGACGGCGCC GTGACCCACG TCGTGCCCGA CGTGCTCTCC
GGCCGCGGCC TGTCACCCAA CCCCAGGGAG GCCGGCTCGC CCGGCTGGGG GGACGCCATC
GTGATGCTGC CCTGGGCGCT GTACCGCCTG TGCGGCGACG CCGACGCCGT GGCCCGGTAC
TACGGGCCGA TGCGCCGCTG GCTGGCCTAC CTGGACAGCC GCTCGACCGA CGGGATCTTC
CCCGACGAGG GCTTCGGCGA CTGGCTGAGC ATCGGGGCCG ACACCCCCAA ACGGCTCGTC
GGCACCGCGA TCTTCGCGCT GTCGGCGCGG CAGCTGGCCG AGCTGGCCGC CGCGCTCGGC
CGCGCCGACG ACGAGCGGGC CTGCCTGGAG GTGTACGCCC GGGTCCGGCG GGCCTTCCGC
GCCGCCTTCG TGCAGGGGCC GGGGGTGGTG GAGAGCGGCA CGCAGACCGC CTACGTGCTG
GCCATCACCG CGGGCCTGCT GGAGGAGGCC GAGCTGCCCC GGGCCGCCGC GCGCCTGGTA
CGCGACATCG AGGCGCGGGG CGGGCATCTG TCCACCGGCT TCCTCGGCAC GCCGTTCCTG
CTCGACGCGC TGACCCGGTC CGGGCACCTC GGCACGGCCT ACCGGCTGCT GCTGCAGGAG
AGCTTCCCGT CGTGGCTCTA CCCGGTCGTG CACGGCGACG CCACGACCAT GTGGGAGCGC
TGGGACGGCT GGAGCCACCA CCGCGGCCTG CAGGACCCGG GCATGAACTC CTTCAACCAC
TACGCCTACG GCGCGGTGGG CGCCTGGATG TACGAGACGA TCGGCGGCCT CGCCCCCGCC
TCTCCCGGCT ACCGGGCCAT CGTGGTACGG CCGCGCCCCG GCGGGGAACT CACCTGGGCC
AGGACGGCGT ACCGGACCAG GCACGGCCGG GTGGAGATCG CCTGGCGGCG GGAGGGCGGG
GACTTCACCC TGGAGGTGAG GGTGCCGCCG AACACCCGTG CCGAGGTCTG GGTCCCCGGC
GGGCCCGCGG GCGTGACCGA GTCGGGCCGC CCGGCGGCCG AGTCGCCCGG CGTCGCCCTC
GACCGGGTCG TGGACGGGCA CGCCGTCTAC GAGGTCGGCT CGGGCTCCTA CGCCTTCCAC
GTCTGA
 
Protein sequence
MEQAVAAALA VTDLRMDGRT EPLGLGESAP RFSWRLAAPG RGRAQSAYRI VVGRECDPLA 
EGAEVLWDTG EVGSSRTFDI AYEGAELRPR TRYQWRARAA DEAGTWGDWS PAHWFETGLG
DVTGWTADWI GTPAVPGSQR LPSLENVPRI WAAGPLGPSE ARTGGFRTRF TVPGGARPLS
AWLVIGGALD PQVHLNGMPV AGEVRDGAFV ADAGDLIDSG ENVLALRAGA VDGLPGGLCV
RLEVVLEGRP ALPQLVPVAG ESAVTVTSDG RWRAAGEPAP GFEQPGFDDT GWSLAEEVGL
HGDPPRGREP VADRPSPYLR REFDVPRPVS RARLYATALG VYELHLNGMR VSADHLAPGW
TDYRHRVTYQ TYDVTALVAE GGNALGAVLA DGWYAGNISW FGSFQYGRRL ALRAELEIVH
DDGTTTRLRT DSDWRAGSGA IRYADLQNGE RQDLAAEPVG WTACGFDDSG WLPAVPVSPP
AGRPAAAVAP PVRVHEELAP RAVWESSPGV WIADFGQNVV GWVRLTARAG RDRPVVLRHA
EVLDHEGALH LANLRSARAT DEFLPRGGAG AETFEPRFTF HGFRYVEVSG LREPLTSDAI
RARVAYAAME PAGEFACSDE RLNRLQGNIV RGQRGNFLSI PTDCPQRDER LGWTGDIWAF
APTALFNYDA RAFLHSWLTD VVDAQAEDGA VTHVVPDVLS GRGLSPNPRE AGSPGWGDAI
VMLPWALYRL CGDADAVARY YGPMRRWLAY LDSRSTDGIF PDEGFGDWLS IGADTPKRLV
GTAIFALSAR QLAELAAALG RADDERACLE VYARVRRAFR AAFVQGPGVV ESGTQTAYVL
AITAGLLEEA ELPRAAARLV RDIEARGGHL STGFLGTPFL LDALTRSGHL GTAYRLLLQE
SFPSWLYPVV HGDATTMWER WDGWSHHRGL QDPGMNSFNH YAYGAVGAWM YETIGGLAPA
SPGYRAIVVR PRPGGELTWA RTAYRTRHGR VEIAWRREGG DFTLEVRVPP NTRAEVWVPG
GPAGVTESGR PAAESPGVAL DRVVDGHAVY EVGSGSYAFH V