Gene Sros_7166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_7166 
Symbol 
ID8670477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp7912761 
End bp7914116 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content68% 
IMG OID 
Productalpha-glucoside ABC transporter 
Protein accessionYP_003342604 
Protein GI271968408 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0136035 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTCC CCTCACGGGC CGGCCGCGTC ACCCTGGTGG CGGGCACCAT CGGACTGGCG 
CTGGCCGTCA CCGCCTGCGG ATCCACGGAG CCCAAGCCAT CGGCGAGCGG AGGCGGCAGC
ACCTCGGCCG AATGCGCGGC CTTCAGCAAG TACGGCAAGC ACGACGGCAA GACCGTCAGC
ATCTACTCCC CGATCCGCGA CGCCGAGGCC GACCTGTTCC AGGAGGCGTG GAAGCCGTTC
GCTAAGTGCA CGGGGATGAA GATCACTTAT GAGGGCACCG GTGAGTTCGA GGCCCAGATC
CAGGTGCGCG CGGATGGCGG CAACCCGCCG GACATCGCCT TCTTCCCGCA GCCGGGCCTG
CTGGAGCGTT TCGCCAAGGC CGGCAAGCTC AAGCCCGCCT CGGCCGAGGT GAAGAAGCTG
ACCGACGAGG GCTGGTCGCC CGACTGGGCC AAGTACTCCA CCGTCGACGG CACCTTCTAC
GGCGCGCCGC TGGGTGCCAG CGTGAAGTCC TTCGTCTGGT ACTCGCCGAA GATGTTCAAG
GAGAAGGGCT GGGCCGTCCC CACGACCTGG GACGAGCTGA TGACGCTCTC CGGCACGATC
GCCGGCTCCG GCGTCAAGCC GTGGTGCGCG GGCATCGAGT CCGGTGACGC GACCGGATGG
CCGGCCACGG ACTGGATCGA GGACATCCTG CTCCGCGAGA TCGGCCCCGA GGGCTACGAC
GACTGGGTGG GCCACAAGCT CGCCTTCAAC GACCCCAAGG TCGTCGCGGC GGTGGACAAG
GTCGGTGCGA TCCTCAAGAA CGACAAGTTC GTCAACGGCG GCTACGGGCC GGTGAAGTCG
ATCGCCAGCA CCGCCTTCCA GGAGGGCGGC GTCCCGATCA CGCAGGGCAA GTGCGCGCTG
CACCGCCAGG CGTCCTTCTA CGCCAACTTC TGGCCCGAGG GCACGAAGGT GGCCGAGGAC
GGCGACGTGT TCGCCTTCTA CCTGCCGGGC AACGACCCGT CCAAGAAGCC GGTCCTCGGC
GCGGGCGAGT TCGTGGCGGC CTTCACCGAC CGTCCCGAGG TCCAGGCCGT GCAGGCCTAC
CTCGCCTCCG CGGAGTTCCC CAACACCCGG ATGAAGCTGG GGACCTACGT CTCCGCGCGC
AAGGGCGTCG ATCCCGCCAA CGCGCAGAAC CCCATCGACA AGCTCTCCAT CGAGATGCTG
CAGGACCCGG AGACGGTCTT CCGCTTCGAC GGCTCGGACC TGATGCCGGC CGCAGTCGGC
GCCGGCACCT TCTGGAAGGG CATGACCGAC TGGATCAACG GCAAGGACAC CAAGACGGTC
CTCGACTACA TCGAAGCCTC CTGGCCTAAG TCCTGA
 
Protein sequence
MSLPSRAGRV TLVAGTIGLA LAVTACGSTE PKPSASGGGS TSAECAAFSK YGKHDGKTVS 
IYSPIRDAEA DLFQEAWKPF AKCTGMKITY EGTGEFEAQI QVRADGGNPP DIAFFPQPGL
LERFAKAGKL KPASAEVKKL TDEGWSPDWA KYSTVDGTFY GAPLGASVKS FVWYSPKMFK
EKGWAVPTTW DELMTLSGTI AGSGVKPWCA GIESGDATGW PATDWIEDIL LREIGPEGYD
DWVGHKLAFN DPKVVAAVDK VGAILKNDKF VNGGYGPVKS IASTAFQEGG VPITQGKCAL
HRQASFYANF WPEGTKVAED GDVFAFYLPG NDPSKKPVLG AGEFVAAFTD RPEVQAVQAY
LASAEFPNTR MKLGTYVSAR KGVDPANAQN PIDKLSIEML QDPETVFRFD GSDLMPAAVG
AGTFWKGMTD WINGKDTKTV LDYIEASWPK S