Gene Sros_5002 details


Gene Information

Locus tag: Sros_5002
Symbol:
ID: 8668296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 5527605
End bp: 5528885
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003340544
Protein GI: 271966348
COG category:
COG ID:
TIGRFAM ID:
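
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted as a residue. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5527605
end_bp = 5528885

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1281 426
```

Under these assumptions the computed values agree with the reported 1281 bp and 426 aa.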


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.287585
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.158884
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAGGT ACTGGACGGG ACTGGCCATG GCGGCCGTCA TGGTCGTCTC GGGGTGCGGG 
CTCGACGCGG CCGACCCGCA GGACGAGGCC GCGCCGACCG CGGCCGCCAC CGGCGAGGTG
AAGGGCAAGG TCACGCTGCA GACGTGGGCG CTGAAGCCCA AGTTCACCTC GTACATGGAG
AGCGTGATCA GCGCCTTCGA GAAGAAGCAC CCGGGCACCG ACGTGGCGTG GCTCGACCAG
CCCGCCGACG CCTACTCGGA CAAGGTGCTC AGCCAGGCCG CGGGCGGCAC TCTGCCCGAC
GTCACCAACC TGCCACCCGA CTTCGCCCGT CCGCTGGTCA AGCAGGGCAT GCTGCTCGAC
GTGTCCACGG TGGACTCCAA GCTGGCCGAC GAGTACGTCG CCGGTGGCCT CGACGCATAC
AGGTTCAGCG GCCACCAGGG CACCTACGGC TACCCGTGGT ACCTCAACAC GGACGTCAAC
TACTGGAACA GCGAGTTGAT GGCGGAGTAC GGGCTGGACC CGAAGCAGCC GCCGGCGTCC
TTCGACGAAC TGGTCGAGCA GGCCCGGACC ATGAAGGAGA AGTCGGGCGG CAAGGTCCTG
CTGATGAGCC GCCGCCCTGA GATCGGGGAC CTGAGGGAGG CGGGGGTCAA GCTCCTGTCC
GACGACGGCA AGAAGTTCGT CTTCAACACG CCCGAGGCCG CCGCACTGGT CGACAAGTAC
CGCGCCGCCT TCAAGGAGGG CCTGATGCCG CGCGACGTGC TGACCGACAC CTACGCCGGC
AACGCCAAGC TGTTCAACGA AGGCGCGGTG GCCTGGACCA CCGGCGGCGG CAACCACATC
ACGAGCGTGA CCAACGAGAA CCCCTCGCTC GCGCCCAAGA TCGTGGCCTC CCCCGACTTC
GGTACCCCGC CGCTGTACGT TCAGGGCCTG TCCATCTCGA AGAAGACCAC GAACCTGCCG
ACGGCGATCG CGCTGGCCCG CTGGGTGACC AGCCCGGAGA ACCAGGCCGC GTTCGCCCAC
CTGGTCAGCA TCTTCCCGAG CACGAAGTCC TCGGCGAACG ACCCGTTCTT CAGCAAGAGC
GACGGCACCA ACGCCGGCGA CGCCAAGGTC ATCGCCTTCA ACTCGCTGGC CAAGGCGGAG
GTTCTCCAGC CCTACGAGGT CACCGACGCG ATGAGCAAGT TCATCCGGCA GCAGATCTCG
GCCGCCCTCA ACGGTCAGAT CTCCTCGAAA GAGGCCCTGG ACGCCGCGGT CGCCAAGTGC
AACGAGCTGC TCAACCAGTG A
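
The record reports a GC content of 67% for this gene. A figure of this kind could be computed from the gene sequence roughly as sketched below. Whether the database derives its value exactly this way is an assumption, and `gc_fraction` is a hypothetical helper name. For brevity only the first line of the sequence is used, so the printed value need not equal the gene-wide figure.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence, including the spaces between 10-base blocks.
first_line = "ATGCGTAGGT ACTGGACGGG ACTGGCCATG GCGGCCGTCA TGGTCGTCTC GGGGTGCGGG"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 1281-base sequence instead of `first_line` gives the value to compare against the reported percentage.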
 
Protein sequence
MRRYWTGLAM AAVMVVSGCG LDAADPQDEA APTAAATGEV KGKVTLQTWA LKPKFTSYME 
SVISAFEKKH PGTDVAWLDQ PADAYSDKVL SQAAGGTLPD VTNLPPDFAR PLVKQGMLLD
VSTVDSKLAD EYVAGGLDAY RFSGHQGTYG YPWYLNTDVN YWNSELMAEY GLDPKQPPAS
FDELVEQART MKEKSGGKVL LMSRRPEIGD LREAGVKLLS DDGKKFVFNT PEAAALVDKY
RAAFKEGLMP RDVLTDTYAG NAKLFNEGAV AWTTGGGNHI TSVTNENPSL APKIVASPDF
GTPPLYVQGL SISKKTTNLP TAIALARWVT SPENQAAFAH LVSIFPSTKS SANDPFFSKS
DGTNAGDAKV IAFNSLAKAE VLQPYEVTDA MSKFIRQQIS AALNGQISSK EALDAAVAKC
NELLNQ
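
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below is a minimal illustration of that relationship, not the database's own method. It builds the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation codons. Alternative start codons, which table 11 permits, are not handled, and this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence.
print(translate("ATGCGTAGGT ACTGGACGGG ACTGGCCATG"))  # MRRYWTGLAM
print(translate("AACGAGCTGC TCAACCAGTG A"))           # NELLNQ
```

Both fragments are in frame, and their translations match the first ten and last six residues of the listed protein sequence.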