Gene Sros_4889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4889 
Symbol 
ID8668183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5414584 
End bp5415873 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content69% 
IMG OID 
ProductABC-type sugar transport system periplasmic component-like protein 
Protein accessionYP_003340449 
Protein GI271966253 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.278868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.377169 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGAGCAC CATCTCGTCT GCTCGGCGCG GCGATCGGCG CCGTCCTCGC CCTGTCCCTC 
TCCGGCTGCG GGTCGTCCGC CTCGGAGGGG ACCGGTCAGG TGAAGCTGAC CCTCTGGCAC
AACAGCGCGG ACCCGGCGCC GCTGCTGGAG ATGTACAAGA AGTTCGAGAA GCAGTCCGGC
CACAGGATCG AGCTCGTGTC GATCCCGTCC GACGGCTTCG AGGACACCAC CCAGACCAAG
TGGGCCACCG GCGACCGCCC GGACATCCTG GAGTACCACG CGACGGCGAG CGGCCTGCTG
GCCCTCAACC CGGCCGAGAA CCTGCGCGAC CTGACAGGCG AGGCGTACAT CGCCAGGTCC
GGCGACCTCT ACCAGGCCGC CGGGTCCGTC AACGGCAAGG TCTACGCCGC CATCACCGGC
TTCCCCCAGG TCTTCGGCCT CTACTACAAC AAGAAGGTCT TCACCGCGGC CGGGCTGACC
CCGCCCACGA ACTTCGCCGA GCTCGCCGCC GCCTGCCCCA AGCTCAAGGC CGCCGGGGTC
ACTCCGGTCT TCGAGTCGGG CGGGTCGATC TGGCCGGTGC AGATCCTGCC CATCCTCTAC
CTGGCAGGCG CCAACCAGTC CAACGCCTAC GGCAAGGCCA TCGCGGGCCA CAGCAGCACG
CTGGCCGACG CGGGCTCGCC CTTCGTCTCC GGCCTGACCG CCTACGCCAA GCTGAAGGGC
GACGGCTGCT TCAACAAGGA CATCGTCACC GCCAAGTTCG AGGACTCCAT GAAGGCCCTC
GTGACCGGCG AGGCCGCCAT GGTCGCCCAG CACTCCGACA TGCTCCCGGC CCTCCTCGCG
GCCGCGGGCG GCGACCAGAA GACCGTCGAC GAGTCCGTCG GCTTCGTCGG CCTGTCGAGC
GACAAGCCGC TCGTGACCTA CGCGCCCGGC CCGATCGGCA CGTTCTACCT GCCCAAGACC
GGTGACGCGG CGCGGGAGAA GGCGTCGCTC GACTTCGTGC GCTTCATGAC CGGCCCGGCC
TACGCCGAGT ACATCACCGC GTCCAAGACC TTCCCCGTCC TCAAGGACGT GCCCGACCCG
CAGGGCGTCT CCTCCGTGCT GCAGGACGTC AAGAAGGCCT ACGACACCGG CGCGGTCATC
GCCTTCAACT CCGACATCCC CGGCATGGGC GGGCTGGCCC AGCTCATGTC CGAGCTGATC
GCCGGGCAGA AGGATCCGCA GAAGGCGGCG ACCCAGCTAC AGGGCCAGGT CGAGCAGGCG
GCCAAGGCGG CAGGACTGCC CGGATGGTGA
 
Protein sequence
MRAPSRLLGA AIGAVLALSL SGCGSSASEG TGQVKLTLWH NSADPAPLLE MYKKFEKQSG 
HRIELVSIPS DGFEDTTQTK WATGDRPDIL EYHATASGLL ALNPAENLRD LTGEAYIARS
GDLYQAAGSV NGKVYAAITG FPQVFGLYYN KKVFTAAGLT PPTNFAELAA ACPKLKAAGV
TPVFESGGSI WPVQILPILY LAGANQSNAY GKAIAGHSST LADAGSPFVS GLTAYAKLKG
DGCFNKDIVT AKFEDSMKAL VTGEAAMVAQ HSDMLPALLA AAGGDQKTVD ESVGFVGLSS
DKPLVTYAPG PIGTFYLPKT GDAAREKASL DFVRFMTGPA YAEYITASKT FPVLKDVPDP
QGVSSVLQDV KKAYDTGAVI AFNSDIPGMG GLAQLMSELI AGQKDPQKAA TQLQGQVEQA
AKAAGLPGW