Gene Sros_5738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5738 
Symbol 
ID8669032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6281546 
End bp6282904 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content68% 
IMG OID 
ProductABC-type sugar transport system periplasmic component-like protein 
Protein accessionYP_003341229 
Protein GI271967033 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.599702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0824986 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGTT CCCTCCGGCT GGCCGCGGCA GCCACCGCGC TCGGCCTGAC GATCACAGCT 
TGCGGCTCGA CGGCACCCGA GGTCTCCGCC GACGGCGGCG GACTCGGCAC GGCGGAGAAC
CCCGTGTCGA TCAGGGTCAT CGCCAACGAC GCCTTCGCCA AGCAGTGGCA GGACCAGCTG
GTCCCCGAGT TCAACAAGAA GTACCCGAAC ATCAAGGTCT CGGTGGACGG CGTGCCGTAC
AACGACCAGC TTCCCAAGAC CATGCTGGAG CTGACCGGGC CCACCGCGAC GTACGACGTG
GTGCTCGGCG ACGACCCGTG GGTCCCGCAG CTCGCCAAGA CCGGCGGCCT GATGGACCTC
AAGGCCGACG TGACCAAGTG GACCGACCCG GGCTACGACT GGGCCGACTT CAACCCCTCC
CCGCTCGCCG CGGGCCAGTG GGACGGCAAG CAGTACGCCG TCCCGGTCCG CTCCAACCTG
CTGCTGATGC TCTACAACAA GACGCTGTAC AAGAAGGCCG GGGTCCAGGA ACCCACCACG
GAGACGACGT GGGAGCAGTA CTTCAAGGAC GCCGAGAAGC TGGTCCAGGA CACCGACGGC
GACGGGAAGA CCGACGCCTG GGCGATCGGC ACCTACTTCA CCAAGGACCC GCTCACCCCG
ACGATCTGGC AGACCGTCCT GAACTCCAAC GGCGTGGCGC TGCTGGATGA CAACCTCAAG
GTCGCCTTCG ACAACGAGAC CGGCGTGAAG GCCCTTCAGA CCCACGTCGA CCTGCTGAAG
TACGCCCCGC CGGGAGCGAG CACCTACCAG TTCAACGAGC CGCTCGAGGC CTTCCGCCAG
GGCAGGACCG CCACCATGTT CATGTGGGGC AGCGTCTACA AGGGCTCGGC GGTGGACAAG
GCGTCCACCA CGCTGACGCC CGAGGAGGTC GGCGTCACCA CCCTTCCGGC GGGTTCGGCC
GGTCCGGGTG CGCACCGCGG TGTCTGGTCG GCGGGCATCG CCAAGAAGTC CCAGCACCCG
GCCGCCGCGT GGACGTGGCT GCAGTGGGTC ACCTCCAAGG AGGGCGAGAA GTTCACCGGG
AGCGCGTTCG GGACGTTCCC GGCGCGTAAC TCCTCGCTGG ACGGCACCCC GCCGGCCGAG
TGGGCCGCCC CCGTCTACAA GGCGCTCAAG GACGGCTACA CCGTGGTGGA CAAGAACAAG
ATGTGGCGCC CGCGGCTCCC CGAGTCCGAC GCGGTCCAGC AGATCCTCGC TCTGCAGACG
AGCCGGGCGA TGTCCGGCCA GGCGACCTCG GAGCAGGCGA TCGACCAGGC GGCGAAGGAC
GTCACCGAGC TGCTGAAGTC CAAGGGCTAC CAGCAGTGA
 
Protein sequence
MRRSLRLAAA ATALGLTITA CGSTAPEVSA DGGGLGTAEN PVSIRVIAND AFAKQWQDQL 
VPEFNKKYPN IKVSVDGVPY NDQLPKTMLE LTGPTATYDV VLGDDPWVPQ LAKTGGLMDL
KADVTKWTDP GYDWADFNPS PLAAGQWDGK QYAVPVRSNL LLMLYNKTLY KKAGVQEPTT
ETTWEQYFKD AEKLVQDTDG DGKTDAWAIG TYFTKDPLTP TIWQTVLNSN GVALLDDNLK
VAFDNETGVK ALQTHVDLLK YAPPGASTYQ FNEPLEAFRQ GRTATMFMWG SVYKGSAVDK
ASTTLTPEEV GVTTLPAGSA GPGAHRGVWS AGIAKKSQHP AAAWTWLQWV TSKEGEKFTG
SAFGTFPARN SSLDGTPPAE WAAPVYKALK DGYTVVDKNK MWRPRLPESD AVQQILALQT
SRAMSGQATS EQAIDQAAKD VTELLKSKGY QQ