Gene Sros_2255 details

Gene Information

Locus tag: Sros_2255
Symbol:
ID: 8665537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2433371
End bp: 2434693
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003337980
Protein GI: 271963784
COG category:
COG ID:
TIGRFAM ID:
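
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2433371
end_bp = 2434693

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last one is the stop codon.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1323
print(remainder)       # 0
print(protein_length)  # 440
```

Under these assumptions the results agree with the listed gene length (1323 bp) and protein length (440 aa).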


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.190437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.413243
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGAT TCCTCCCCCG CCGCCAGATG CTCAGGACCG TGGCCGCCGT GACCGTGGCG 
GGCGCGCTCG GCCTCGGCGC CGCCGCCTGC GGCGGCGGCT CGACCGGCGA GGCCGCCAAC
GAGCTCGTCT ACTGGTCGAT GTGGAAGGCG GGCGAGCCCC AGGCCAAGGT GCTGGAGTCC
GCCATCGCCT CCTTCACCCG GGAGACCGGC GTCAAGGTCA AGGTCGAGTG GAAGGGGCGC
GACGTCTCCA AGCAGCTCGC GCCCACGCTC AACACCAGGA ACGTCCCCGC CGACCTGGTC
GACTCCGCCG ACCGGTTCGT GAAGTCGACC TTCGTCGCCA CCGGACAGGG CCTGGACCTC
TCTCCCGTCT ACGACATGGA GATCCCGGGC GAGACCGGCA AGAAGGTCGG CGACGTGATC
GATCCCAAGT ACCGCGAGTA CGCGACCTCG GACGGCAAGG CCTGGCTGGT GCCGTACGAG
GTGCTGGCCG AGCAGATCTG GTACGACGGC AACGCGCTCA AGGACGTGGC CGCCGCGCCC
CCGAAGACCT GGGACGACTT CGTCTCCCTG CTGGACAGGC GCAAGACCGC GCGGGGCGAC
GGGCCGCTGG CGCTGGACGC CGACATCGCC GACTACTCGG CCTTCTGGAC CTACCACGCG
ATCCTGCGCG ACCTCGGGCC CGGCGCGTTC GGCGCCGCCG CGACGGACGC GACCGGCGCC
AAGTTCGACG ACCCGGCCTT CGTGACCGCG GTCCAGAAGA TCGAAGAGCT CGTCAAGGGC
GGCTACTTCG TCAAGGGCTA CGACGGCAGC AAGTTCCCGG CCGTCCAGCA GAAGTGGGCC
GGCGGCGGGG CCGACTTCCT GCTGCTGGGC ACCTTCGCGC CGAGCGAGAC CAAGCCGTCG
GCCAAGGAGG GCTTCGCCTA CCGCTCCTTC CCCTTCCCCG AGGGCGCCAA GGGCGAGCAG
ACCCAGGAGA TCTCGCTGAT CGGGTTCGCG ATCCCGGCCA AGGCCCGCAA CGCCGAGGCT
GCCAAGAGGT TCGTCGCGTA TTTCATGAAC AAGGAGCGGC TGTCGAAGAT CGCCTCCGAG
ACGGACAACA TCACCCCGCG CGCGGACATC GAGGTGCCCG CCGTACTGGC CGACGTGAAG
AAGACGCTGG ACACCGCGCG GACGCATCCG GCCCTCGACG GCGTGAAGAT GGACCACACC
GACTGGTACA CCAAGGTCTT CCAGCCGGTG AACACCGAGC TGATCACCGG TAAGGTCTCC
GCGGCCGACT TCGTGGCCAA GCTCAAGAGC ACCTCGGTCG ACTTCTGGAA GCTCAACGGC
TGA
 
Protein sequence
MNGFLPRRQM LRTVAAVTVA GALGLGAAAC GGGSTGEAAN ELVYWSMWKA GEPQAKVLES 
AIASFTRETG VKVKVEWKGR DVSKQLAPTL NTRNVPADLV DSADRFVKST FVATGQGLDL
SPVYDMEIPG ETGKKVGDVI DPKYREYATS DGKAWLVPYE VLAEQIWYDG NALKDVAAAP
PKTWDDFVSL LDRRKTARGD GPLALDADIA DYSAFWTYHA ILRDLGPGAF GAAATDATGA
KFDDPAFVTA VQKIEELVKG GYFVKGYDGS KFPAVQQKWA GGGADFLLLG TFAPSETKPS
AKEGFAYRSF PFPEGAKGEQ TQEISLIGFA IPAKARNAEA AKRFVAYFMN KERLSKIASE
TDNITPRADI EVPAVLADVK KTLDTARTHP ALDGVKMDHT DWYTKVFQPV NTELITGKVS
AADFVAKLKS TSVDFWKLNG
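
The two listings above can be compared programmatically. The sketch below is one possible way to do it, not the method used to produce this record: it strips the ten-residue grouping, computes GC content, and translates the DNA with the amino-acid assignments that NCBI translation tables 1 and 11 share. The function names (`clean`, `gc_percent`, `translate`) are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed with each codon position cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq_text):
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq_text.split()).upper()

def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence listing, copied verbatim.
first_line = "ATGAACGGAT TCCTCCCCCG CCGCCAGATG CTCAGGACCG TGGCCGCCGT GACCGTGGCG"
print(translate(clean(first_line)))  # MNGFLPRRQMLRTVAAVTVA
print(round(gc_percent(clean(first_line)), 1))
```

The translated fragment matches the first twenty residues of the protein listing. Passing the full gene sequence to the same functions allows the whole protein and the reported GC content to be checked in the same way. Table 11 differs from table 1 only in which codons may act as start codons; this gene begins with ATG, so the sketch does not handle alternative starts.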