Gene Sros_3538 details


Gene Information

Locus tag: Sros_3538
Symbol:
ID: 8666826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3925103
End bp: 3926323
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: Maltose-binding periplasmic protein/domains- like protein
Protein accession: YP_003339217
Protein GI: 271965021
COG category:
COG ID:
TIGRFAM ID:
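
The listed gene and protein lengths follow from the coordinates. The check below is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 3925103
end_bp = 3926323

# Inclusive coordinates span end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# For an unspliced CDS, every codon except the final stop encodes one residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1221 406
```

Both values agree with the Gene Length and Protein Length fields.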


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.385239
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAGGA GAGTCACAGG GACGGCGGCC GGCATGGCGC TGCTCATCGG GCTTGCGGCC 
TGCGGCGGCG ACGGCTCCGG AGAGGCCGCG GAGACCACGA CGCTGACCTA CTGGATGTTC
CAGGACCGCA CGCCGCAGGC CGGAGAGGTC GTCGAGAAGC TCCGCACCGA CTTCGAGAAG
GCCAACAACG TCACGGTGAA GATCGTCAAG ATCCCCAAGG ACGACTACAA CACCAAGCTG
GGCAGCGCGG TCGCCGGCGG GACCGTCCCG GACGTCGGCA TTCTCGACCA GCCGCTGGTG
TCCCGGTACG CGCTGGACGG CACGATCAAG GAGATGCCCG CGGGCACGGT CGACGAGAAG
GCCTACTACG CCGGGGCACT GAACACCAAC CGGGTGAACG GCAAGCTCTA CGGCCTGCCG
GTGGACCACA CCGCCGTCGC CCTCTTCTAC AACAGGAAGC TCGTCCCCAC GCCGCCCAAG
ACCTGGGACG AGCTCAAGCA GATCACCGCG AAGATCCACC AGGACGACCC GCAGACCGCC
GGCATGGTGG TGCCCAAGGG AGACGGCTAC GGCGGCTGGA TCTGGCCGGG CGTCCTCGCG
GGCGCAGGCG GCTCCCTCGT GGACGAGAAG GCCAAGAAGA TCCTCTTCGA CCAGCAGCCC
GGGGTCGACG CGCTGCAGCT CTGGGTGGAC CTGCTCTCCT CCTCGCCCAG GAAGATCACC
GACTCGGACA AGGCATTCGA GAACGGCCTG GCCGGCATGA TGATCTCCGG CCCGTGGGAC
GTCGCCAACA TCAAGGACCA GTTCCCCGAC CTGGAGTTCG GCACCGCGCC GCTGCCGTAC
AAGACCGAGC CCGCAGGCAA CATCGGCGGC GAGAACGCGG TGGTCTTCAC CAAGGGCAGG
AACGCCGATC TGGCCTGGAA GTGGCTCAAG CACCTGACCA GCGCCGAGAA CAACACCCAG
CTCGCCCAGG CGCTCGGCGG GTTCCCGACC AACATCGCGG CCGCCGAGAA GGACGCCGCC
ACCTTCGGTC CCGAGCAGGC CGCCTTCCTG GAGCAGCTGA AGGTCGCCCA GGCCCGTCCG
GCCCTGCCGC AGTGGATCCA GGTCAACGAC GAGATCATCG CCCCCGCCAT CGAGTCGGCG
CTGAGCGGCA AGGTCACCTC CCAGCAGGCA CTGAGCGACG CGGCGGCCAA GACCCGCGCC
CTGCTCGGCT GGACCGGCTG A
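
The gene sequence is printed in 10-base blocks, so the spaces and line breaks have to be removed before computing length or GC content. The following is a minimal sketch in Python; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample covers only the first two blocks of the sequence rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as a small demonstration.
sample = clean_sequence("ATGATCAGGA GAGTCACAGG")
print(len(sample), gc_percent(sample))
```

Passing the full gene sequence to the same two functions should give a length that can be compared with the Gene Length field and a percentage that can be compared with the GC content field.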
 
Protein sequence
MIRRVTGTAA GMALLIGLAA CGGDGSGEAA ETTTLTYWMF QDRTPQAGEV VEKLRTDFEK 
ANNVTVKIVK IPKDDYNTKL GSAVAGGTVP DVGILDQPLV SRYALDGTIK EMPAGTVDEK
AYYAGALNTN RVNGKLYGLP VDHTAVALFY NRKLVPTPPK TWDELKQITA KIHQDDPQTA
GMVVPKGDGY GGWIWPGVLA GAGGSLVDEK AKKILFDQQP GVDALQLWVD LLSSSPRKIT
DSDKAFENGL AGMMISGPWD VANIKDQFPD LEFGTAPLPY KTEPAGNIGG ENAVVFTKGR
NADLAWKWLK HLTSAENNTQ LAQALGGFPT NIAAAEKDAA TFGPEQAAFL EQLKVAQARP
ALPQWIQVND EIIAPAIESA LSGKVTSQQA LSDAAAKTRA LLGWTG
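
The protein sequence can be cross-checked against the gene sequence by translating it. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which codons may act as start codons), so a plain standard-code lookup is enough for this sketch. The names `CODON_TABLE` and `translate` are illustrative, and only the first and last lines of the gene sequence are used here.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string in frame 0, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGATCAGGA GAGTCACAGG GACGGCGGCC GGCATGGCGC TGCTCATCGG GCTTGCGGCC"
last_line = "CTGCTCGGCT GGACCGGCTG A"
print(translate(first_line))  # MIRRVTGTAAGMALLIGLAA
print(translate(last_line))   # LLGWTG
```

The two outputs match the first 20 and last 6 residues of the protein sequence listed above. The last line can be translated on its own because it begins at base 1201, which is the start of a codon.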