Gene Sros_5820 details

Gene Information

Locus tag: Sros_5820
Symbol:
ID: 8669114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 6386518
End bp: 6387861
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003341308
Protein GI: 271967112
COG category:
COG ID:
TIGRFAM ID:
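
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the source record; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 6386518
end_bp = 6387861

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1344 bp) and protein length (447 aa).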


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.8778
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCACCGAC GCTTTCTCAT TCTCGCCACC GCCGCCCTCA CGGCCGGGGC ACTGACCGCC 
TGCACCGCGG GCAACCAGAG CGCACCCGCG CTCGGGGCGC AGCCCAAGCC CTCGGGCTCC
GCCTCCGCCT CCCTCCCGGC CGCCGCCATC GAGCTGTGGC ACGGATTCTC GGCGCCCGCC
GAGGTGAAGG CGTTCGAGGA CGCCATCGCC GGGTTCCGCC AGAAGTTCCC GCAGATCACC
GTCAAGCTGG TCAAGGGAGT CCAGGACGAC CAGATCACCC AGGCGGTGCG CGGAGGGAAG
GCCCCGGACG TCGCGTCCTC CTTCACCACC GACAACGTCG CCCAGTGGTG CAAGAGCGGA
ACGTTCCAGG ACCTCACCCC GGTGATCAAG CAGGACGGCA TCGACCTGTC GGTGCTGCCG
GAGGCCTCGC GCTCCTACAC CGAGTTCGAC GGCAGACGCT GCGTGATGCC GCTGCTCGCC
GACGCCTACG GGCTCTACTA CAACAAGGCC CTGATGAAGG GCGAGCAGCC GCCCAAGACG
CTGTCGGAGC TGACCGAGCT CACCAAGAAG CTCACGGTCC GCGACGCCGA CGGGACCATC
AAGGTCGCCG GTTTCATCCC GAGCTTCGAG TACTACGAGA ACACCGCCTC GCACCTCGCC
CCCATGGTCG GCGCCAAGTG GTACAACCCG GACGGCACCT CGGCGATCGG CTCCGACCCG
GCCTGGAAGC AGCTCCTGCA GTGGCAGAAG GAGCTCGTCG ACTGGTACGG CCACGACAAG
CTCGACAAGT TCCGCAAGAG CCTGGGCCAG GAGTGGTCGG CCGACCACCC GTTCTACAAG
GGCAAGGTCG CCATGGTGCT CGACGGCGAA TGGCGCAACG CCATGATCGC CAATGAGGCC
AAGGACCTGG ACTACGGCAC CGCACCGCTC CCGGTCGCCG ACGACAAGCC CGACCTGTAC
GGCAGCGGCT TCACCGCGGG CACGGTGATC GGCGTGCCCA AGGGCGCCAA GAACCCGCAG
GCCGCCTGGG AGCTGGTGAA GTATCTGACC ACCGACACCA CCGCCCTGGT CACCCTCTCC
AACGCCCTGC GCAACGTGCC GACCACCAAG GCCTCGCTGG AGTCGCCGGA CCTGAAGAAG
GACGCGAACT TCCAGACCTT CATCGACATC TTCGCCCACC CCAGGACCAG CACGATGCCC
TCCAGCGTCA ACAGCACCTT CAACCAGGAG GCGATCCAGG AGTTCATGCA CCAGTGGGAG
AAGGGCTCGG TCAAGGACCT CGACGCCGGG CTCGCCGGGG TCGACAAGCG TGTCAACGAC
AAGCTGAAGC TCTCCGGGGG CTGA
 
Protein sequence
MHRRFLILAT AALTAGALTA CTAGNQSAPA LGAQPKPSGS ASASLPAAAI ELWHGFSAPA 
EVKAFEDAIA GFRQKFPQIT VKLVKGVQDD QITQAVRGGK APDVASSFTT DNVAQWCKSG
TFQDLTPVIK QDGIDLSVLP EASRSYTEFD GRRCVMPLLA DAYGLYYNKA LMKGEQPPKT
LSELTELTKK LTVRDADGTI KVAGFIPSFE YYENTASHLA PMVGAKWYNP DGTSAIGSDP
AWKQLLQWQK ELVDWYGHDK LDKFRKSLGQ EWSADHPFYK GKVAMVLDGE WRNAMIANEA
KDLDYGTAPL PVADDKPDLY GSGFTAGTVI GVPKGAKNPQ AAWELVKYLT TDTTALVTLS
NALRNVPTTK ASLESPDLKK DANFQTFIDI FAHPRTSTMP SSVNSTFNQE AIQEFMHQWE
KGSVKDLDAG LAGVDKRVND KLKLSGG
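
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced this record. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard one, which translation table 11 shares for elongation codons. The start-codon set is an assumption based on the NCBI listing for table 11, and it is what makes the leading GTG read as M. The function also assumes its input begins at the first codon of the gene.

```python
# Standard codon table in NCBI order (bases T, C, A, G); translation
# table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: start codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiator codon is read as methionine
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above.
print(translate("GTGCACCGAC GCTTTCTCAT"))
```

Applied to the first two blocks of the gene sequence, `translate` returns `MHRRFL`, the first six residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.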