Gene Sros_6741 details


Gene Information

Locus tag: Sros_6741
Symbol:
ID: 8670050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 7418496
End bp: 7419611
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: D-xylose-binding periplasmic ABC transporter protein
Protein accession: YP_003342193
Protein GI: 271967997
COG category:
COG ID:
TIGRFAM ID:
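
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 7418496
END_BP = 7419611


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1116, matches "Gene Length"
print(protein_length(length_bp))  # 371, matches "Protein Length"
```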


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.177383
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAAGG GGATCCTCAG CCTGACCGCC GCCGCTGCGG CGATGACCCT CGGTCTCACC 
GCCTGCGGGG GCGAGAGCGG CGACACCACC ACCGCCCAGA ACAGCGCCGC CCCCGCCGAG
AGCAAGGCCG CCGGAAAGGT CGGCGTCATC CTGCCGGACA GCAAGTCCTC CGCCCGCTGG
GAGACCGCGG ACCGCAAGTA CCTGGAGGAG GCGTTCAAGG CCGCGGGCGT CGCCTACGAC
ATCCAGAACG CCCAGGGTGA CAAGACCCAG TTCCAGACCA TCGCCGACCA GATGATCACC
AATGGCGCGA CCGTGCTGAT GATCGTCAAC CTGGACAGCG GCACCGGCAA GGCCGTGCTC
GACAAGGCCA AGGCCCAGGG TGTGGCCACC ATCGACTACG ACCGCCTCAC CCTCAACGGC
GGCGCCTCCT ACTACGTCAG CTTCGACAAC ACCAAGGTCG GCACCCTGCA GGGTGAGGGC
CTGGTGAAGT GCCTGACCGA CAAGAAGGCC GACAAGCCGA TCGTGGCCGA GCTCAACGGC
TCGCCCACCG ACAACAACGC CACGCTGTTC AAGAACGGCT ACGACGGCGT GCTCAAGCCC
AAGTACGACG CCAAGGAGTA CGTCAAGGGC CCGGACCAGT CCGTGCCGGA CTGGGACAAC
GCGCAGGCGG GCACGATCTT CGAGCAGATG CTCACCGAGC AGCCGAAGAT CGCCGGCGTG
CTGGCCGCCA ACGACGGCCT GGGCAACGCC GCCATCGCCG TGCTGAAGAA GAACAGCCTC
AACGGCAAGG TCCCGGTCAC CGGCCAGGAC GCCACCGTGC AGGGTCTGCA GAACATCCTC
GCCGGCGACC AGTGCATGAC GGTCTACAAG GCGATCAAGA AGGAGGCCGA CGCGGGGGCC
GCGCTCGCCA TCGCGCTGGC CAAGGGTGAG AAGCCCGCGG CCTCCGGTTC GGTGAAGGAC
ACCGAGAGCG GCGCGGACGT GCCGGCGGTC CTGCTCGACC CGCAGGCCAT CTTCTTCGAC
AGCGTCAAGG ACGTCGTGGC AGACGGGTTC GTGACCAAGG ACGAGCTGTG CGCCGGCGAG
TTCGCCGCCA AGTGCACCGA GGCCGGAATC CAGTAA
 
Protein sequence
MRKGILSLTA AAAAMTLGLT ACGGESGDTT TAQNSAAPAE SKAAGKVGVI LPDSKSSARW 
ETADRKYLEE AFKAAGVAYD IQNAQGDKTQ FQTIADQMIT NGATVLMIVN LDSGTGKAVL
DKAKAQGVAT IDYDRLTLNG GASYYVSFDN TKVGTLQGEG LVKCLTDKKA DKPIVAELNG
SPTDNNATLF KNGYDGVLKP KYDAKEYVKG PDQSVPDWDN AQAGTIFEQM LTEQPKIAGV
LAANDGLGNA AIAVLKKNSL NGKVPVTGQD ATVQGLQNIL AGDQCMTVYK AIKKEADAGA
ALAIALAKGE KPAASGSVKD TESGADVPAV LLDPQAIFFD SVKDVVADGF VTKDELCAGE
FAAKCTEAGI Q
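
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It uses only the first 60 bases of the gene sequence shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from the record (60 bases).
first_line = "ATGCGCAAGG GGATCCTCAG CCTGACCGCC GCCGCTGCGG CGATGACCCT CGGTCTCACC"
print(translate(first_line))  # MRKGILSLTAAAAAMTLGLT
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way gives values that can be compared with the protein sequence and the GC content listed in the record.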