Gene Sros_3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3101 
Symbol 
ID8666388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3382869 
End bp3384047 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content73% 
IMG OID 
Productglycosyltransferase 
Protein accessionYP_003338792 
Protein GI271964596 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.325178 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTATCT CCTCCCAGAG GTGTGCCGTG AACATTGCAT TCGTCTCCTC CGACGCTCTC 
TCCGCCTCCC AGAGCCAGTC CGACCTCGCC GCGCAGCGCG CCCACCTTCT GGCCATCGCC
CGCGAGCTGG GCCGGGAGCA CAAGGTCACG ATCTACACCC GCAGGCACTC CGACGCCGAC
AAGCCCCGGG TCCGCATGTT CCAGGGGGTC ACGCTGGAGA ACCTGGCCGC CGGGCCCGCG
GAGGACCTTC CCGAGGACAG CGTCCTCCCC TACCTGCCCG ACCTCGGCGA CCAGCTCATG
CGCCGCTGGG GGCAGGACCG CCCGGACGTC ATCCACGCGC ACTCCTGGAC CGGTGGCCTG
GCCGCCATCG CGGGGGCCGA CGGCCTGGGC GTGCCGTTCA CCCAGAGCTT CAGCAGCGAG
CACAGCCGCG ACGCCAAGAA GGTCCGGGTG CAGCGCGCGA TCGGCCGCCG TGCCAGCGCG
GTGATCGCCG GATGCGGGGA CGAGGAGTCC ACGCTGATCC GGCTGGGCGT GCCGCGCCGC
AACATCTCCG TGATCCCCTG CGGCGTCGAC GTCGAGCGCT TCCGGCGTCA GGGCCCGGCC
GCGGCCCGGG GCACCCGCCC CCGCCTGCTC CACGTCGGGC CGCTGACCCA GGACAAGGGC
GTCTCCACCG CCATCCGCGC CCTGGAGGGC ATCCCCGACG CCGAGCTGCT CATCGCCGGC
GGCCCGGACG TGGCGGGGCT GGCGCACGAC GCCGACGCGC ACCGCGTCAT GCTGCTGGCC
AAGGAGGTCG GCGTGGAGGA CCGGGTCACC CTGCTCGGCC AGGTCCCGCA CACCTCGGTG
CCCAAGCTGA TGCGCAGCGC CGACCTGGTC ATCTCGCTGC CGCACGAGAC CGCCACCGGT
ATCGTCGCGC TGGAGGCCAT GGCGTGCGGC GTGCCCGTCA TCGCCTCGGC GGTGGGCGCC
CACCTCGACT CCGTCGTGGA CGGGGTGACC GGCCTGCTGG TGCCGGCGGA CCGTCCCGCG
CAGACCTCCC GCCTCATCCG GGAGCTGCTC GCCGACCCGA CCCGGCGTAC GGCGCTCGGT
TTCGCCGGCG CCGACCGCGC CCGCTCCCGC TACTCCTGGG AGCGGATCAG CCAGGAGCTC
GTCCAGGTCT ACGAGAACGC CCTCGCGACG CAGCACTGA
 
Protein sequence
MPISSQRCAV NIAFVSSDAL SASQSQSDLA AQRAHLLAIA RELGREHKVT IYTRRHSDAD 
KPRVRMFQGV TLENLAAGPA EDLPEDSVLP YLPDLGDQLM RRWGQDRPDV IHAHSWTGGL
AAIAGADGLG VPFTQSFSSE HSRDAKKVRV QRAIGRRASA VIAGCGDEES TLIRLGVPRR
NISVIPCGVD VERFRRQGPA AARGTRPRLL HVGPLTQDKG VSTAIRALEG IPDAELLIAG
GPDVAGLAHD ADAHRVMLLA KEVGVEDRVT LLGQVPHTSV PKLMRSADLV ISLPHETATG
IVALEAMACG VPVIASAVGA HLDSVVDGVT GLLVPADRPA QTSRLIRELL ADPTRRTALG
FAGADRARSR YSWERISQEL VQVYENALAT QH