Gene Sros_3961 details

Gene Information

Locus tag: Sros_3961
Symbol:
ID: 8667251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4411513
End bp: 4412763
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: Glycosyltransferase-like protein
Protein accession: YP_003339614
Protein GI: 271965418
COG category:
COG ID:
TIGRFAM ID:
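
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4411513
end_bp = 4412763
gene_length_bp = 1251
protein_length_aa = 416

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, which encodes no residue.
expected_aa = gene_length_bp // 3 - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_aa == protein_length_aa)
```

Under these assumptions, the three checks agree with the values reported in the record.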


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.606556
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.103628
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGGGG CGGTCAGGCG CGGGGCACTG TGGCTGCTGA GGAGACTCAA CGGGAGACGG 
GCACGGCGTG CGCCCGCGCC GGACACGGGA ACGGTCCGCA TCGTGCTGCA GCACGCCTAC
GGGATGGGCG GAACCATCCG GACCGTGCTC AACCTCGCCG CGTACCTGGC GCGGGAGCGC
GACGTGGAGA TCGTCAGCGT GGTCCGGACG GCGGAGGAGC CCTTCTTCCC GATGCCTCCC
GGCGTGCGGG TGTCGTTCCT GGATGACAGG ACCAGGCCCC GCGGCCGGCT CGCCCGGATG
CTGTCGCGCT TCCCGAGCCT GCTGGTCCCG GAGCAGGAGA ACGTCTACCA CGCGCTCACC
CTCTGGACCG ACCTGCGCCT GGTCGCCTTC CTGCGGTCCA TGCGCACCGG CGTGGTGATC
TCCACCCGGC CGGGGTTCAA CCTGGTCACC GCGCTGTTCG CGCCTCCGGG AGTCCTCACG
GTCGGCCAGG AGCACGTGGC CCTGGACGTC CACTCCCCCG AGATCCGGCG GCTCATCAAG
AGGCGGTACG GCAGGCTGGA CGCCTTCGTG ACCCTCACCG ACGCCGATCT GCGCCAGTAC
GCCAGGACGC TGCGGGCCGA TCCGCCGGGA CGGCTGCTGC GCATCCCCAA CGCCCTGCCC
CACCTGGCCG GTGACATCTC CCCTCTGGAG GAGAAGGTCG TCATCGCCAT CGGCAGGCTG
GTGCACGCCA AGGGCTTCGA CCGTCTGGTC AGGGCGTGGG CACACGTCGC GGCGGCGCAT
CCGGACTGGG TGCTGCGCAT CTACGGCGGC GGTACGGCGA AGGCCGAGAC GAAGCTCCGC
ACCAGGATCG AGGACGCGGG TCTGGCGGAC CGGGTGTTCC TGATGGGCAG TTCTCCGGAG
ATCGGGGTCG AGCTGGCCAA GTCGTCGATC TACGTCGTCA GCTCGCGCTA CGAGGGGTTC
GGCATGACGA TCCTGGAGGC GATGAGCAAG GGCGTGCCCG TGGTGAGCTT CGACTGCCCG
CACGGTCCCA GGGAGATCAT CACCGACGAG CACGACGGGC TGCTCGTGCG TACCAAGAAG
GCACAGGATC TGGCCGAGGC CGTCTGCCGC CTGATCGAGG ACCGGCGGTT GCGCGGCACG
CTGGGCGGCA ACGCGGTGCG CACGGCCGCC CGTTACGACC TCGACGCCGT GGGCGCGCGC
TGGGACGCCC TGCTGGCCGA CCTCGCCGGC GACGCGTCGC CGCCCGCCTG A
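
The sequence above is printed in 10-base blocks, so the spaces and line breaks have to be stripped before it can be measured. The sketch below shows one way to do that and to recompute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of the source database. The demo uses only the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence above, used as a small demo.
demo = "ATGCGCGGGG CGGTCAGGCG"
print(len(clean_sequence(demo)))
print(gc_fraction(demo))
```

Passing the full gene sequence to `clean_sequence` should give a length equal to the listed gene length of 1251 bp. The result of `gc_fraction` can then be compared with the reported GC content, assuming the record computes that figure over the same span.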
 
Protein sequence
MRGAVRRGAL WLLRRLNGRR ARRAPAPDTG TVRIVLQHAY GMGGTIRTVL NLAAYLARER 
DVEIVSVVRT AEEPFFPMPP GVRVSFLDDR TRPRGRLARM LSRFPSLLVP EQENVYHALT
LWTDLRLVAF LRSMRTGVVI STRPGFNLVT ALFAPPGVLT VGQEHVALDV HSPEIRRLIK
RRYGRLDAFV TLTDADLRQY ARTLRADPPG RLLRIPNALP HLAGDISPLE EKVVIAIGRL
VHAKGFDRLV RAWAHVAAAH PDWVLRIYGG GTAKAETKLR TRIEDAGLAD RVFLMGSSPE
IGVELAKSSI YVVSSRYEGF GMTILEAMSK GVPVVSFDCP HGPREIITDE HDGLLVRTKK
AQDLAEAVCR LIEDRRLRGT LGGNAVRTAA RYDLDAVGAR WDALLADLAG DASPPA