Gene Sros_3832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3832 
Symbol 
ID8667122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4271103 
End bp4272344 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content77% 
IMG OID 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_003339494 
Protein GI271965298 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.956349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0237391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACCGC TCACCACGCT CCGCACCACC GCTCACCGCC CGGGACTCCT GGCCGCCCTC 
CACCGTGTAA GAGAGATCGC CCCGATGAGC CCGCTCACCG TGTTCCAGGC CGCCGCCGCC
CTCGCCGTCG GGGTACGGCT GGCCCGCGGA CGCGACCGGC TGCCACCGCT GGCCCCCACC
GGGGCGACGG CCGGGCGGAT CTCGGTGGTG ATCCCGGCCC GCGACGAGGA GGGCCGTATC
GGCCCCTGCC TGTCGGCGGT GCTCACCGAT CCGGCCGTCG CGGAGGTCCT CGTCGTCGAC
GACGAGTCGA GCGACGGCAC GGCGCGGCTG GCCGCCGATC TCGGCGCGAA GGTCGTCGTG
GGCGCGCCTC TTCCGGAGGG CTGGGTGGGC AAGCAGTGGG CGCTGCTGCA GGGGGTCGAG
GCGGCCGGCG GCGACATCGT GGTGACCCTC GACGCGGACA CCCGGCCCGC GCCGGGCCTG
TTCGGCGCGC TGGCCGCGGC CCTGGACGGC TACGACCTGG TCAGCGCCGG CCCCCGGTTC
GTCTGCGACG GGATCGCCGA GCAGGCGCTG CACGCCTCGT TCCTGGCGAC GCTGGTCTAC
CGGTCCGGCC CGATCGGGCC GTCCTCCGTC CCCGCTCCGC ACCGTGTCGT GGCCAACGGC
CAGTGCATGG CCTTCCGCCG TACGGCGATG CTGGCCGCCG GCGGGTTCGC GCGGGTCCGC
GGGCACATGA CCGACGACGT GGCGCTGGCC CGGACCCTGG CCGCCGACGG CTGGGCGGTG
GGCTTCCTGG ACGCGGGCGG CCTGCTGGAG GTCGACATGC ACGAGTCGGT GGCCGAGGTG
TGGCGGGAGT GGGGGAGGTC GCTGCCGCTG CGCGACGTCA CCGGACCCGG CCGGCAGGCC
GCCGACCTGG CCGCGATCTG GCTCACCGCC GCCCTGCCCG TGCTGCGGCT GGCGGCGGGG
CGGCCCACCC GGCTCGACCT GGGGCTGCTG GCCGTACGCC TGCTGCTGAC CGGCGCGCTG
CGCGGCAGCT ACGCCCGGCC CGGCCCCGGC GTGCTGCTGT CGCCCCTGCT GGATCCGCTG
ACCGCGGTAC GGCTGACGCA GGCGACGCTG TGCCCGGTGC GCAGCTGGCG GGGCCGTACC
TATTCCGGGA TCACGGCTCC GGGGGTCACG CCCGGCGCCC GGCCCGATCG GCCGGGCCGG
GCCGGGCGGC CCGCGCCGCC TGCCCGAAGC GCAGCCCGAT GA
 
Protein sequence
MRPLTTLRTT AHRPGLLAAL HRVREIAPMS PLTVFQAAAA LAVGVRLARG RDRLPPLAPT 
GATAGRISVV IPARDEEGRI GPCLSAVLTD PAVAEVLVVD DESSDGTARL AADLGAKVVV
GAPLPEGWVG KQWALLQGVE AAGGDIVVTL DADTRPAPGL FGALAAALDG YDLVSAGPRF
VCDGIAEQAL HASFLATLVY RSGPIGPSSV PAPHRVVANG QCMAFRRTAM LAAGGFARVR
GHMTDDVALA RTLAADGWAV GFLDAGGLLE VDMHESVAEV WREWGRSLPL RDVTGPGRQA
ADLAAIWLTA ALPVLRLAAG RPTRLDLGLL AVRLLLTGAL RGSYARPGPG VLLSPLLDPL
TAVRLTQATL CPVRSWRGRT YSGITAPGVT PGARPDRPGR AGRPAPPARS AAR