Gene Sros_1905 details


Gene Information

Locus tag: Sros_1905
Symbol:
ID: 8665183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2029718
End bp: 2030926
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: glycosyltransferase
Protein accession: YP_003337636
Protein GI: 271963440
COG category:
COG ID:
TIGRFAM ID:
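
The start, end, gene length and protein length values listed above agree with one another if the coordinates are read as inclusive and 1-based, and if the protein length excludes the stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that arithmetic, with illustrative function names that are not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2029718, 2030926)
print(length, protein_length(length))  # 1209 402
```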


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCATGT CAGCCGCGCC CGCGCTCGCC CCGGCCCCTC TCGCCACCCG GCCACGCCGG 
GTGCTGATCG GCACCGACAC CTACCCGCCC GACGTGAACG GCGCCGCCTA CTTCACCCAC
CGGCTGGCCG GCGGCCTGGC CGAGCGGGGC AACGAGGTCC ACGTGGTCTG CGCCTCCGAC
GAGGGGGCGG CCAGGACCGA GCACGTGAAC GGCGTGACCG TGCACCGGCT CCGCTCGGCG
CCGGTGCTGG TGCATCCGAC CATGCGGATC TCGGTGCCCA CCCGGCTGGA CCGGCTCATG
GCTGCCATCG CCCCGGACGT GGTCCACGTC CAGGGACACT TCGTGGTCGG CCGCGCCGCG
ATCTCGGCCG CCCGGCGCGT GGGCGTCCCG GTTGTGGCGA CCAACCACTT CATGCCGGAC
AACCTCTTCC AGTTCGCGCA CATCCCCGGT CCGCTCCGCG AGCGGGCCGG CGACCTCGCC
TGGCGGGACT TCAGGCGCGT CTTCTCCCGG GCGGACCGGG TGACCACGCC GACCCGGATC
GCCGCGGGAC TGCTCGCCGG GAAGGGTTTC ACCCGTTCGG TGGAGCCGGT CTCGTGCGGC
ATCGACCTCA GCCGGTTCCG GCCGCACACC GGCCCCAAGG CGTGGGCGCG CGAGGCGTTC
GGCCTGCCCG ACCGCGACAC CGTGCTGTTC GTCGGGCGGC TGGACGAGGA GAAGCGGCTG
GACGAGCTCG TCCGCGCCCT GCCGTACATC CTCAACGGGA CCGACGCGCA GCTCGCGCTG
GTCGGGACCG GGGGGCAGCG GGCGGCGCTG GAGAGGCTGG CGGCCCGGAT CGGGGTCGGT
GACCGGGTGT TCCTCCTCGG GTTCGTCCCC GACGAGGCGC TTCCCCGGGC CTACGCCGCC
GCGGACGTCT TCGCCATGCC CGGGGTCGCG GAGCTGCAGA GCATCGCCAC CCTGGAGGCC
ATGGCCACCG GGCTGCCGGT GGTCGCCGCC GACGCGATGG CCCTCCCCCA CCTGGTACGG
CCCGGCGAGA ACGGCCGGCT GTTCCGGCCG GGTGACGTCC AGGGGCTTGC CCGCCACCTC
AACGACCTGC TCTGCGCGCC CGGCCTGCGC GGCGTGATGG GCGCGGCGAG CCGTGCGATC
GCGCTGACCC ATGACCACCA GGCCTCCCTG GCCCGGTTCG AGACGATCTA CCAGGAGGTG
GCCCGATGA
 
Protein sequence
MVMSAAPALA PAPLATRPRR VLIGTDTYPP DVNGAAYFTH RLAGGLAERG NEVHVVCASD 
EGAARTEHVN GVTVHRLRSA PVLVHPTMRI SVPTRLDRLM AAIAPDVVHV QGHFVVGRAA
ISAARRVGVP VVATNHFMPD NLFQFAHIPG PLRERAGDLA WRDFRRVFSR ADRVTTPTRI
AAGLLAGKGF TRSVEPVSCG IDLSRFRPHT GPKAWAREAF GLPDRDTVLF VGRLDEEKRL
DELVRALPYI LNGTDAQLAL VGTGGQRAAL ERLAARIGVG DRVFLLGFVP DEALPRAYAA
ADVFAMPGVA ELQSIATLEA MATGLPVVAA DAMALPHLVR PGENGRLFRP GDVQGLARHL
NDLLCAPGLR GVMGAASRAI ALTHDHQASL ARFETIYQEV AR
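
The sequences above are laid out in space-separated blocks of ten, so the whitespace has to be removed before any processing. The sketch below shows one way to do that, then computes a GC fraction and translates a coding sequence. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons; table 11's alternative start codons are not handled here.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their original spacing.
first_blocks = "ATGGTCATGT CAGCCGCGCC CGCGCTCGCC"
print(translate(first_blocks))  # MVMSAAPALA
```

Applied to the full gene sequence, `gc_fraction` can be compared against the listed GC content, and `translate` against the protein sequence.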