Gene Sros_9234 details

Gene Information

Locus tag: Sros_9234
Symbol:
ID: 8672582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 10183721
End bp: 10184887
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: putative glycosyl transferase, group 1
Protein accession: YP_003344595
Protein GI: 271970399
COG category:
COG ID:
TIGRFAM ID:
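The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the reported gene length agrees with that reading) and that the CDS ends in a stop codon. The helper names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 10183721
END_BP = 10184887


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1167
print(protein_length(length_bp))  # 388
```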


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCC GGTACATGCT CCTGCACGCC TACGGGATGG GCGGCACGAT CCGCACCGTG 
GTCAACCAGG CCAACGCGAT GGCCGCCGCC GGGCACGACG TCGAGATCGT CAGCGCGGTA
CGGCGCCGCG ACGCCCCCCG GTTCCGCGTC GACCCGCGCG TCGAGGTCAC CGCGCTGACG
GACCAGCGCG GCGGTGTGCG CGCCGACTCG CTGGGGCGCA GGGTCTGGCG GCGGGTCCGC
GGGAAGATCG TGCCCCACGG CGAGTTCGCG GCGTCCTACT TCACCGAGCG GGTGGAGAGG
GCCGTCATCG ACTACGTCTC CGCGCTGGAG GACGGCATCC TGGTCACCAC CCGCCCGGCG
CTGAACCTCA TCTCCGCCCG CCGTACTCCC GCGAGCGTGG TGCGGATCGC GCAGGAGCAC
ATGAACCTGG CCACCCACCC CGAAAGCGTC CGCAGGGAGA TCGCCCGCCA CTACGGCCGG
CTGGACGCGG TCGCGGTGCT CACCGGGACC GACCGCAGGG ACTACCAGGC GCTGCTGCCC
GGCACCCCGG TCGTGCGGAT CCCGAACGCG GTCCACCCCC TCGACCAGGC GCCGTCGCGG
CAGGAGAACC GGCTCGTGAT CGCCGCCGGG CGGCTCGTCG CCCAGAAGGG GTTCGACCTG
CTCATCCCGG CGTTCAAGCA GGTCGTGCAC CACCATCCGG ACTGGCGGCT GCGCATCTAT
GGCACCGGCC CGAAGAAGGC CGCGCTGCGC GCTCTCGTCA AGGAGCACCG GCTCGCCGAC
AACGTCACCC TGATGGGGCG CAGTGACCGG CTGGACGAGG AGCTGGCCCA TGCCTCGCTG
TACGTGCTCA GCTCCCGGTT CGAGGGGCTG CCGATGGTGA TGATCGAGGC GATGTCGCAC
GCGCTGCCGG TCGTCGCCTT CGACTGCCCG ACCGGTCCGC GCGACGTCAT CACCGACGGG
ATCGACGGGC TGCTCGTGCC GCCCCAGGAC GTCGACGCGC TGGCGGCGGC GGTCAGCCGC
CTCATCGCCG ACCGGGAGCT GCGGCGGCGG ATGGGCGCCG CGGCCGTACG GACCGCGCGG
GACTACGCTC CCGAGGCCGT CACCCCGCTG TGGGAGAGGC TGTTCACCGA ACTGCTGCGG
GCCGAGCCCC CCGCCGCGGA GCGCTGA
 
Protein sequence
MKIRYMLLHA YGMGGTIRTV VNQANAMAAA GHDVEIVSAV RRRDAPRFRV DPRVEVTALT 
DQRGGVRADS LGRRVWRRVR GKIVPHGEFA ASYFTERVER AVIDYVSALE DGILVTTRPA
LNLISARRTP ASVVRIAQEH MNLATHPESV RREIARHYGR LDAVAVLTGT DRRDYQALLP
GTPVVRIPNA VHPLDQAPSR QENRLVIAAG RLVAQKGFDL LIPAFKQVVH HHPDWRLRIY
GTGPKKAALR ALVKEHRLAD NVTLMGRSDR LDEELAHASL YVLSSRFEGL PMVMIEAMSH
ALPVVAFDCP TGPRDVITDG IDGLLVPPQD VDALAAAVSR LIADRELRRR MGAAAVRTAR
DYAPEAVTPL WERLFTELLR AEPPAAER
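
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), so a single 64-codon table is used. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers written for this illustration, not functions from an existing library.

```python
# Codon table in the usual T, C, A, G ordering of first, second, third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon.

    Only A, C, G and T are supported; any other symbol raises KeyError.
    """
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAAGATCC GGTACATGCT CCTGCACGCC TACGGGATGG GCGGCACGAT CCGCACCGTG"
print(translate(first_line))  # MKIRYMLLHAYGMGGTIRTV
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` in the same way allows the whole protein to be compared.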