Gene Sros_1197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_1197 
Symbol 
ID8664472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp1221608 
End bp1222918 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content77% 
IMG OID 
ProductSpore coat polysaccharide biosynthesis protein predicted glycosyltransferase-like protein 
Protein accessionYP_003336938 
Protein GI271962742 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.836411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.392428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGACGCG TCGGCATCCG CTGCGACGCG GGCGTCGGCC GCGGCGTCGG CCACCTGATG 
CGCTGCCTGG CCCTCGCCGA GGAACTGCGG GAGCGGCGGC TGGAGGTCGT CGTCCTCGGC
GACATGGGCG GGCTGGAGTG GGCCGCGGAG CAGCTGGCCC GGCGCGGCCT GCGGCTGCTC
CCCGGGCCGG GCGACGCGGC CGCCATGGTC CGGGCCGTCC GCCGCCTCGC GCTCGACGCC
GTGGTCGTCG ACTCCTACGA TCTCGACCCC CGCTGCTCGG GAGCGCTCCG CCAGGCCGGG
GTGCGGGTGC TGGCCGTCGT CGACGACGAC GACCGCGGCC AGGACGCCGA CATCTATCTG
GACCAGAACC TCGGCGCCGA GCGCCGGGCC GGCCGCGTTC CCTCCGGATC GATCCGGCTC
GCCGGGGTGC GCTACGCCCT CCTCCGCGAC GACGTACGGC GTCTGCGCCA GGGACCGATG
GAGCCCGGAC GGCCGGACGG CGTGCTGACG GATCACCGGC AGCCGGACCC GGGGCCGGCG
GAGCCTCAAC GGTTGGACGG AGCACTACCG GACCACCGGC AGCCGGACCC CGGGACGGCG
GAGCCTCAAC GGTTGGACGG AGCACTGCCG GACCACCGGC AGCCGGACCC CGGGGCGGCG
GAGCCTCAAC GGTTGGACGG AGCACTACCG GACCACCGGC AGCCGGACCC CGGGGCGGCG
GAGCCTCGAC AGCCGGGCGG AGCGCTGACG GATCACCGGC AGCCGGGTGG AGTGCCGGTG
GAGCCCGGAC GGCCGCCTCG GGTGCTGTGC TTCTTCGGCG GCACCGACGC GGCCGGGGCG
GCTCCCGTCG TGGTGGGAGA GCTCATCGCG ACGGGCGTGC CGTTCCTGGC GACGGCCGTC
ACCCCGCGCG AGCGGGCGCT GGACCACCTC CAACCGGCCG GCGACCAGAC GGTCCGCCGG
ATTCCCCCCA CCGACGACCT TCCCCGGCTG ATCGCGGCGG CGGACCTCGT CGTCACGGCG
GCGGGGAGCA GCATGTGGGA TCTCCTCTAC CTGGGGAAGG CCGCCGCGCT CGTCTGGGTG
GCGGCGAACC AGCGGCCGGG TTACGAGGAG GTGCTCTCGC GCGGGCTGGC CGCCGGGCTC
GGGCACCTCG ACGCCGTGGT CCGTACGGGT GGCCCCGCCC GCGCGTGCCT GCGGGAACTG
CTCACCTCCG CACGGTCCCG CGAGGAGCTG GGCGCGCGGG GGCCCGCGCT GGTCGACGGC
GAGGGCAGGG CGAGGGTCGC CGACGCCCTG CTCTCCCCGA TCTCCCCGTG A
 
Protein sequence
MRRVGIRCDA GVGRGVGHLM RCLALAEELR ERRLEVVVLG DMGGLEWAAE QLARRGLRLL 
PGPGDAAAMV RAVRRLALDA VVVDSYDLDP RCSGALRQAG VRVLAVVDDD DRGQDADIYL
DQNLGAERRA GRVPSGSIRL AGVRYALLRD DVRRLRQGPM EPGRPDGVLT DHRQPDPGPA
EPQRLDGALP DHRQPDPGTA EPQRLDGALP DHRQPDPGAA EPQRLDGALP DHRQPDPGAA
EPRQPGGALT DHRQPGGVPV EPGRPPRVLC FFGGTDAAGA APVVVGELIA TGVPFLATAV
TPRERALDHL QPAGDQTVRR IPPTDDLPRL IAAADLVVTA AGSSMWDLLY LGKAAALVWV
AANQRPGYEE VLSRGLAAGL GHLDAVVRTG GPARACLREL LTSARSREEL GARGPALVDG
EGRARVADAL LSPISP