Gene Sros_1361 details


Gene Information

Locus tag: Sros_1361
Symbol:
ID: 8664636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1410314
End bp: 1411435
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: Glycosyltransferase-like protein
Protein accession: YP_003337099
Protein GI: 271962903
COG category:
COG ID:
TIGRFAM ID:
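
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1410314, 1411435)
    print(length)                  # 1122
    print(protein_length(length))  # 373
```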


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGTAT GTGTCGGGAC GATTGTCCAT CACCCCGAAG ACGCCCGGAT CATGCATCGG 
CAGATCCGGG CGCTCCTCGA CGCGGGGCAC GAGATCACCT ATGTCGCGCC CTTCACCGAC
TGCAACGTCA CGCCCGATCC CCGGATCCGG GCCGTCGACG TCCCCCGCGC GCTGGGCCGC
CACCGTAGAC GTGCCCTGAA GGCCGCCCGG GGGGCCCTGA AGCGCGGAGC CGAGGGCGCC
GACCTTCTGG TCGTCCATGA CATCGAGCTG CTTCTCCGGC TGCCCAGGCG CCGCCCCGTG
ACCGTCTGGG ACGTCCACGA GGACACGGCC GCCGCCCTGG AGGCCAAGAC GTATCTCCCG
GAGCTCCTGC GCCGGACCCT GCCGTCGCTG ATCCGCCGGG TCGAGGCCCG CGCGGAGGAC
CGGCTGCACC TCGTCCTGGC CGAGGAGGCC TACCGGGAGC GGTTCTCCGG GTCCCACCCG
GTGGTGCCCA ACACCACCTA CGTGCCCCAT CGGCCGCCCC CGCCGCCCGG CCGGAACCGG
GTGGTCTACG TGGGCCAGCT GTCCCGGGCC AGGGGCGCGG CGGAGCTGGT CGAGCTGGCC
CGGCGGCTGC TCCCCCACGG GATCAGGACG GACCTGGTGG GCGCCGCCGA CGCCGAGATC
AGGCCCATGC TGCGGGACGC GCAGCGACAG GGCCTGCTCG ACTGGTACGG CTACGTGCCC
AACCAGCACG CGCTGCGGAT GGCCGAAGGG GCGATCGCCG GGCTGTCACT CCTGCACGAC
GTGCCCAACT ACCGGCAGTC GATGCCGACC AAGGTCGTCG AGTACATGTC CCGCGGGCTC
CCGGTGGTCA CCACGCCGCT CCCGGCCGCC GCTGCCCTGG TCGGCCGGAC CGGCTGCGGG
GTGGTCACGC CCTTCGGGGA CGTGGACGCG GTGCTGGGCG CCGTACTGGC GCTGCGGGAC
GATCCCGGGG GAGCCGCGGC GATGGGCGCA CGCGGCTACG AGGAGGCGCT GCGCCACTAC
GACTGGCCCG CCCACGCGGG CGAGTTCGTG GGGCTGCTGG AGGGGTGGGC GACGGCGAGC
GCCGCCCCCG CGCGCGCCCA CCGCCGTCCC CTCGTGGTCT GA
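
The GC content reported for this gene can be recomputed from the sequence. The sketch below assumes GC content is the percentage of G and C bases over the total length, after removing the spaces and line breaks used to group the sequence into blocks of ten. The function name gc_percent is illustrative and not part of the source database.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Paste the full gene sequence here to compare with the listed value.
    print(gc_percent("ATGC GGCC"))
```

Passing the complete gene sequence and rounding the result gives a figure to compare against the GC content field.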
 
Protein sequence
MRVCVGTIVH HPEDARIMHR QIRALLDAGH EITYVAPFTD CNVTPDPRIR AVDVPRALGR 
HRRRALKAAR GALKRGAEGA DLLVVHDIEL LLRLPRRRPV TVWDVHEDTA AALEAKTYLP
ELLRRTLPSL IRRVEARAED RLHLVLAEEA YRERFSGSHP VVPNTTYVPH RPPPPPGRNR
VVYVGQLSRA RGAAELVELA RRLLPHGIRT DLVGAADAEI RPMLRDAQRQ GLLDWYGYVP
NQHALRMAEG AIAGLSLLHD VPNYRQSMPT KVVEYMSRGL PVVTTPLPAA AALVGRTGCG
VVTPFGDVDA VLGAVLALRD DPGGAAAMGA RGYEEALRHY DWPAHAGEFV GLLEGWATAS
AAPARAHRRP LVV
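
The relation between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). The names CODON_TABLE and translate are illustrative, and unknown codons are rendered as X by choice.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for codons with unknown bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_row = "ATGCGGGTAT GTGTCGGGAC GATTGTCCAT CACCCCGAAG ACGCCCGGAT CATGCATCGG"
    print(translate(first_row))        # MRVCVGTIVHHPEDARIMHR
    print(translate("CTCGTGGTCT GA"))  # LVV
```

The two printed fragments match the first twenty and the last three residues of the protein sequence listed above.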