Gene Sros_0834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_0834 
Symbol 
ID8664106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp856730 
End bp857959 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003336591 
Protein GI271962395 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGC AGAACGAACA GCAGATCGCG GAAGCCTTCC GGGCCGCGCC GCCCATCATC 
CTGGCTCCCG GCCCCAAGGA GTCCCCCACT CAGCTCCCGC CGCCGCCCAA CGACGTGTGG
GACCTGCCCG GCGGCACCGC CTGGGTGTAC CACGGAGAGG GCAACCACGG CCTGACCCGG
CCCGTCATCC TGGCGGACGG CTTCAACACG GGGCCCAGCA CCCCTGACTT CTCCTGGAAC
GCCCTGGATT TCAACGCCTA CCCGCTCCTC AGCGAGCTGC GCCGGCGCGG CAGGGACGTC
GTCCTGCTCG GGTTCACCGA ACGCAGCGCG TCGATCATGG ACAACTCGGA GACCGCCGTC
GCGGCGATCC ACGAGGCGAT CGCGCGGCGA CAGGGCGAGC ATCCGCTCGC GGTCGGCGGC
TTCAGCATGG GCGGCCTGGT CACCCGGCAT GCCCTCGCCA AGCTGGAGAC CATGAGGATG
AACCACCAGA CAGCGCTGTA CTGGTCCTAC GACAGCCCGC ACCGGGGTGC CTGGATCCCC
ATCGCCCTCC AGGCGTTCGC GCACTACATC CGCGCGCTCG ACAGCCGGTT CTCGGACCAG
ATGAACAGCC CGGCCTCCCG CCAGCTGCTG GTGCAGCACA TCGCGGAGTG GCGCGACTCG
CCCGGCGTCG ACAAGGAGCG GACCGAGTTC CTCACCGAGC TGGACCGCGT CGGCGGCTGG
CCGCGCATAC CCCGGCTGAT CGGCGTCGCC AACGGCATCG GTTCGGGCGC CGGCAACGGT
GTGAAGCCCG GCCTGACCGC CCTGAAGGGC AAGGGCCTGG CCATCACCGG CACCGACCTG
CGCACCCAGC CGGCGGGCGG CGACTCGCTG GTCGCCAGGC TGCGGGTCGT GACCCTGCAG
CGGCCGGAGA TCCACGCTCC GGGCCTCCCC GACATCGACG GCGCCCCCGG CGGCACGCTG
GAGGGCTTCG GAATCCTCGC CGACGCGCTC AACGAGCTCG CCCGCTTCGG CTTCGGCGTC
GACGTCCCGA TCCGCTCGCA CTGCTTCGTC CCGGCGGTCA GTGCCGTCGC CATCCGGAAC
ATCGACTCCC GCGACGATCT GTACGTCGAC ATCGACAGCC TCTCGCCCGA GGACAGCGAA
CTGGACGACT TCAAGCTCGC GTCCCGGAAC GAGGAGCACA CCAAGATCAC CGAGGAACTC
TGCACCTGGA TCCTCGACCG GCTCCCGTAG
 
Protein sequence
MSEQNEQQIA EAFRAAPPII LAPGPKESPT QLPPPPNDVW DLPGGTAWVY HGEGNHGLTR 
PVILADGFNT GPSTPDFSWN ALDFNAYPLL SELRRRGRDV VLLGFTERSA SIMDNSETAV
AAIHEAIARR QGEHPLAVGG FSMGGLVTRH ALAKLETMRM NHQTALYWSY DSPHRGAWIP
IALQAFAHYI RALDSRFSDQ MNSPASRQLL VQHIAEWRDS PGVDKERTEF LTELDRVGGW
PRIPRLIGVA NGIGSGAGNG VKPGLTALKG KGLAITGTDL RTQPAGGDSL VARLRVVTLQ
RPEIHAPGLP DIDGAPGGTL EGFGILADAL NELARFGFGV DVPIRSHCFV PAVSAVAIRN
IDSRDDLYVD IDSLSPEDSE LDDFKLASRN EEHTKITEEL CTWILDRLP