Gene Sros_1779 details

Gene Information

Locus tag: Sros_1779
Symbol:
ID: 8665057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1894723
End bp: 1895886
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: allantoate amidohydrolase
Protein accession: YP_003337512
Protein GI: 271963316
COG category:
COG ID:
TIGRFAM ID:
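
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1894723, 1895886)
print(length, protein_length(length))  # prints: 1164 387
```

Under these assumptions the computed values agree with the Gene Length (1164 bp) and Protein Length (387 aa) fields.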


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.294982
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGGGGTG CCATCGCGCA CCTCGGACGT GACGGCCGGA CCCGGGGCTA CCTGCGCGAC 
GCGTGGTCGC CCGCCGATCT GGAGCTCCGT CAGTGGTTCC GCCAGGAGGC CGCCAGACGG
GGCCTCGACC TGCGCGAAGA CCGGAACGGC AACCTGTGGG CCTGGTGGGG CGACCCGTCG
GACAGGCCGG GCGTCGTCAC GGGCAGCCAT CTGGACTCGG TACGGCAGGG CGGCGCCTTC
GACGGCCCCC TCGGGGTCGT GTCCGCCTTC GCCGCCCTCG ACGCGCTGCG GGCAAAGGGC
TTCGAGCCTC CCCGCCCGCT GGGGGTGGCC TGTTTCACCG ACGAGGAGGG CGCCAGGTTC
GGGGTCCCCT GCATGGGCTC CCGGCTGCTC ACCGGCGCGC TCGACCCCGA CAGGGCCCGT
GGCCTGACCG ACGACGGGGG CGACTCGATG GCCGAGGTGC TGCGCCGGGC CGGACGCGAC
CCCGGTGAGC TGGGCCGCGA CGACGAGACC CTCAAGCACG TCGGCGTCTT CGTGGAGCTC
CACGTCGAGC AGGGCCAGGA CCTGGTCCAC CGGGACGCCC CGGTCGGCGT GGCCGCCGCC
ATCTGGCCGC ACGGCCGCTG GCGGTTCGAC TTCCGGGGCC AGGCCAACCA CGCGGGCACC
ACCCGGCTGG AAGACCGCGA CGACCCGATG CTGCCCTTCG CCCGGACCGT GCTGCACGCC
CGCCAGGCCG CCGAGCGGGG CGGCGTGGTG GCCACCTTCG GCAGGCTCCG CGTGTCGCCC
AACAACGCCA ATGCCATCCC CGGCCTGGTC AGCGCCTGGC TGGACGCCCG GGGCGGCGAC
GAGCACGCGG TCCGGGCGCT GGTCGCAGAG CTGACCGAGT TCTCCGGGGC CGAGGTCAGC
GCCGAGTCGT GGACCCCCGT CGTCGACTTC GACGAGGTTC TGCGGGAGCG GCTCGCAGCG
GTCCTGGGAG GCGCGCCCGT CCTGCCGACG GGGGCCGGCC ACGACGCCGG GATCCTGGCG
TCCGCAGGTG TGCCCAGCGC GATGGTGTTC GTCCGCAATC CAACGGGAAT CTCACACTCC
CCGGACGAAC ACGCTGAGAT GTCCGACTGC CACGCGGGGG TCGCCGCCCT CGCCACCGCC
CTGGAGGAGC TGTGCCGGAG CTGA
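
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The sketch below shows one way to strip the layout whitespace, compute the GC fraction (for comparison with the 74% reported in this record), and check that a sequence ends in one of the three stop codons of translation table 11. The helper names are hypothetical, and only the first twenty bases of the gene are used as sample input.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two ten-base blocks of the gene sequence above, as sample input.
sample = clean_sequence("ATGTGGGGTG CCATCGCGCA")
print(len(sample), gc_fraction(sample))  # 13 of the 20 bases are G or C

# Final bases of the gene sequence above.
print(ends_with_stop(clean_sequence("TGTGCCGGAG CTGA")))
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field.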
 
Protein sequence
MWGAIAHLGR DGRTRGYLRD AWSPADLELR QWFRQEAARR GLDLREDRNG NLWAWWGDPS 
DRPGVVTGSH LDSVRQGGAF DGPLGVVSAF AALDALRAKG FEPPRPLGVA CFTDEEGARF
GVPCMGSRLL TGALDPDRAR GLTDDGGDSM AEVLRRAGRD PGELGRDDET LKHVGVFVEL
HVEQGQDLVH RDAPVGVAAA IWPHGRWRFD FRGQANHAGT TRLEDRDDPM LPFARTVLHA
RQAAERGGVV ATFGRLRVSP NNANAIPGLV SAWLDARGGD EHAVRALVAE LTEFSGAEVS
AESWTPVVDF DEVLRERLAA VLGGAPVLPT GAGHDAGILA SAGVPSAMVF VRNPTGISHS
PDEHAEMSDC HAGVAALATA LEELCRS