Gene Sros_8074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_8074 
Symbol 
ID8671402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp8894226 
End bp8895716 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content72% 
IMG OID 
ProductAmidase 
Protein accessionYP_003343472 
Protein GI271969276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.148196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCA TCCACAAGTC CGCCGCCGAA CTGGGCGCGC TGGTCGCAGG CGGAGAGGTC 
TCGGCCGTCG AGGTGGCCCA GGCCCACCTC GACCGGATCG CCGCCGTCGA ACCGCAGGTC
AACGCCTTCC TGCACGTCGA CGCCGAGACG ACGCTGGGGC AGGCACGCGC CGTCGACGCC
CGCAGGGCCG CCGGGGAGGA CCTCGGCCCG CTCGCCGGTG TGCCGATCGC GCACAAGGAC
GTCTTCACCA CCGTCGACAT GCCGACCACG GCCGGATCCA AGATCCTTGA GGGCTACCGC
CCGCCGTACG ACGCCACCGT GACCCGCCGC CTGCGCGAGG CCGGACTGGT GATCCTCGGC
AAGACCAACC TCGACGAGTT CGCGATGGGC TCCTCCACGG AGAACTCGGC CTACGGTCCC
ACCCGCAACC CGTGGGACCT GAGCCGCATC CCGGGCGGCT CCTCCGGCGG CTCGTCCGCC
GCGGTCGCGG CCTACGAGGC CCCGCTGTCC ACCGGCACCG ACACCGGCGG CTCGATCCGC
CAGCCCGCCG CGGTCACCGG CATCGTCGGC ATGAAGCCGA CCTACGGCGG CTCGTCCCGC
TACGGCCTGA TCGCCTTCGC GAGCTCCCTG GACACGCCGG GCCCCTTCGC CCGCAACGTC
ATGGACGCGG CGCTCCTGCA CGAGGCGTTC TCCGGACACG ACGCCATGGA CTCCACCTCC
ATCGACGCCC CGGTGCCCTC CGTCGTCGAG GCGGCGCGCA ACGGCGACGT GGCCGGGCTG
CGCATCGGCG TGGTCAAGGA GTTCGGCGGC GACGGCTACC AGGCGGGCGT GCTCGCCCGC
TTCCACGAGA CCGTCGAGCT GCTGGAGTCC CTCGGCGCCA AGGTCGTCGA GGTCTCCTGC
CCGTCGTTCA GCACGGCCCT GCCGGCCTAC TACCTGATCG CCCCGTCGGA GGCCTCCTCC
AACCTGGCCC GTTTCGACGC CATGCGCTAC GGCCTGCGCG TCGGCGACGA CGGCACGCGG
AGCGCCGAGG AGGTCATGGC GCTGACCCGG GCCGCCGGTT TCGGCCCCGA GGTCAAGCGG
CGCATCATCC TGGGCACCTA CGCGCTGTCC AGCGGCTACT ACGACGCCTA CTACGGCCAG
GCGCAGAAGG TCCGCACGCT GATCGCGCGT GACTTCGAGG CGGCCTTCCA CCAGGTGGAC
GTGCTCGTCT CGCCGACCAC GCCGACCACG GCGTTCCCGA TCGGCGAGCG GGCCGACGAC
CCCATGGCGA TGTACCTCGC CGACCTGTGC ACCATCCCGA CCAATCTGGC GGGCAACGCG
GCCATCTCGG TGCCGTGCGG CCTGGCCGAC GAGGACGGCC TGCCGGTCGG CCTGCAGGTC
ATGGCTCCGG TGCTCGGCGA CGACCGCTGC TACCGGGTCG GCGCCGCGGT GGAGAGGGCT
CTCGAAGGCC GCTGGGGCGG CAGCCTGCTG TCCAAGGCCC CGGCGCTGTA G
 
Protein sequence
MSLIHKSAAE LGALVAGGEV SAVEVAQAHL DRIAAVEPQV NAFLHVDAET TLGQARAVDA 
RRAAGEDLGP LAGVPIAHKD VFTTVDMPTT AGSKILEGYR PPYDATVTRR LREAGLVILG
KTNLDEFAMG SSTENSAYGP TRNPWDLSRI PGGSSGGSSA AVAAYEAPLS TGTDTGGSIR
QPAAVTGIVG MKPTYGGSSR YGLIAFASSL DTPGPFARNV MDAALLHEAF SGHDAMDSTS
IDAPVPSVVE AARNGDVAGL RIGVVKEFGG DGYQAGVLAR FHETVELLES LGAKVVEVSC
PSFSTALPAY YLIAPSEASS NLARFDAMRY GLRVGDDGTR SAEEVMALTR AAGFGPEVKR
RIILGTYALS SGYYDAYYGQ AQKVRTLIAR DFEAAFHQVD VLVSPTTPTT AFPIGERADD
PMAMYLADLC TIPTNLAGNA AISVPCGLAD EDGLPVGLQV MAPVLGDDRC YRVGAAVERA
LEGRWGGSLL SKAPAL