Gene Sros_2778 details

Gene Information

Locus tag: Sros_2778
Symbol:
ID: 8666064
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 3017656
End bp: 3019029
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: allantoinase
Protein accession: YP_003338479
Protein GI: 271964283
COG category:
COG ID:
TIGRFAM ID:
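
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length matches that convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(3017656, 3019029)
print(length, protein_length_aa(length))  # prints: 1374 457
```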


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGCCG AAGATGTGAC GGACCTGGTC ATCCGGTCTC GCCGGGCCGT GCTCCCGGAA 
GGGGAGGGCC CGGCGGCGGT GGCCGTACGG CGGGGCAGGA TCGCCGGCCT GCACGCCTAC
GACGCCGTCC TGGAGGCCGC CGAACAGGTC GACCTCGGCG ACACCGCGCT GCTGCCCGGC
CTCGTGGACA CCCACGTGCA CGTCAACGAG CCCGGCCGGA CGCACTGGGA GGGCTTCGCC
TCCGCCACCC GGGCGGCCGC CGCGGGCGGG GTCACCACGA TCGTGGACAT GCCGCTCAAC
TCGCTGCCGC CGACGGTGGA CGTGGGCGCG CTGGCCGGCA AGCGGAGGGC CGCGGCCGGG
CAGTGCCTGG TGGACGTGGG CTTCTGGGGC GGTGCGGTCC CCGGCAACGT CAAGGACCTG
CGGCCGCTGC ACGAGGCCGG GGTGCACGGT TTCAAATGCT TCCTGTCGCC GTCCGGCGTG
GAGGAGTTCC CGCCGCTGGA CGTGGACGGG CTGCGGGCGG CCATGGTGGA GATCGCCTCC
TTCGACGGCC TGCTGATCGT CCACGCCGAG GACCCGGGGC TGCTCGCCGA ACCGGCCGGC
CCCGGCTACG AGGAGTTCCT CGACTCCCGC CCCGGCCGGT CGGAGCGCCG CGCGGTCGAA
CTGGTCGTCG CGCTGGCGGG GGAGACCGGT GTGCGGGCGC ACATCCTGCA CGTCTCCTCC
GCGCTCTGCC TGGAACCCCT GGCCAGGGCG CGGCGGGAGG GCGTCAGGAT CACCGCCGAG
ACCTGCCCGC ACTACCTGAC GCTGACGGCC GAGGAGGTGC CGCGGGGCGC GACCGAGTTC
AAGTGCTGCC CGCCGATCCG GACCTCCGCC AACCGCGAGC GACTCTGGCG CGGGCTCGCC
GACGGCGTGC TGAGCTGCGT CGTCTCCGAC CACTCACCGT CCACCCCGGA CCTCAAGGTG
CCCGACTTCG CCGCGGCCTG GGGCGGGATC TCCTCCCTCC AGCTCGGCCT GGCGGCCGTG
TGGACCGAGG CCTCGCGTCG CGGGCACGGC CTCGGCCAGG TGGTCCGGTG GATGGCCGCC
AACCCCGCGG CGCTGGCCGG CCTGGACGGC AAGGGCGCCA TCGCCGTGGG CAAGGACGCC
GACCTGGTCG CCTTCGACCC CGACGCCGGC CACACCGTGG ACGCGGCCGC CCTGCACCAC
AGGAACCCGG TCACGCCCTA CCACGGCAGG ACGCTGCGCG GCGTGGTCCG CGCGACCTGG
CTGCGCGGCC GGGCCGTGGG CGACCCCCCC GGCGGAGAGC TCCTGCGCCC CTCCCCGGCC
GGAGCGCGCC ACCCGGGATC GGAACGCACC ACGGAAGAAA GGCCCTCACC GTGA
 
Protein sequence
MRAEDVTDLV IRSRRAVLPE GEGPAAVAVR RGRIAGLHAY DAVLEAAEQV DLGDTALLPG 
LVDTHVHVNE PGRTHWEGFA SATRAAAAGG VTTIVDMPLN SLPPTVDVGA LAGKRRAAAG
QCLVDVGFWG GAVPGNVKDL RPLHEAGVHG FKCFLSPSGV EEFPPLDVDG LRAAMVEIAS
FDGLLIVHAE DPGLLAEPAG PGYEEFLDSR PGRSERRAVE LVVALAGETG VRAHILHVSS
ALCLEPLARA RREGVRITAE TCPHYLTLTA EEVPRGATEF KCCPPIRTSA NRERLWRGLA
DGVLSCVVSD HSPSTPDLKV PDFAAAWGGI SSLQLGLAAV WTEASRRGHG LGQVVRWMAA
NPAALAGLDG KGAIAVGKDA DLVAFDPDAG HTVDAAALHH RNPVTPYHGR TLRGVVRATW
LRGRAVGDPP GGELLRPSPA GARHPGSERT TEERPSP
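
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch, not part of the original record, that strips the whitespace, computes GC fraction, and translates DNA to protein. It assumes the standard codon assignments, which translation table 11 shares for elongation; alternative start codons allowed by table 11 are not handled, which does not matter here because the gene starts with ATG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First display line of the gene sequence from this record.
first_line = clean(
    "ATGCGGGCCG AAGATGTGAC GGACCTGGTC ATCCGGTCTC GCCGGGCCGT GCTCCCGGAA"
)
print(translate(first_line))  # prints: MRAEDVTDLVIRSRRAVLPE
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `translate` in the same way should allow the whole record to be cross-checked.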