Gene Sros_4856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4856 
Symbol 
ID8668150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5379226 
End bp5380566 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content71% 
IMG OID 
Productallantoinase 
Protein accessionYP_003340417 
Protein GI271966221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.229544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGC TGGACCTGCT GTTCAAGGCC CGTCGAGTGG TCACCGCCGC CGGAGAGGTG 
GCACGCAGCA TCGGGGTACG GGACGGGACG GTGATCGCGG TCGAGCCGCT GGACGCGGAT
CTCGAGGCCG CCGAGGTCAT CGAGCTCGGC GACGACGAGG TGCTGCTGCC CGGCCTCGTG
GACAGCCACG TGCACGTGAA CGACCCCGGC CGGACCGAGT GGGAGGGATT CGGGAGCGCC
ACCCGGGCGG CAGCGGCCGG CGGTATCACG ACGATCATCG ACATGCCGCT GAACAGCGTC
CCGCCGACCA CCGATGTCGC GGCGCTGCAG ACGAAACGGA AGACCGCCGA GGGACGGGTG
TACGTCGACG TCGGCTTCTG GGGCGGGGCC GTACCGGGCA ACCTGCGCGA GCTGCGCGGG
CTGCACGACT CGGGCGTGTT CGGATTCAAG TGCTTCCTGC TGCACTCCGG CGTGGACGAG
TTCCCCCACC TGGAACCGGG CGAGCTGGCG GACGCGCTAC GGGAGATCGG GGCGTTCGAC
GCACTGATGA TCGTGCACGC TGAGGACCCG CACGTGATCG ACCACGCCCC GGCCGCGCAC
GGCGCGAGCT ACCGGGACTT CCTGCGCTCC AGGCCGCGGG GCGCGGAGAA TCTCGCGGTC
GCGCAGGTGA TCGAGCTGGC CCGCCGGACC GGCTGCCGGG TGCACATCCT GCACCTGTCC
AGCTCGGACG CGCTTGCGAT GATCCGGTCG GCCCGGCGCG ACGGCGTCCG GATCACCGTG
GAGACATGCC CGCACTATCT GACGTTCAGC GCGGAGGAGA TCGCCGAGGG GGCCACCCAG
TTCAAGTGCT GCCCGCCGAT CCGGGAGGCG GCGAACCGCG AATCGCTCTG GCAAGGGCTT
GCCGACGGCA CGATCGACTG CGTGGTGTCC GACCACTCGC CGTGCACGCC GGAGCTCAAA
CGGTTCGACG TCGGTGACTT CGGCGTCGCC TGGGGCGGCA TCGCGTCGCT GCAACTCGGC
CTGCCGGCGG TGTGGACCGA GGCCCGGCGC CGCGGCCACA CGCTGACCGA CGTGGTGCGC
TGGATGGCGG AACGCCCCGC GGAGCTGATG GGGGTGCACC GCAAAGGCCG GATCGAGACG
GGCTACCAGG CCGACTTCTG CGTGTTCGCG CCCGACGAGG TGTTCGTGGT CGACAGGGAA
CGGCTGCACC ACCGCAACCC GGTCACGCCG TACCACGGCC GGCCGCTCGC GGGTGTGGTC
CGCGGTAGTT GGCTGCGCGG CGTACCGATC GATATCGACA GCCTGCCGCA GGGCCGGCTG
CTCAACGGAG GAGGAGCATG A
 
Protein sequence
MAELDLLFKA RRVVTAAGEV ARSIGVRDGT VIAVEPLDAD LEAAEVIELG DDEVLLPGLV 
DSHVHVNDPG RTEWEGFGSA TRAAAAGGIT TIIDMPLNSV PPTTDVAALQ TKRKTAEGRV
YVDVGFWGGA VPGNLRELRG LHDSGVFGFK CFLLHSGVDE FPHLEPGELA DALREIGAFD
ALMIVHAEDP HVIDHAPAAH GASYRDFLRS RPRGAENLAV AQVIELARRT GCRVHILHLS
SSDALAMIRS ARRDGVRITV ETCPHYLTFS AEEIAEGATQ FKCCPPIREA ANRESLWQGL
ADGTIDCVVS DHSPCTPELK RFDVGDFGVA WGGIASLQLG LPAVWTEARR RGHTLTDVVR
WMAERPAELM GVHRKGRIET GYQADFCVFA PDEVFVVDRE RLHHRNPVTP YHGRPLAGVV
RGSWLRGVPI DIDSLPQGRL LNGGGA