Gene Information |
Locus tag | Sros_4856 |
Symbol | |
ID | 8668150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 5379226 |
End bp | 5380566 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | allantoinase |
Protein accession | YP_003340417 |
Protein GI | 271966221 |
COG category | |
COG ID | |
TIGRFAM ID | |
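The coordinates, gene length, and protein length listed above are mutually consistent, and a few lines of arithmetic can confirm this. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Sros_4856.
length = gene_length_bp(5379226, 5380566)
print(length)                     # 1341
print(protein_length_aa(length))  # 446
```

Both results match the Gene Length (1341 bp) and Protein Length (446 aa) fields in the record.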
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.229544 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGC TGGACCTGCT GTTCAAGGCC CGTCGAGTGG TCACCGCCGC CGGAGAGGTG GCACGCAGCA TCGGGGTACG GGACGGGACG GTGATCGCGG TCGAGCCGCT GGACGCGGAT CTCGAGGCCG CCGAGGTCAT CGAGCTCGGC GACGACGAGG TGCTGCTGCC CGGCCTCGTG GACAGCCACG TGCACGTGAA CGACCCCGGC CGGACCGAGT GGGAGGGATT CGGGAGCGCC ACCCGGGCGG CAGCGGCCGG CGGTATCACG ACGATCATCG ACATGCCGCT GAACAGCGTC CCGCCGACCA CCGATGTCGC GGCGCTGCAG ACGAAACGGA AGACCGCCGA GGGACGGGTG TACGTCGACG TCGGCTTCTG GGGCGGGGCC GTACCGGGCA ACCTGCGCGA GCTGCGCGGG CTGCACGACT CGGGCGTGTT CGGATTCAAG TGCTTCCTGC TGCACTCCGG CGTGGACGAG TTCCCCCACC TGGAACCGGG CGAGCTGGCG GACGCGCTAC GGGAGATCGG GGCGTTCGAC GCACTGATGA TCGTGCACGC TGAGGACCCG CACGTGATCG ACCACGCCCC GGCCGCGCAC GGCGCGAGCT ACCGGGACTT CCTGCGCTCC AGGCCGCGGG GCGCGGAGAA TCTCGCGGTC GCGCAGGTGA TCGAGCTGGC CCGCCGGACC GGCTGCCGGG TGCACATCCT GCACCTGTCC AGCTCGGACG CGCTTGCGAT GATCCGGTCG GCCCGGCGCG ACGGCGTCCG GATCACCGTG GAGACATGCC CGCACTATCT GACGTTCAGC GCGGAGGAGA TCGCCGAGGG GGCCACCCAG TTCAAGTGCT GCCCGCCGAT CCGGGAGGCG GCGAACCGCG AATCGCTCTG GCAAGGGCTT GCCGACGGCA CGATCGACTG CGTGGTGTCC GACCACTCGC CGTGCACGCC GGAGCTCAAA CGGTTCGACG TCGGTGACTT CGGCGTCGCC TGGGGCGGCA TCGCGTCGCT GCAACTCGGC CTGCCGGCGG TGTGGACCGA GGCCCGGCGC CGCGGCCACA CGCTGACCGA CGTGGTGCGC TGGATGGCGG AACGCCCCGC GGAGCTGATG GGGGTGCACC GCAAAGGCCG GATCGAGACG GGCTACCAGG CCGACTTCTG CGTGTTCGCG CCCGACGAGG TGTTCGTGGT CGACAGGGAA CGGCTGCACC ACCGCAACCC GGTCACGCCG TACCACGGCC GGCCGCTCGC GGGTGTGGTC CGCGGTAGTT GGCTGCGCGG CGTACCGATC GATATCGACA GCCTGCCGCA GGGCCGGCTG CTCAACGGAG GAGGAGCATG A
|
Protein sequence | MAELDLLFKA RRVVTAAGEV ARSIGVRDGT VIAVEPLDAD LEAAEVIELG DDEVLLPGLV DSHVHVNDPG RTEWEGFGSA TRAAAAGGIT TIIDMPLNSV PPTTDVAALQ TKRKTAEGRV YVDVGFWGGA VPGNLRELRG LHDSGVFGFK CFLLHSGVDE FPHLEPGELA DALREIGAFD ALMIVHAEDP HVIDHAPAAH GASYRDFLRS RPRGAENLAV AQVIELARRT GCRVHILHLS SSDALAMIRS ARRDGVRITV ETCPHYLTFS AEEIAEGATQ FKCCPPIREA ANRESLWQGL ADGTIDCVVS DHSPCTPELK RFDVGDFGVA WGGIASLQLG LPAVWTEARR RGHTLTDVVR WMAERPAELM GVHRKGRIET GYQADFCVFA PDEVFVVDRE RLHHRNPVTP YHGRPLAGVV RGSWLRGVPI DIDSLPQGRL LNGGGA
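The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up plus a GC-fraction calculation, the kind of computation behind a GC content field. The helper names are hypothetical, and the demo uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence, used as a short demo.
fragment = clean_sequence("ATGGCCGAGC TGGACCTGCT GTTCAAGGCC")
print(len(fragment))                       # 30
print(round(gc_fraction(fragment) * 100))  # 63
```

Passing the full gene sequence through the same two functions is one way to compare a computed value against the GC content listed in the record.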