Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sros_3464 |
Symbol | |
ID | 8666752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 3810103 |
End bp | 3811050 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | fused UDP-L-Ara4N formyltransferase ; UDP-GlcA C- 4'-decarboxylase |
Protein accession | YP_003339143 |
Protein GI | 271964947 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.65465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00203442 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGGGTCG TCATGTTCGG CTACCAGACG TGGGGCCACC GCACGCTGCG GGCTCTGCTG GACTCCGACC ACGAGGTGGT CCTCGTCGTT ACCCATCCCA GGAGCGACCA CGCCTACGAG AAGATCTGGG ACGACTCGGT CGCCGAGCTC GCCGAGAAGC ACGGCGTCCC GGTGCTGCTG CGCTACCGGC CCGACGACGA GGAGCTGCTC GCCGCGCTCC GCGACGCGGC ACCGGACATC ATCGTCGCCA ACAACTGGCG GACCTGGCTG CCGCCGGAGA TCTTCGACCT GCCGCCGCAC GGCACGCTGA ACGTCCACGA CTCGCTGCTC CCCGCCTACG CGGGCTTCTC GCCGCTGATC TGGGCGCTGA TCAACGGGGA GAAGGAGGTC GGCGTCACCG CGCACCGCAT GAACGCCGAG CTGGACGCGG GCGACATCGT GCTGCAGAGG GCCGTGCCGG TGGGACCGGC CGACACCGCG ACCGACCTGT TCCACCGGAC GGTCGACCTG ATCGAGCCGA TCGTGCGCGA GGCGCTCGAC CTCATCGCCT CGGGCAGGGC GCGGTGGGTC GCGCAGGATC GCAGCAAGGC GAGCTTCTTC CACAAACGGT CCGTCGAGGA CAGCCTGATC GACTGGAACT GGCCGGCGGA GGATCTGGAG CGGCTGGTGC GCGCCCAGTC GGACCCGTAC CCCAGCGCGT TCACCTACCA CCGCGGGGAG CGGATCCGGA TCGTGTCGGC GGCGGTGTCG CGGGCCCGCT ACGGCGGCAC CCCCGGGCGC GTCTTCATCC GCGAGGGGGA CGGGGTGGTC ATCGTCACCG GCGCGGACGC GCGGAGCGGG CAGCGGCCCG GACTGGTCCT CAGGCGGCTG CGCACCGACG ACGGCACGGA GCACGCGGCC GCCGACTACT TCCGGACCAT GGGCGGCTAT CTCACCTCGC AGCCGTGA
|
Protein sequence | MRVVMFGYQT WGHRTLRALL DSDHEVVLVV THPRSDHAYE KIWDDSVAEL AEKHGVPVLL RYRPDDEELL AALRDAAPDI IVANNWRTWL PPEIFDLPPH GTLNVHDSLL PAYAGFSPLI WALINGEKEV GVTAHRMNAE LDAGDIVLQR AVPVGPADTA TDLFHRTVDL IEPIVREALD LIASGRARWV AQDRSKASFF HKRSVEDSLI DWNWPAEDLE RLVRAQSDPY PSAFTYHRGE RIRIVSAAVS RARYGGTPGR VFIREGDGVV IVTGADARSG QRPGLVLRRL RTDDGTEHAA ADYFRTMGGY LTSQP
|
| |