Gene Information |
Locus tag | Sros_4161 |
Symbol | |
ID | 8667455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptosporangium roseum DSM 43021 |
Kingdom | Bacteria |
Replicon accession | NC_013595 |
Strand | + |
Start bp | 4630503 |
End bp | 4631915 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | putative tryptophan halogenase |
Protein accession | YP_003339808 |
Protein GI | 271965612 |
COG category | |
COG ID | |
TIGRFAM ID | |
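
The length fields above follow from the coordinates by simple arithmetic. Below is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of this record or of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the Gene Information table above.
length = gene_length_bp(4630503, 4631915)
print(length)                     # 1413
print(protein_length_aa(length))  # 470
```

With the values in this record, both results agree with the listed Gene Length (1413 bp) and Protein Length (470 aa).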
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.326464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGAAT CGACCCAGAT TTTGATCATC GGTGGTGGCC CGGCGGGTTC GACCGCGGCC GGCCTGCTGG CCCGTGAGGG CTTCCAGGTG ACCCTCCTCG AACGCGACCG CTTCCCCCGT TACCACATCG GCGAATCGAT CCTGCCGTCC TGCCGCCCGA TCTTCGAGCT GCTCGGAGTC TGGGACAAGG TCGAGGCCCA CGGCTTCCAG CCCAAGGGCG GCGCGTTCTT CCAGTGGGGC CCCGAGGAGT GGGAGGTGCG CTTCAGCAAC CTGGGCGACG ACACCCCCAA CGCCTGGCAG GTCATCCGCA GCGAGTTCGA CCAGCTCCTG CTCGACCACG CCCGCGAGCT CGGCGTCGAG GTGATCGAAG GGGTGAGCGT CCGCGACATC GAGTTCGACG GCGACCGCGC CGTCGCGGCC CGCTGGTACG ACACCAAGGA TCCCGAGCGC AGCGGCCGCA TCGAGTTCGG CCATGTCATC GACGCCTCCG GCCGCGGCGG CGTGCTGGCC ACCCGCCATC TCAAGAACCG CCGGTTCCAC GACGTGTTCC GCAACGTCGC CGCGTGGACG TACTGGAAGA ACGCCAAGCC GCTCAGCAAG GGGCCCAGCG GCGCGATCGC GGTGTGCTCG GTCGAGGACG GATGGTTCTG GGCCATCCCC CTCCACGACG GGACGCTGAG CGTCGGCCTG GTGACCGGAC GCGACCTGTT CAACGACAGC CGCGGGCGGC TGAACGGCGA CATCCAGGCG GTCTACGACG AGGCGCTCGC GAAGTGCCCG ACCGTCCTCG AACTGCTCGA CGGCGCCGAG CAGGTGAGCG GGATGAAGGT CGAGCAGGAC TACTCCTACG TCGCGGAGAA CTTCGCCGGC CCCGGCTACC TGCTCTCGGG CGACGCCGCC TGCTTCCTCG ACCCCCTGCT GTCCACCGGC GTCCACCTCG CCACCTACAG CGCGATGCTC GGCGCGGCGA GCCTGTCGAG CGTGCTGCGC GGGGAGGTCG CCGAGCAGGA CGCGTGGCGC TTCTACAACA CCGTCTACCA CCACGCCTAC CAACGGCTTC TCATCCTGGT CTCGGTGTTC TACGAGAGCT ACCGGGGCAA GGAGCACCAC TTCTACAACG CGCAGCGGCT GACCTCCGAC GAGCGCGATC ATCTGAACCT GCAGGCGGCC TTCGACCGCA TCATCACCGG CATCGCCGAC CTGAACGACG CCGAACAGGC CTACCGGCTG GTCCAGGAAC ACCTGCGGGG AGGCGAGAGC GGCGATCCCA ACCCCCTGAA CAACCTCAAC CGGGTGCACG AGGTCAAGCA GTCGCCGTTC GACCCGGCCA ATGCCGTCGG CGGCCTGTAC CTGGTGACCG AGCCCCGGCT CGGCCTGCGC TCCAACGGGG CCACGCCGTC CGACCCCTCC TGA
|
Protein sequence | MRESTQILII GGGPAGSTAA GLLAREGFQV TLLERDRFPR YHIGESILPS CRPIFELLGV WDKVEAHGFQ PKGGAFFQWG PEEWEVRFSN LGDDTPNAWQ VIRSEFDQLL LDHARELGVE VIEGVSVRDI EFDGDRAVAA RWYDTKDPER SGRIEFGHVI DASGRGGVLA TRHLKNRRFH DVFRNVAAWT YWKNAKPLSK GPSGAIAVCS VEDGWFWAIP LHDGTLSVGL VTGRDLFNDS RGRLNGDIQA VYDEALAKCP TVLELLDGAE QVSGMKVEQD YSYVAENFAG PGYLLSGDAA CFLDPLLSTG VHLATYSAML GAASLSSVLR GEVAEQDAWR FYNTVYHHAY QRLLILVSVF YESYRGKEHH FYNAQRLTSD ERDHLNLQAA FDRIITGIAD LNDAEQAYRL VQEHLRGGES GDPNPLNNLN RVHEVKQSPF DPANAVGGLY LVTEPRLGLR SNGATPSDPS
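
The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that formatting, compute GC content, and translate the gene sequence so the result can be compared with the listed protein sequence. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in which start codons it permits, and that is not modeled here). All function names are hypothetical, and only the first three blocks of the gene sequence are used in the example.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCGTGAAT CGACCCAGAT TTTGATCATC")
print(translate(fragment))  # MRESTQILII
```

Passing the full gene sequence through `clean_sequence` and `translate` gives a string that can be compared with the protein sequence after the same cleaning, and `gc_content` on the full sequence can be compared with the GC content field.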