Gene Sros_4161 details

Gene Information

Locus tag: Sros_4161
Symbol:
ID: 8667455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 4630503
End bp: 4631915
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: putative tryptophan halogenase
Protein accession: YP_003339808
Protein GI: 271965612
COG category:
COG ID:
TIGRFAM ID:
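
The listed lengths can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4630503, 4631915)
print(length, protein_length_aa(length))  # 1413 470
```

Both values match the Gene Length and Protein Length fields of this record.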


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.326464
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGAAT CGACCCAGAT TTTGATCATC GGTGGTGGCC CGGCGGGTTC GACCGCGGCC 
GGCCTGCTGG CCCGTGAGGG CTTCCAGGTG ACCCTCCTCG AACGCGACCG CTTCCCCCGT
TACCACATCG GCGAATCGAT CCTGCCGTCC TGCCGCCCGA TCTTCGAGCT GCTCGGAGTC
TGGGACAAGG TCGAGGCCCA CGGCTTCCAG CCCAAGGGCG GCGCGTTCTT CCAGTGGGGC
CCCGAGGAGT GGGAGGTGCG CTTCAGCAAC CTGGGCGACG ACACCCCCAA CGCCTGGCAG
GTCATCCGCA GCGAGTTCGA CCAGCTCCTG CTCGACCACG CCCGCGAGCT CGGCGTCGAG
GTGATCGAAG GGGTGAGCGT CCGCGACATC GAGTTCGACG GCGACCGCGC CGTCGCGGCC
CGCTGGTACG ACACCAAGGA TCCCGAGCGC AGCGGCCGCA TCGAGTTCGG CCATGTCATC
GACGCCTCCG GCCGCGGCGG CGTGCTGGCC ACCCGCCATC TCAAGAACCG CCGGTTCCAC
GACGTGTTCC GCAACGTCGC CGCGTGGACG TACTGGAAGA ACGCCAAGCC GCTCAGCAAG
GGGCCCAGCG GCGCGATCGC GGTGTGCTCG GTCGAGGACG GATGGTTCTG GGCCATCCCC
CTCCACGACG GGACGCTGAG CGTCGGCCTG GTGACCGGAC GCGACCTGTT CAACGACAGC
CGCGGGCGGC TGAACGGCGA CATCCAGGCG GTCTACGACG AGGCGCTCGC GAAGTGCCCG
ACCGTCCTCG AACTGCTCGA CGGCGCCGAG CAGGTGAGCG GGATGAAGGT CGAGCAGGAC
TACTCCTACG TCGCGGAGAA CTTCGCCGGC CCCGGCTACC TGCTCTCGGG CGACGCCGCC
TGCTTCCTCG ACCCCCTGCT GTCCACCGGC GTCCACCTCG CCACCTACAG CGCGATGCTC
GGCGCGGCGA GCCTGTCGAG CGTGCTGCGC GGGGAGGTCG CCGAGCAGGA CGCGTGGCGC
TTCTACAACA CCGTCTACCA CCACGCCTAC CAACGGCTTC TCATCCTGGT CTCGGTGTTC
TACGAGAGCT ACCGGGGCAA GGAGCACCAC TTCTACAACG CGCAGCGGCT GACCTCCGAC
GAGCGCGATC ATCTGAACCT GCAGGCGGCC TTCGACCGCA TCATCACCGG CATCGCCGAC
CTGAACGACG CCGAACAGGC CTACCGGCTG GTCCAGGAAC ACCTGCGGGG AGGCGAGAGC
GGCGATCCCA ACCCCCTGAA CAACCTCAAC CGGGTGCACG AGGTCAAGCA GTCGCCGTTC
GACCCGGCCA ATGCCGTCGG CGGCCTGTAC CTGGTGACCG AGCCCCGGCT CGGCCTGCGC
TCCAACGGGG CCACGCCGTC CGACCCCTCC TGA
 
Protein sequence
MRESTQILII GGGPAGSTAA GLLAREGFQV TLLERDRFPR YHIGESILPS CRPIFELLGV 
WDKVEAHGFQ PKGGAFFQWG PEEWEVRFSN LGDDTPNAWQ VIRSEFDQLL LDHARELGVE
VIEGVSVRDI EFDGDRAVAA RWYDTKDPER SGRIEFGHVI DASGRGGVLA TRHLKNRRFH
DVFRNVAAWT YWKNAKPLSK GPSGAIAVCS VEDGWFWAIP LHDGTLSVGL VTGRDLFNDS
RGRLNGDIQA VYDEALAKCP TVLELLDGAE QVSGMKVEQD YSYVAENFAG PGYLLSGDAA
CFLDPLLSTG VHLATYSAML GAASLSSVLR GEVAEQDAWR FYNTVYHHAY QRLLILVSVF
YESYRGKEHH FYNAQRLTSD ERDHLNLQAA FDRIITGIAD LNDAEQAYRL VQEHLRGGES
GDPNPLNNLN RVHEVKQSPF DPANAVGGLY LVTEPRLGLR SNGATPSDPS
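
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how they might be joined, measured for GC content, and translated is given below. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGCGTGAAT CGACCCAGAT TTTGATCATC")
print(translate(first_blocks))  # MRESTQILII
```

The printed residues agree with the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.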