Gene Sros_3982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3982 
Symbol 
ID8667276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4437547 
End bp4439694 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content71% 
IMG OID 
Productanthranilate synthase 
Protein accessionYP_003339635 
Protein GI271965439 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.758299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGACGA GCGGATATAC CACCGCCGGT GGCATCGAGG TCGAGGTCAC GGCGTCCGAT 
GTGCCCGAGA CGGTGCTTGA GGACGTCGTG ACGACGCTCG GCGAGCGTCG CGGAGGCGTG
CTCTCCTCCG GGATGGAGTA TCCCGGCCGC TACAGCCGAT GGCACCTGGC CTACGTCGAC
CCCTGCCTGG AGATCGTGGC CAGGGGCCGC AGGATCTCCG CCCGCGCCCT GAACGCGCGG
GGCAGGGTCG TGCTTCCGGC CGTCGCGTCC TGCCTGCTGG CCACCGGCAA GCCCACCGGG
GAACCGACCG CCGAGCACGT CGAGGTCTAC GTCGCCGAGT CCGAGGACAT CCTCCCCGAG
GAGATGCGCA GCCGCCGCCC CACGGTCTTC ACCGCGATCC GCGAGGTCAT CGCCGCGTTC
AAGGGCGAGA ACGAGCACCT GGGGCTGTAC GGCTCCTTCG GATACGACCT GGCCTTCCAG
TTCGAGCCGA TCCGGCAGGT CCTCACCCGG GCCGACGACC AGCGCGACCT CGTGCTGCAC
CTGCCCGACC GGGTGATGGT GATCGACCGC AAGCGGGAGA CCAGCAAGGA ATACCTCTAC
GAGTTCACCG TGGACGGGGT CTCCACCCGC GGCCTGGCCC GCGAGGGGGA GAGCATCCCG
CTGCCCCCCG CCCCGGCCGA GCTGCCCGCC GACCCGGAGA AGGGCACCTA CGCGCAGGTC
GTCGCCGCGG CCAAGGAGAA GTTCGTCCGC GGCGACCTGT TCGAGGTCGT CCCCGGCCAG
GTCTTCCACG CCGCGTGCAC CGACCCCGCC GCCTTCTACC GGGGGCTGCG CAAGGCCAAC
CCGGCGCCGT TCGAGTTCCT GTTCAACCTC GGCGAGGGCG AGCACCTGGT CGGCGCCTCC
CCGGAGATGT ACGTCCGGGT CAGCGGCGAC CGTGTCGAGA CCTGCCCGAT CTCCGGCACC
ATCGCCCGCG GCGGCAACCC GATCGAGGAC GCCGAGGCGA TCCGCACCCT CCTGTCCAGC
GTGAAGGAGG AGTCGGAGCT GACCATGTGC ACCGACGTGG ACCGCAACGA CAAGTCCCGC
ATCTGCGTGC CGGGCACCGT GCAGGTCATC GGGCGGCGGC AGATCGAGAT GTACTCCCGG
CTGATCCACA CCGTCGACCA CATCGAGGGC CGCCTGCGGC CGGAGTTCGA CGCGCTGGAC
GCCTTCCTCA CCCACATGTG GGCCGTCACC GTCACCGGCG CCCCGAAGTC CTGGGCGATG
CAGTTCATCG AGGACCACGA GGCCACCACC AGGCGCTGGT ACGGCGGGGC GGTCGGCTAC
ATCGGCTTCG ACGGCTCCAT GAACACCGGC CTGACCCTGC GCACCGCGCA GATCCGCGGC
GGCGTCGCCA CCGTCAGGGC CGGTGCCACG CTGCTGTTCG ACTCCGACCC GGAGGCCGAG
GAGCGTGAGA CCGAGCTCAA GGCCAGCGCG CTGCTCGGCG CCCTGGCCGC GGTCGGCGCG
GCCCGGACCC CGCAGGAGCG GGACGTGCCG CAGCCGGTCC GGGAGCAGCC GGGGGAGGGG
ATGAAGGTGC TGCTGGTGGA CCACGAGGAC TCCTTCGTCA ACACCCTGGC CGACTACTTC
CGCCAGCAGG GCGCGGAGGT CGTCACCCTC CGGCACGGCT TCCCCGTGAG CATGATCGAC
GAGATCGCGC CGTCCCTCGT GGTGCTGTCG CCCGGCCCCG GCTGGCCGTC GGACTTCGGC
CTGCCGGAGC TGGTCGGGGC GCTCTACGAG CGCGACCTGC CGGTGTTCGG CGTCTGCCTG
GGCCTGCAGG GCATGGTCGA GCAGGCGGGC GGCACGCTGG AGCTGCTGTC CCACCCTGAG
CACGGCAAGC GCGGTCAGGT GCGGCGGACC GGTCCCGGCG CGCTGCTGGA GGGGCTCCCG
GAGGAGTTCA CCGCGGCCCG CTATCACTCC CTCCACGCCA AGCAGCCCGG AGTCGTCGGC
TTCACCGCCA CCGCCCTCAC CCCCGACGGC GCGGTGATGG CGATCGAGGA CGTGGCCAGG
AGGCGCTTCG CCGTGCAGTT CCACCCCGAG TCGATCCTCA CGGCCGAGGG CGGGGCCGGG
GCGAAGATCA TCTCCAACGT TCTCCGGCTC TGCCGTACCT CTGGGTAA
 
Protein sequence
METSGYTTAG GIEVEVTASD VPETVLEDVV TTLGERRGGV LSSGMEYPGR YSRWHLAYVD 
PCLEIVARGR RISARALNAR GRVVLPAVAS CLLATGKPTG EPTAEHVEVY VAESEDILPE
EMRSRRPTVF TAIREVIAAF KGENEHLGLY GSFGYDLAFQ FEPIRQVLTR ADDQRDLVLH
LPDRVMVIDR KRETSKEYLY EFTVDGVSTR GLAREGESIP LPPAPAELPA DPEKGTYAQV
VAAAKEKFVR GDLFEVVPGQ VFHAACTDPA AFYRGLRKAN PAPFEFLFNL GEGEHLVGAS
PEMYVRVSGD RVETCPISGT IARGGNPIED AEAIRTLLSS VKEESELTMC TDVDRNDKSR
ICVPGTVQVI GRRQIEMYSR LIHTVDHIEG RLRPEFDALD AFLTHMWAVT VTGAPKSWAM
QFIEDHEATT RRWYGGAVGY IGFDGSMNTG LTLRTAQIRG GVATVRAGAT LLFDSDPEAE
ERETELKASA LLGALAAVGA ARTPQERDVP QPVREQPGEG MKVLLVDHED SFVNTLADYF
RQQGAEVVTL RHGFPVSMID EIAPSLVVLS PGPGWPSDFG LPELVGALYE RDLPVFGVCL
GLQGMVEQAG GTLELLSHPE HGKRGQVRRT GPGALLEGLP EEFTAARYHS LHAKQPGVVG
FTATALTPDG AVMAIEDVAR RRFAVQFHPE SILTAEGGAG AKIISNVLRL CRTSG