Gene Sros_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_2002 
Symbol 
ID8665284 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp2155232 
End bp2156815 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content73% 
IMG OID 
ProductCarboxylesterase 
Protein accessionYP_003337733 
Protein GI271963537 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.692121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.289287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGCCG CGCTGGCGGC CACGCTCCTG CTGCCGGCGT GCTCCGCCGC GCCGGAGTGG 
AACGCCAGGC CGGCACCGCC GGCGGTTGAC GAGGGGGTCG CCCGGACCGA CGCGGGAACC
GTGCGCGGGA CGGTGACCGG CGAGCACCGG ATCTTCCAGG GGATCCCGTA CGCCGCCCCG
CCGGTGGGCG ACCTCCGCTG GAACTCGCCC CGGCCGGTGA AGGCGTGGAC CGGCACCAGG
GACGCCACCG GACCGGGCAG CATGTGCCCG CAGGTGGGCA GCGACTACGC CGAGACGGCC
AGCACCGACG AGGACTGCCT CTTCCTCAAC GTGACCGTGC CCCGCACCCC CGGCACGAAG
AAGCCGGTGA TGGTGTGGAT CCACGGCGAC GGCGCGCTGG GGTCCGGAGA CCTCAGCGAC
GTGCGCCGGA TCGCCACCCG GGGGGACGTG GTGGTCGTCA CGATCAACTA CCGCCTCGGG
ATCTTCGGAG GGTTCGGCTA TCCGGGGCTC AAGGGGTCGG GCACGTACGG CCTGCAGGAC
CAGCAGGCGG CCATGCGCTG GGTGCGCCGC AACGCCGCCG CGTTCGGCGG CGACCCGGCG
AACGTGACGG TGTTCGGCGT CTCGTGGGGC GCCCTGAGCA TCAGCGGGCA CCTGACCTCA
CCGGGGGCGA AGGGCCTGTT CGACCGGGCC GTCATGCAGA GCGGCGAGGG CATGATGGAC
ATGCTCGCCG GGAGCATGGG CGAGGGCGTC CCCGCCTACC CGTACTACTC CTGGCGGACC
GAGGAGGAGA TCCAGGGGAT GGGGACGTAC GTGGCCCCGC AGCTCGGCTG CAAGGATCTC
GCCTGTCTGC GGGCCCTGCC CGTGGAGCGG ATCCTCAAGG TGCCGCAGAT CATGAACATG
TTCCAGACCT ACGCCTACGG CGGCGAGACG CTCCCGCGAC CGCCGGCCGA CGAGCTGCGC
GCGGGCCGCG CCCACAGGGT GCCGGTGATC TCCGGAGGCA CCCGCGACGA GCACCGGCTC
TTCGCCGGCA TGCTGTACGA CGCGGCCGGC AAGCCGATCA CCGCCAAGCT GTACCGCAAG
CTGCTGGCCA CCGCGTTCGG CGAGGACGCG GCCAAGGTGG GCGCGGAGTA CCCGGCCGCC
GAGTACGGCT CGCCGGGGCT GGCCTGGGCC GCCGCGATCA CCGACCGCAT GTGGGCCCGC
GGGACCTTCG AGCAGAACCG GAGCCTCGCG AGGAAGGCGC CGGTCTACGC CTACCAGTTC
GCCGACCGCG AGGCCCCGAT GTTCCTGCCC CTGGAGACGG ACTTCCCCTT CGGGGCCTTC
CACGCGGGCG AGCTGCCGTA CCTGTTCACC GAGGAGAAGG CCTCCCTCGA CCCCGCCCAG
CGGGGGCTGG CCGACCAGAT GATCGACTAC TGGACCAACT TCGCCCGCAC CGGCGACCCC
AACGGCTCCG GCCTGCCCCG CTGGGAACGC TTCGACCTGG CGGCCCCGGT CCCGCACACC
CAGTCCCTGG AGCCGGGCGC GGTCGGGCCC GTCGACTACG CCGCCGACCA CAAGCTCGCT
TTCTGGGCGG AGCTGGGCGG CTGA
 
Protein sequence
MGAALAATLL LPACSAAPEW NARPAPPAVD EGVARTDAGT VRGTVTGEHR IFQGIPYAAP 
PVGDLRWNSP RPVKAWTGTR DATGPGSMCP QVGSDYAETA STDEDCLFLN VTVPRTPGTK
KPVMVWIHGD GALGSGDLSD VRRIATRGDV VVVTINYRLG IFGGFGYPGL KGSGTYGLQD
QQAAMRWVRR NAAAFGGDPA NVTVFGVSWG ALSISGHLTS PGAKGLFDRA VMQSGEGMMD
MLAGSMGEGV PAYPYYSWRT EEEIQGMGTY VAPQLGCKDL ACLRALPVER ILKVPQIMNM
FQTYAYGGET LPRPPADELR AGRAHRVPVI SGGTRDEHRL FAGMLYDAAG KPITAKLYRK
LLATAFGEDA AKVGAEYPAA EYGSPGLAWA AAITDRMWAR GTFEQNRSLA RKAPVYAYQF
ADREAPMFLP LETDFPFGAF HAGELPYLFT EEKASLDPAQ RGLADQMIDY WTNFARTGDP
NGSGLPRWER FDLAAPVPHT QSLEPGAVGP VDYAADHKLA FWAELGG