Gene Sros_4879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4879 
Symbol 
ID8668173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5402660 
End bp5404030 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content75% 
IMG OID 
ProductCarboxylesterase 
Protein accessionYP_003340439 
Protein GI271966243 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0135743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCGC GCATGATCGT GCGAACCACT GCCGGGCTGG TCGAGGGCAG GGCCGAGGAG 
GGCGTGCTCG CCTTCCTCGG CGTGCCGTAC GCCGCCCCGC CCTTCGGCGA GCACCGCTTC
GGCGCGCCCG TCCCGCCACC CGCGTGGGAC GGCGTACGGC CCGCCGTCAC GCGGGGCCCT
GCCGCCCCGC AGGCGGCCGG TGAGGACATC GGCCTGTACG CGCCCGACGT CTTCGAGGTC
GGCGAGGACT GCCTCAACCT CGACGTGCGC ACCCCTGAGA CCGGCCGGGC CGGGCTGCCC
GTCCTGGTCT GGATCCACGG CGGCGCGTTC TCCATCGGCG GCAACGGGGC GGCCTGGCAC
CAGGGCTCCC GCTTCGCCCG CGACGGCGTG GTCTGCGTGG CCATCAACTA CCGGCTCGGC
TTCGAGGGCT TCCTGGCCCT GGAGGACGCC CCGCTCAACC GGGGCGTCCT CGACTGGCTG
GCCGCGCTGG AATGGGTGCG CGACAACATC GCGGCCTTCG GCGGCGACCC CGGCAACGTG
ACGATCGCCG GGCAGTCGGC GGGCGCCGCC GCCGTCGCCA CCCTGCTGAC GATGCCCCGT
GCCGAAGGGC TGTTCCGGCG CGCGATCGTG ATGAGCGGCT CGGCCAACCT GGTCGCCACC
GAGCAGGAGG CGCGCGACTA CGCCAAGACG CTCGCCACCC GGCTGGGGAT CGACCCGACC
AGGTCCTCGT TCGCCGGGCT GCCCCCCGAG GCGCTCGTCG CCGAACAGCC GCCGCCGCTG
ACCGGCGACG CCGGCGGGCT GAACGAGTCG GTGCCGGTCA AGCCGGTCGT CGACGGCGAC
CTGATCCCGG CCGTCCCGCT GGCCGCCCTG CGCGCAGGCG CGAGCGGGCA CCTTCCACTG
CTGGTCGGCT GCACCTCCCA GGAGGCCGAC GTGTTCGTCC GCCGCCTGGC CGCCGACCTC
GACGAGGCCG CCGCGCAGGC CGCGCTCGAC CATCTCGGCG TCCAGGTCGA CGGACCCGGC
CTCACGCCCG CCGAACGCGT CGGACGCGCC TTCACCCAGC ACCTGTTCAC CCGCGAGACG
CGCTGGATCC GCGAGGCCAG GCCCGCCTAC GCCTACGAGT TCCGGTGGGA GTCGCCCGTC
GAGAGCCCGG TCGGCGGCGG ACGCGTGGGC GCGGTGCACA ACCTCGACCT GCCCTTCGTC
TTCGACGTGC TCGACGCGCC CGGCGTCGAA CGCACCGCAG GACCGGGCGC GCCGCAGGCC
GTGGCCGACG CGATGCACGC CGCCTGGGTG CGTTTCGCCC GCACCGGCGA TCCCGGCTGG
CCACTCGGGC ACACCATGGT CTTCAATGAA CCCATCGACC GGAGGTTGTG A
 
Protein sequence
MAPRMIVRTT AGLVEGRAEE GVLAFLGVPY AAPPFGEHRF GAPVPPPAWD GVRPAVTRGP 
AAPQAAGEDI GLYAPDVFEV GEDCLNLDVR TPETGRAGLP VLVWIHGGAF SIGGNGAAWH
QGSRFARDGV VCVAINYRLG FEGFLALEDA PLNRGVLDWL AALEWVRDNI AAFGGDPGNV
TIAGQSAGAA AVATLLTMPR AEGLFRRAIV MSGSANLVAT EQEARDYAKT LATRLGIDPT
RSSFAGLPPE ALVAEQPPPL TGDAGGLNES VPVKPVVDGD LIPAVPLAAL RAGASGHLPL
LVGCTSQEAD VFVRRLAADL DEAAAQAALD HLGVQVDGPG LTPAERVGRA FTQHLFTRET
RWIREARPAY AYEFRWESPV ESPVGGGRVG AVHNLDLPFV FDVLDAPGVE RTAGPGAPQA
VADAMHAAWV RFARTGDPGW PLGHTMVFNE PIDRRL