Gene Sros_5046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5046 
Symbol 
ID8668340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp5567935 
End bp5570397 
Gene Length2463 bp 
Protein Length820 aa 
Translation table11 
GC content76% 
IMG OID 
ProductL-carnitine dehydratase/bile acid-inducible protein F 
Protein accessionYP_003340579 
Protein GI271966383 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0984554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0303559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCGC CGCTGCACGG CATCCGCGTC CTCGACTTCG GCCAGTACGT CGCCGGCCCG 
GCCGTGGCCC TGCTCCTCGC CGACCTGGGC GCCGAAGTGA TCCGCATCGA CCCGCCCGGC
GGTCCGCGGT GGGCCTCCCC CGCGGCCGCC GCGCTCAACG CGGGCAAGAA GAGCATCGTG
CTGGACCTGA CCGAGCCCGC CGACGTCACG ACCGCCCGCC GGCTGGTGGC CGGCGCCGAC
GTGGTGGTCG AGAACTTCCG GCCCGGTGTC ATGGCCCGGT TCGGGCTCGG ACCGGAGGAC
ACGCTCGCGC TCAACCCCCG GGTGATCCAC CTGTCGCTGC CCGGCTTCGC CGCGGCCGAC
CCGCGCCGCG ACCTACCGGC CTGGGAGGGG ATCGTGCTGG CGGCCACCGG CGGCTTCACG
GACATGGGAC TGAACCGGAT CCTCATGGGC GTCAACCCGT CCTACACGCC GCTGCCGCTG
GCCTCGGCCT ACGCCGCCGC GCTCGGCGCG CTCGCGGTCG GCGTCGGGCT GGTCGCCAGG
CGGCGCACCG GCCGCGGCGA CGCGTTCGAG GTGCCGCTCG CCGAGGCCGT GCTGGAAGGG
CTGGCGTTCA ACTCGCTGGC CGTCTCCGAC CTCCCCCCGC GCTACCTCTC GCTGCGCGAG
CACGAGATCG CCCGCCGCCG TGCCGCCGGG GAGCCGATGG ACCTCACCTA CGCGCAGCTG
CAGGAGCTCC TCGATCCGTT CTACCGCAGC TACCGCTGCC AGGACGGCAG GCTGTTCTAC
CACTGCTGCC CGGCTCACCG CACCCACGCG ATCCGGTCGC TGCAGCTGCT CGGCATCTGG
GACACCGTCC GCGCGGAGGG CATCCCGGTG GTCGACCCCT ACCTGTCCAC CGACCGCCGG
CCCGACGGCG CGGACTGCAC GCTGCTGGCC TACCCGCTCT CCGCCCGCTG GGCGGCGCGG
CTGTCGGAGC TGATCGCGGC CGCCTTCCGG CGGCACCCGG CGCTGGAGTG GGAACGGCGC
TTCGACGCGG CGGGCATCCC GGGGGCCGCC CACCGCAGCA GTCTGGAGTG GCTGCGTTCG
GAGCATCCGC GCGCGGCGGG GCTCGTCGCC ACCGTCGACG ATCCGGTCCA CGGCGAACTG
ACCGTTCCCG GGCCCGTCGT CTGGACCGAA GGCACCCCGC CCGGCCGCCG GCCCGCCCCC
GCGCTCGACG CCGACCGGGC CGCGATCCTC GCCGACCTGC CGGCGGGGCC GCCGGCACCG
CCGCCCGACC CACCGGCCGC GCCAGGCGGG CTCCCGCTGG CCGGCATCCG CATCCTCGAC
GTCACCAACG TCATCGCCGG GCCGATGATC GCCGCGACCC TCGCACGGTT CGGCGCCGAG
GTCGTCAAGA TCGATCCACC GACGCCGGGC TTCGACCCGT ACCACGCCGT GGTCATCGGC
ATGCACGCCC AGCGCGGCAA GCGCAGCGTG CTGGCCGACC TGCGCACTCC GGCCGGTCGT
GAGGTGCTCG ACCGGCTGCT GCCCACCGTC GACGTCGTCA CGTTCAACGG CAGCGAGCGC
CAGCTCGGCG AGCTCGGCCT GGACCCGCAG CGGCTGCGGG ACATCCGGCC GGGGATCGTG
CTCGTGCGGG TGGACGCCTA TGGCGGGCCG GGTCACGGGC CGCGCAGCCA CGCGGCGGGC
TACGACGACA ACGTGCAGGC GTGCACCGGG ATCATGACCC GGTTCGGCGG CGGCCCGGAC
ACCCCGGAGG AGCACGCGCA CCTCGGGACG ATCGACGCCC TCGCCGGGTT CTGCGGCGCC
TTCGCCGTCG TCGCCGCGCT GGCCGGACGC GGCGAACACG CCCCGGTGAT GCGCACCTCC
CTCGCCGCCG CCGGGCAGCT GCTGCAGATC CCGTTCCTCT TCGACGGCGC GGGCCGCGAC
GACGTGCCGG AACCGGCCGG GCCGGACGTG CTCGGCGAGC ACGCGGGATA CCGGTGCTAT
CCGGCCGCGG ACGGCTGGTT CTTCCTCGCG GGCCCGGCCG AGGTGGTGGC GACCGTGCTC
GGTCTCGACA CGGCCGCGCC GCAGGACCTG CTCGCCGCGC GGTTCCGGGA ACGCCCGGTC
GAGGCGTGGG CCGAGCTGCT CGGCCCGCAC GGGGTCGCGG TGCAACGGAT CGAGCACATC
GCCGCCCTGC GCTCCCGCGG GCTGGTACGG GAGAGCGCCG GGCCCGTCCC GCTGCGCGGC
TCGGCGGTGT TCGTCCGCCA TGACCTGCAC CCCAGCGGCC GGGAGACGGA CCTCATCGCC
CCACAGGCCG TCCGGCCCCG GCACGCGGCG GTGCGGATGC CGTCCGACGC CCCGCGCTAC
GGCGCGCACA CCCGCCAGGT GCTCGCCGAG CTCGGCTTCA CACCCACCGA GATCGAAACG
ATGGCCGCCG ACGGCGCGAT CGCCGACGGC TGGACCGCCG ACCACACCTA CCTGCCCACC
TGA
 
Protein sequence
MSSPLHGIRV LDFGQYVAGP AVALLLADLG AEVIRIDPPG GPRWASPAAA ALNAGKKSIV 
LDLTEPADVT TARRLVAGAD VVVENFRPGV MARFGLGPED TLALNPRVIH LSLPGFAAAD
PRRDLPAWEG IVLAATGGFT DMGLNRILMG VNPSYTPLPL ASAYAAALGA LAVGVGLVAR
RRTGRGDAFE VPLAEAVLEG LAFNSLAVSD LPPRYLSLRE HEIARRRAAG EPMDLTYAQL
QELLDPFYRS YRCQDGRLFY HCCPAHRTHA IRSLQLLGIW DTVRAEGIPV VDPYLSTDRR
PDGADCTLLA YPLSARWAAR LSELIAAAFR RHPALEWERR FDAAGIPGAA HRSSLEWLRS
EHPRAAGLVA TVDDPVHGEL TVPGPVVWTE GTPPGRRPAP ALDADRAAIL ADLPAGPPAP
PPDPPAAPGG LPLAGIRILD VTNVIAGPMI AATLARFGAE VVKIDPPTPG FDPYHAVVIG
MHAQRGKRSV LADLRTPAGR EVLDRLLPTV DVVTFNGSER QLGELGLDPQ RLRDIRPGIV
LVRVDAYGGP GHGPRSHAAG YDDNVQACTG IMTRFGGGPD TPEEHAHLGT IDALAGFCGA
FAVVAALAGR GEHAPVMRTS LAAAGQLLQI PFLFDGAGRD DVPEPAGPDV LGEHAGYRCY
PAADGWFFLA GPAEVVATVL GLDTAAPQDL LAARFRERPV EAWAELLGPH GVAVQRIEHI
AALRSRGLVR ESAGPVPLRG SAVFVRHDLH PSGRETDLIA PQAVRPRHAA VRMPSDAPRY
GAHTRQVLAE LGFTPTEIET MAADGAIADG WTADHTYLPT