Gene Sare_3928 details


Gene Information

Locus tag: Sare_3928
Symbol:
ID: 5703664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4472019
End bp: 4473746
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 73%
IMG OID: 641273353
Product: cobyric acid synthase CobQ
Protein accession: YP_001538710
Protein GI: 159039457
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1492] Cobyric acid synthase
TIGRFAM ID: [TIGR00313] cobyric acid synthase CobQ
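
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 4472019
end_bp = 4473746

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under those assumptions the computed values agree with the reported Gene Length (1728 bp) and Protein Length (575 aa).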


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.111937
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0307354
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGGCG GGCTGCTGGT CGCCGGCACC ACCTCGGACG CCGGCAAGAG CGTACTCACC 
GCCGGGATCT GTCGATGGCT GCACCGGCGG GGCGTGCGGG TGGCGCCGTT CAAGGCGCAG
AACATGTCCA ACAACTCGGC CGTGGTGGTC GGCCCGGACG GACGCGGCGG AGAGATCGGA
CGGGCCCAGG CGATGCAGGC GGCGGCCTGT GGGCTGGCAC CGGATCTGCG GTTCAATCCC
GTGCTGCTCA AGCCCGGCAG CGACCGGACC AGCCAGGTGG TGCTGCTCGG TGAGGCGGTC
GACACACTCA CCACCGGTAC GTTCCGTCAG CTTCGACCCC GTCTCGCCGA TACCGCGTAC
GCGGCGCTGG CTGACCTGCG GGCGACCCAC GACGTGGTGA TCTGCGAGGG GGCAGGCAGC
CCAGCGGAGA TCAACTTACG GGCCGGGGAC TACGTCAACA TGGGGCTGGC CCGGCACGCA
GGCCTGCCCG CGATCGTGGT CGGCGACATC GACCGCGGGG GTGTCTTCGC CTCGATGTTT
GGCACCGTAG CCCTGCTGGA GCCGGCCGAC CAGGCGCTGG TCGCCGGATT CGTCATCAAC
AAGTTCCGCG GTGACCCGAG CCTGTTGGCC CCGGGCCTGG ACATGCTGCG TCAGGTCACC
GGGCGGCCCA CGTACGGCGT GCTGCCCTGG GCGCTCGACC TCTGGCTGGA CGCGGAGGAC
TCGCTTGCGT ACGGGCGGGT ACTCGGCCGC CCGGCCGCCC CCCGTGGCAG TGACTGGCTG
GATGTGGCCG TGGTACGGCT GCCTCGGATC AGTAACGCCA CCGATGTCGA GGCACTCGCC
ACCGAGCCGG GCGTGCGGGT GCGCCTCACC GTCGAGCCAG CCGAGCTCGC CGCCGCCGAC
CTGGTCGTGC TGCCCGGGTC CAAGTCGACC GTGGCGGACC TGGACTGGCT TCGGCAGTCC
GGCCTCGCCG ATGCCGTGCT CGCCCACGCC GCCGCCGGGC GGCCTCTGCT CGGTGTCTGC
GGCGGGTTCC AGATGCTCGG GCGTCTCATC CACGATCCGG TGGAGAGTCG GCGGGGCAGC
GTACCCGGCC TGGGGCTGCT GCCGGTCGAG GTCACCTTCG ACCCGTGCAA GACGGTTCGC
CGATCTGCCG GCATCGGCTG GGATGCCGAG CCGGTCGGCG GCTACGAGAT CCACCATGGG
TACGTCTCGG CCGCCGCCCC CGACCTCGCG CCGCTGTTCG CCTACCACGA CGGCACCGGT
GAGGGCGCGG TCGCCGGATC GGTGTACGGC ACCCACTGGC ACGGGGCGTT CGAGTCCGAC
GGGTTCCGCC GCCGGTTCCT GGCCCGGGTG GCACGTCAAG CCGGACGGCA CGGTTTCCAG
GTCGCCCCGG ACACCTCCTT CGCCGGTGCT CGCGAACGCT CCCTGGACCT GCTCGGCGAT
CTGGTCGAGG AGCACCTGGA CACGGCCGGG CTGTGGCGGC TGATCGAGTC CGGCCCGTCA
CCGGGCCTGC CGTTCATCCC GCCGGGCGCG CCAGACGCGG GGCCCCGGAC CGGGAGCGGC
GCGCCAGACA CGGAGCCCGC GGACCGGGAG CCGGGGACAG CCGCGGGCAG GGGCCCGGGT
TCAGTGGCGA ACGTCGGCGG GGCGGTCGGT GACCGGGAGT CCGGCATCCC GCCAGGCGTC
CACTCCTCCG ATGACATCGG TGGCTCGGTG CAGGCCGAGC GCGCGTAG
 
Protein sequence
MSGGLLVAGT TSDAGKSVLT AGICRWLHRR GVRVAPFKAQ NMSNNSAVVV GPDGRGGEIG 
RAQAMQAAAC GLAPDLRFNP VLLKPGSDRT SQVVLLGEAV DTLTTGTFRQ LRPRLADTAY
AALADLRATH DVVICEGAGS PAEINLRAGD YVNMGLARHA GLPAIVVGDI DRGGVFASMF
GTVALLEPAD QALVAGFVIN KFRGDPSLLA PGLDMLRQVT GRPTYGVLPW ALDLWLDAED
SLAYGRVLGR PAAPRGSDWL DVAVVRLPRI SNATDVEALA TEPGVRVRLT VEPAELAAAD
LVVLPGSKST VADLDWLRQS GLADAVLAHA AAGRPLLGVC GGFQMLGRLI HDPVESRRGS
VPGLGLLPVE VTFDPCKTVR RSAGIGWDAE PVGGYEIHHG YVSAAAPDLA PLFAYHDGTG
EGAVAGSVYG THWHGAFESD GFRRRFLARV ARQAGRHGFQ VAPDTSFAGA RERSLDLLGD
LVEEHLDTAG LWRLIESGPS PGLPFIPPGA PDAGPRTGSG APDTEPADRE PGTAAGRGPG
SVANVGGAVG DRESGIPPGV HSSDDIGGSV QAERA
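
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first line of the gene sequence. This is a minimal sketch, not the method used to produce the record: it assumes the amino acid assignments of the standard genetic code (which translation table 11 shares), and it treats an initiating ATG, GTG, or TTG as methionine. Table 11 lists further start codons that are omitted here. The function name `translate_cds` and the constant names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(triplet): amino_acid
    for triplet, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "GTGAGCGGCG GGCTGCTGGT CGCCGGCACC ACCTCGGACG CCGGCAAGAG CGTACTCACC"
peptide = translate_cds(first_line)
print(peptide)
```

The 60 bases on that line yield 20 residues, which correspond to the first two blocks of the protein sequence (MSGGLLVAGT TSDAGKSVLT). Note that the gene begins with GTG rather than ATG, which is why the start codon needs special handling.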