Gene Sare_3869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3869 
Symbol 
ID5707463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4404598 
End bp4405917 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content70% 
IMG OID641273290 
Productglyoxalase/bleomycin resistance protein/dioxygenase 
Protein accessionYP_001538652 
Protein GI159039399 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.527726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000822129 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCGAATG GCGGTAGCCG CCCGATCGCA CCGGTCCGCA AGCTCGTCGG CGCGTTGCTG 
GGCACGGTGG CGACCTTCGT GGTCCTGTTC GGCCTGGGCA TGCCGAGTTG GTCGATCGTC
GCGCTCGGCG GCGCCATCCT GGTGCTGGCG GTCGCGCTCG CCACGGTACG GGCCGGCGGA
CGCACCTGGG TCATCGGCGA CGGGACGGTG CACACTGTCT CCGAACCGCC GACCCAGTAC
GCGTTCGGCC GCTGCGAACT ACAACTCGTC ATCGACGCCC CAGGGCTACC ACCGCGATCC
AAGAAGATCA TCGAGCCAAG GGTGCCGGTC GCAAAGTGGC CGGCACCAGG CCAGGCACTG
CCGATCCGGG TGGCGCTCGA CGACCAACGG CACGTGCGCG TGCTGTGGGA CGAAGTGCCG
ACACACGCCG AGACCGCAAG GGCCAGCGCG ATGGATCTGC CACCGGAATT CACGGATCCG
GACGTACCCA TGGAGGACCT GCTGATCCAG CAGGAGGCAC CACCGTGGGC CGATCGAGCG
CCCGAGGAGG AAATCCGCGA TCCGTACGGC GACCCGGTCC CGGAGCCCGC CGATTCCAGC
ACGATCGTCG TACACCACGT CCCCGGTGGT CCAAAGGTCC TCGACGGGGA ACTGGTCAAC
CCACCCCTCC CCGGCGACCT GCCGCGCCGC GCCGCGCCGA ACCCGCGACC ACCTGCCGAG
GAACGATTTG ATCCGCCAGC CGAGGCAGCA GCGTTCGAGG CCGCATCGCC CGGAGTCCCG
GCCCAACCGA CCGCCGGAGA GCCCGGACCG CCGACCGACG CGTTCCGCGT ACCTGCCCCG
GCGGACCCGG TCGACCTGCC CCTGGACGAC CCGGCCCCAC CGTACGCCGA GGAGACGGGC
ATCCCGGACC TCGACGAGGC GATCTTCGGG GAGCCCGCCA GTGACCCGCC GATCGCCGGA
GTGGGCTTCA CGTTGCTGGT CACCGATCTG CCGCGGTCCC TTGCCTTCTA CCGGGATCTC
GGCTTCACCG AGGTCGACCA GGGCTCCGGT AACGCGGTGC TCGCGTCCGG GGCGACCCGC
CTGGTGCTAC GGGAGGCCAC CGAGGCCGTC CCGATCAGTC GTCGGCTCGT ACACGTCAAC
CTCGAGGTTG ACAACATCGA GGCCGCGTAC GCGCAGTTAC GGGAGTCCGG TGTCCGCTTC
ACCTACCCGC CGCGGATCGT CAACCGCGGG TCAAAGCTGG AGGTGTGGGC GGCTGCCTTC
CGCGACCCGG ACGGACACGG CATCGCCCTC ACCCAGTGGC GGGAACGCGC CGAAGCATAA
 
Protein sequence
MANGGSRPIA PVRKLVGALL GTVATFVVLF GLGMPSWSIV ALGGAILVLA VALATVRAGG 
RTWVIGDGTV HTVSEPPTQY AFGRCELQLV IDAPGLPPRS KKIIEPRVPV AKWPAPGQAL
PIRVALDDQR HVRVLWDEVP THAETARASA MDLPPEFTDP DVPMEDLLIQ QEAPPWADRA
PEEEIRDPYG DPVPEPADSS TIVVHHVPGG PKVLDGELVN PPLPGDLPRR AAPNPRPPAE
ERFDPPAEAA AFEAASPGVP AQPTAGEPGP PTDAFRVPAP ADPVDLPLDD PAPPYAEETG
IPDLDEAIFG EPASDPPIAG VGFTLLVTDL PRSLAFYRDL GFTEVDQGSG NAVLASGATR
LVLREATEAV PISRRLVHVN LEVDNIEAAY AQLRESGVRF TYPPRIVNRG SKLEVWAAAF
RDPDGHGIAL TQWRERAEA