Gene Sare_1535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1535 
Symbol 
ID5703516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1770670 
End bp1771788 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content68% 
IMG OID641271046 
Producthypothetical protein 
Protein accessionYP_001536422 
Protein GI159037169 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0104917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTAACA ATCGGGGGAT CGTCGTCGCC GCCACCGGTG ACCTGCTCAT CGCACGCGAC 
CGCCCACACG ACATCTTCCG GTACGTGCGA GACCTGTTGA CCGAGGCCGA CATCACCTTC
GGCCAGTTGG AGACCGCGTA CTCCGACCAG GGATCCCTGG GCTCGTCCGG ACCGCGCGGA
GGCGTACCGC ACGACGTGGA GAACCTGGTG GCGATCCCGC ACGCGGGCTT CGACGTCATT
TCGATGGCGA GCAACCACAC CGGCGACTGG GGGGCCGACG CGTTACTCGA CTGCATCGAG
CGGTGCCGGC GCCACGGCAT CACCGTGGTG GGTGCGGGCG CGGACATCGC CGAGGCGCGC
CGGCCGGGGA TCATCGAACG GGACGGGACC CGGGTCGGAT TCCTGGCCTA CTGCTCGGTC
GCGCCGGATG GCTACTACGC CGGGCCGGGT AAGCACGGTG TGGCGCCGAT GCGGGCGAGA
ACGCACTATG AACCGTTCGA GTACGACCAG CCCGGCGGCC CGCCCCTGGT CAGAACCTCG
CCGGACGAAT CCGATCTGGC GGCGCTCGTC GCGGACGTCG ACGAGTTGCG CGACCAGGTG
GACGTGCTGA TCGTGTCATT CCACTGGGGC CTGCATTTTC AGCCCGCACG GCTCGCGGAC
TACCAGCCGG TGGTGGCGCA CGCGGCGATC GACGCCGGTG CCGACGCGGT GATCGGGCAC
CACCCGCACA TCCTGAAGCC GGTGGAGGTC TACCGAGGCA AGGTCATCTT CTACAGCCTG
GGCAACTTCG CCCTCGAGAT CAACGAGCGC TGGTGGCAGT CGTACAGCAA GGAATGGTTC
GAGAAGGCGA ACGAGTTCCA CCAGGAACGT TCTCCCCACC GGGACCTGAA GGAGGAGGCC
CGGAACTCGG CGATCGTGCG GCTGCACATC GTCGACGGTC GCATCGACCG GGTTGGGATC
GTACCTGTGG TGATCAACGA GGCGCACGAG CCGGTACCGC ATCGGGCGGA CACAACGGAC
GGGCGCGCGG TCCGCGCCTA CCTGGCGCAG ATCACGGCCG AGGTGGGGAT CGACACCACC
TTCGACGTGG TCGACAACGA GGTCCTGGTC CGCGTCTGA
 
Protein sequence
MGNNRGIVVA ATGDLLIARD RPHDIFRYVR DLLTEADITF GQLETAYSDQ GSLGSSGPRG 
GVPHDVENLV AIPHAGFDVI SMASNHTGDW GADALLDCIE RCRRHGITVV GAGADIAEAR
RPGIIERDGT RVGFLAYCSV APDGYYAGPG KHGVAPMRAR THYEPFEYDQ PGGPPLVRTS
PDESDLAALV ADVDELRDQV DVLIVSFHWG LHFQPARLAD YQPVVAHAAI DAGADAVIGH
HPHILKPVEV YRGKVIFYSL GNFALEINER WWQSYSKEWF EKANEFHQER SPHRDLKEEA
RNSAIVRLHI VDGRIDRVGI VPVVINEAHE PVPHRADTTD GRAVRAYLAQ ITAEVGIDTT
FDVVDNEVLV RV