Gene Sare_1678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1678 
Symbol 
ID5704575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1934779 
End bp1936473 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content73% 
IMG OID641271182 
Producthypothetical protein 
Protein accessionYP_001536557 
Protein GI159037304 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1887] Putative glycosyl/glycerophosphate transferases involved in teichoic acid biosynthesis TagF/TagB/EpsJ/RodC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.193325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.353348 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGGTG ACCTGGTCCG GAAGCTGATC GCGCAGGCCC TCTCCGCCGG CTTGGCCGTG 
CTGGCCTTCG TCGTGCTCGC GTTGACCGGA GCGACCCACT GGGGGCTCGG GCTGGCGGTC
GCCGCACTCG CGGCGACCGG CTGGCAGCGG CGGGTTCGAC CGCAGGCCGA CAGCGTCGCC
GAGACCGCCC TGCTCGCGGC GGGAATCCTG GTCGGGTATG CCCGCCACCT GGACGCTGCG
TTCGATCCCG CGCTGGCCGC GACCGCGCTG GTCCTGCTCG GGCTGGTCCT GCTGGTCGAG
CCGCTGCGAG CGGCGGGCGA GCGGGAGATC CGCACCGCGA ACCTACCGGT ACGCGACTGG
TCTCCCCTGG TCGCGGCCGA GCTGCCGGCT GCCCTGCTCG TGCTGCTCGC TCTGGTCGCT
GGCGCGGCTG CGGTGCCCCT GCCGGCCTGG GTGGCGCTGG TCACCGCCCT GCTGGTCGGT
GTGGCCGCCG GCGCGGTGGC GCTGGATCTG GCCCGACGGC GATTCCGTCC GGCCGCCGGT
GGCGGGCCGG TGCGCCGGGC GCTGCGCCGG CACCACCCCG AGTTTCTGCT GTACTTCTCC
GCGCCTCCCG GCTCCGAGTA CCAGGTGACC ATGTGGCTGC CGTATCTGGA GCGGCTCGGT
CGACCGTTCC TGGTCATGCT GCGCGAGCCG GAGTTGCTGT CGACCGTCGC GGCGGCCACC
ACCGCCCCGG TCGTGTACTG CCCGACGCTG CCCGCGATGG ACGAGGCACT GGTGCCGAGC
CTGCGCGTCG CCTTCTATGT CAACCACGGC GCGAAGAACG GCCACTGCGT CCGATTCACC
CAGCTCACCC ACGTGCAGCT GCACCACGGC GACAGTGACA AGGCCCCCAG CGCCAACCCG
ATGTCGGGCA TCTTCGACCG GATCTTCGTC GCGGGTCAGG CGGCGGTCGA CCGGTACGCC
CGCGCCGGCG TCCACCTCCC GGCGGAGAAG TTCGTTCTGG TCGGCCGTCC CCAGGTGGAG
GGGATCGAGG TGCGCCGTGG GGCGGTGCCC ACCACCGCGC CGACCGTCCT GTACACCCCC
ACGTGGACCG GGCACCACGC CGACGCCAAC TACTGCTCGC TTCCGGCGGC CGAGACCCTG
CTGCGGGACC TGTTGGCGCG GGGCGCGACG GTGATCCTGC GGGCTCACCC CTACACCACG
CAGAACCCGG CCTCGGCCCG ACAGCTGGCC CGACTTCACG AGCTACTCGC CGCCGACCGC
GGCCGGACCG GGCGGGCCCA CGTCTTCGGG GCAGCGGCCA GCCGCGAGCT GACGCTGACC
GAGTGCGTCA ACCGCTCCGA CGCGCTCGTC TCCGACGTGT CAGGCGTCAT CTCCGACTAC
CTCTACTCCG GCAAGCCGTA CGCGGTGACC GACATGCTCG ACCTGCGCGA CCGGTTCCCG
GCGGACTTCC CCCTCGCCGC TTCCGGTTAC GTGCTGCGGC GGGACATGTC CAACGGCAAC
GACGTGCTGG ACCGGCTGCT CGGGAAGGAC CCGCTGGCCG ACGCCCGATG GGCTACCCGG
ACCCGCTACC TGGGCGACTT CCCCGCCGAG TCGTACGCCG AGGGCTTCCT GGCCGCCGCC
CGCCGGGAAC TGGAGTTGAA CGTCGCCGTT CCGACGCCCC GCACCGCCGA CGCCACGGCG
TCCGCCACCG GTTGA
 
Protein sequence
MRGDLVRKLI AQALSAGLAV LAFVVLALTG ATHWGLGLAV AALAATGWQR RVRPQADSVA 
ETALLAAGIL VGYARHLDAA FDPALAATAL VLLGLVLLVE PLRAAGEREI RTANLPVRDW
SPLVAAELPA ALLVLLALVA GAAAVPLPAW VALVTALLVG VAAGAVALDL ARRRFRPAAG
GGPVRRALRR HHPEFLLYFS APPGSEYQVT MWLPYLERLG RPFLVMLREP ELLSTVAAAT
TAPVVYCPTL PAMDEALVPS LRVAFYVNHG AKNGHCVRFT QLTHVQLHHG DSDKAPSANP
MSGIFDRIFV AGQAAVDRYA RAGVHLPAEK FVLVGRPQVE GIEVRRGAVP TTAPTVLYTP
TWTGHHADAN YCSLPAAETL LRDLLARGAT VILRAHPYTT QNPASARQLA RLHELLAADR
GRTGRAHVFG AAASRELTLT ECVNRSDALV SDVSGVISDY LYSGKPYAVT DMLDLRDRFP
ADFPLAASGY VLRRDMSNGN DVLDRLLGKD PLADARWATR TRYLGDFPAE SYAEGFLAAA
RRELELNVAV PTPRTADATA SATG