Gene Sare_4362 details


Gene Information

Locus tag: Sare_4362
Symbol:
ID: 5706443
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4929217
End bp: 4930323
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 75%
IMG OID: 641273784
Product: glycosyl transferase family protein
Protein accession: YP_001539134
Protein GI: 159039881
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID:
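
The coordinates and lengths in this record agree with each other if the start and end positions are 1-based and inclusive and the CDS ends in a stop codon. Both are assumptions about the record's conventions, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = cds_length(4929217, 4930323)
print(gene_len)                  # 1107
print(protein_length(gene_len))  # 368
```

Both values match the Gene Length and Protein Length fields listed for Sare_4362.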


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.03216
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGCCC CGCACCTGCT CGGCCCGGTC GCCGAGCGCG TATCCGCGGT CGAACGGATC 
GCCGTGCTGC GGGCCAACGC GCTCGGCGAC TTCATCTTCG TCCTGCCGAC GCTGGAGGCG
CTGCGGGCCG CGTACCCCGC CGCGGAGATC GTCCTGCTGG GCGCACCGTG GCACGCGAAG
CTGTGGCGCG ACCGGCCGGG TCCGGTGGAC CGGGTCCTGG TGGTCCCGCC GGCTCCCGGA
ATCCGTCGCC CGGAGCCGGA CGAGCCGGAG TCCGAGTTGG CGGACTTCCT CGCCCGCGCC
CGCAGGGAAC GCTTCGATCT GGCGCTGCAG GTGCACGGCG GTGGGGCCAA CTCCAATCCG
GTCGTGGCCG GCCTCGGCGC CCGGGTCACG GCCGGCCTGC GGGCCGAGGA CGCGCCGCCG
CTGGACCGCT GGCTGCGGTA CGTCTACTAC CAGCACGAGG TGATCCGTTA CCTGGAGGTG
GCGGCCCTGG TGGGCGCTCC GGCGACCACC GTCACTCCCG CCCTGGCGGT TACCGACGCC
GACCGGGCCG AGGCGGCCGA GGTGCTCGGC CCGGCGGACC GGCCCCGGGT GGCCTTGCAT
CCGGGCGCCA CCGACACCCG CCGGCGGTGG CCGGTCGAAC GCTTCGCGGC GGTCGCTCGG
GAACTGCACG GGGACGGGTA CGAGGTGCTG GTCACCGGCA CCCCGGTCGA ACAGAACGAG
GTGGACCGTC TGGTGGCGGC GGCCGGGGTG CCCCTCCGGC CGCAGGTCGG CACGCTCAGC
CTCGGCGGGC TGGCCGGCTG CTACGCCGGC TGCGCGGTGG TGGTCGCCAA CGACACCGGG
CCGCTGCACC TGGCGGCGGC GGTCGGCACC CCCACGGTCG GCGTCTACTG GGTCGGCAAT
TTCATCACCA CGGCGAGCCC GCTGCGCGGC CGGCACCGCC CGATCTGTTC CTGGACGGTG
CTCTGCCCGG TTTGTGGGGT CGACTGCACC CCGGGTAGCT ACCCGCACCG GCCCGGCGAC
GGCGAGTGCC CGCACCGCGA CTCGTTCGTG GCCGACGTCC CGGTGATCGA GGTCCTCGAA
GCCACCCGCG ATCTGCTCGG CGGGTAG
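
The GC content field of this record can be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as plain text in the 10-base blocks shown above; `gc_content` is a hypothetical helper, and the example uses only the first block of the sequence, not the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the Sare_4362 gene sequence.
print(gc_content("ATGGTCGCCC"))  # 70.0
```

Passing the full 1107 bp sequence to the same function gives the value to compare against the GC content field.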
 
Protein sequence
MVAPHLLGPV AERVSAVERI AVLRANALGD FIFVLPTLEA LRAAYPAAEI VLLGAPWHAK 
LWRDRPGPVD RVLVVPPAPG IRRPEPDEPE SELADFLARA RRERFDLALQ VHGGGANSNP
VVAGLGARVT AGLRAEDAPP LDRWLRYVYY QHEVIRYLEV AALVGAPATT VTPALAVTDA
DRAEAAEVLG PADRPRVALH PGATDTRRRW PVERFAAVAR ELHGDGYEVL VTGTPVEQNE
VDRLVAAAGV PLRPQVGTLS LGGLAGCYAG CAVVVANDTG PLHLAAAVGT PTVGVYWVGN
FITTASPLRG RHRPICSWTV LCPVCGVDCT PGSYPHRPGD GECPHRDSFV ADVPVIEVLE
ATRDLLGG
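
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in the start codons it permits). The names `CODON_TABLE` and `translate` are hypothetical, and the function assumes the input starts in frame and is read on the strand as listed.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last stretches of the Sare_4362 gene sequence.
print(translate("ATGGTCGCCC CGCACCTGCT CGGCCCGGTC"))  # MVAPHLLGPV
print(translate("GCCACCCGCG ATCTGCTCGG CGGGTAG"))     # ATRDLLGG
```

Both fragments agree with the start and end of the protein sequence listed above.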