Gene Sare_0398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0398 
Symbol 
ID5705657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp458617 
End bp459963 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content72% 
IMG OID641269923 
Productglycosyl transferase group 1 
Protein accessionYP_001535318 
Protein GI159036065 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.348364 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00367672 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCGGAAC AGCACACCGG TGTCGGTCGT CAGCGAGGTG CCCGTCCGTG GCCCCGACCC 
CGCCGCGTCG CGACACTCTC CGTGCACACG TCGCCGCTGC ACCAGCCCGG CACCGGTGAT
GCCGGGGGGA TGAACGTCTA CATCCTCGAA GTCGCACGGC GGCTGGCCGA AGCGGACGTG
GAGGTCGAGA TCTTCACCCG GGCGACCTCA GCCGATGTGC CACCGGTGGT CGAGATGATG
CCGGGCGTGC ACGTCCGGAA CATCATCTCC GGCCCGCTGG GTGGGTTGAC CAAGGAGGAA
CTGCCCGGCC AGCTCTGCGC GTTCACGGCG GGGGTGCTGC GGGCCGAGGC GTCCCGAGCC
GCCGGGCACT ACGACCTCAT CCACTCGCAC TACTGGCTGT CCGGGCAGGT TGGCTGGCTG
GCCAAGGAGC GTTGGGGAGT TCCGCTGGTG CACACCGCGC ACACCCTCGC CAAGGTCAAG
AACGCACAGC TCGCCGCCGG CGACCGGCCG GAGCCCAAGG CTCGGGTGAT CGGCGAGGAG
CAGGTGGTGG CCGAGGCCGA CCGGCTGGTC GCCAACACCA AGACCGAGGC CGGCGACCTG
ATCGACCGGT ACGACGCCGA CCCGACCCGC GTCGAGGTGG TCGAGCCGGG TGTGGACCTG
GCCCGGTTCA CCCCCGCGGC TGGCGACCGA TCCCGGGCGC AGGCCCTCGC CCGCCGTCGG
TTGGGCCTGC CCGAGCGTGG GTACGTGGTG GCGTTCGTCG GCCGGGTCCA GCCGCTCAAG
GCACCGGATG TGCTGATCCG CGCGGCGGCG GCACTGCGTC AGCGGGATCC GGCGCTCGCC
GAGGAACTGA CCGTGGTCGT CTGCGGTGGC CCCAGCGGTA GCGGGCTCGA CCGGCCGACC
CACCTGATCG AGCTGGCCGC CTCGTTGGGT GTCACCGACA GCGTACGGTT TCTGCCGCCG
CAGACCGGCG ACGACCTGCC CGCCCTGTAC CGCGCGGCCG ACCTGGTAGC GGTCCCGTCC
TACAACGAGA GCTTCGGGCT GGTCGCGTTG GAGGCGCAGG CATGCGGCAC GCCGGTGGTG
GCGGCCGCTG TCGGCGGCTT GGTCACCGCG GTACGTGACC AGGTCAGCGG TGTCCTCGTC
GACGGCCATG ACCCGGCCGT GTGGGCCCGT ACGCTGAGCC GTCTGCTGCC GGACACCGGA
CTGCGTGCGA CGTTGGCCCA GGGCGCGCGA CGCCATGCCT GCAACTTCTC CTGGGACCGG
ACGGTCAGCG GCCTGTTGGA GGTCTACGGC GAGGCGGTCG CCGCGTACGG ACCCCAATCG
TCCGAGCTCG CCACCTGTTC TTGTTGA
 
Protein sequence
MAEQHTGVGR QRGARPWPRP RRVATLSVHT SPLHQPGTGD AGGMNVYILE VARRLAEADV 
EVEIFTRATS ADVPPVVEMM PGVHVRNIIS GPLGGLTKEE LPGQLCAFTA GVLRAEASRA
AGHYDLIHSH YWLSGQVGWL AKERWGVPLV HTAHTLAKVK NAQLAAGDRP EPKARVIGEE
QVVAEADRLV ANTKTEAGDL IDRYDADPTR VEVVEPGVDL ARFTPAAGDR SRAQALARRR
LGLPERGYVV AFVGRVQPLK APDVLIRAAA ALRQRDPALA EELTVVVCGG PSGSGLDRPT
HLIELAASLG VTDSVRFLPP QTGDDLPALY RAADLVAVPS YNESFGLVAL EAQACGTPVV
AAAVGGLVTA VRDQVSGVLV DGHDPAVWAR TLSRLLPDTG LRATLAQGAR RHACNFSWDR
TVSGLLEVYG EAVAAYGPQS SELATCSC