Gene Sare_4236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4236 
Symbol 
ID5708086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4809274 
End bp4810371 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content66% 
IMG OID641273655 
Producthypothetical protein 
Protein accessionYP_001539008 
Protein GI159039755 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0383732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGG GCTTCCTCGC CGCTGCCGCC GTCGGCCTGC TGGCCACCGG CAGCATGACG 
GCCTGTGGCG ACAACTCCAC CGACGGGGAC CAGACCGGTT CCGGCAAGAC CCCGAAGATC
GGCGTGATCC TTCCGGACAG CAAGTCCTCC GCCCGTTGGG AAGGCGCCGA CCGCAAGTTC
CTCCAAGAGG CATTCGCGGA GGCCGGGGTC GAGGCCGACA TCCAGAACGC GCAGGGTGAC
AAGACCCAGT TCCAGACGAT CGCCGACCAG ATGATCACTA AGGGTGTCAC CGCACTGATG
ATCGTCAACC TGGACTCCGG CACCGGCAGA GCCGTCCTCG ACAAGGCCAA GTCGCAGGGT
GTCGCCACCA TCGACTACGA CCGACTGACC CTCGGTGGCT CGGCGGAGTA CTACGTCAGC
TTCGACAACG AGGCCGTCGG CAAACTTCAG GGTGAAGGCC TCGTCAGGTG CCTCACGGAC
AGCGGCGTCG AGAACCCGTC GATCGCGTAC CTGAACGGCT CGCCGACCGA CAACAACGCC
ACTCTGTTCA GGAACGGCTA CGACTCGGTC CTGAAGCCGA AATTCGACGC CGGGGAGTAC
CAACAGGTCG CGGACGACTC CGTGCCGGAC TGGGACAACG CGCAGGCCGC CACCATCTTC
GAACAGCAAC TCACCAAGAC TGGCGGCAAG ATCGACGGGG TGCTCGCGGC CAACGACGGC
CTCGGCAACG CCGCGATCTC GGTGCTGAAG AAGAACAAAC TCAACGGCAA GGTCCCGGTC
ACCGGCCAGG ACGCCACCCC GCAGGGCCTA CAGAACGTTC TCGCCGGGGA CCAGTGCATG
ACCGTCTACA AGGCGATCAA GGAAGAGGCC AGCGCCGCTG CCTCCCTGGC CATCGCGCTC
GCCCAGGGAG AGCGGAAGGA GACCGGCCAG ACGGTCAAGG ACCCGGAGAG TGGCCGGGAC
GTACCCGCCG TGCTGCTCAC CCCCACGGCG GTCTACAAGG AAAACGTCAA GGACATCATC
GCCGACGGCT TCGTGACCAA GGACGAGATC TGCACCGGGG CCTACGCCCC GCTCTGCGCG
AGCGCCGGCA TCAGCTGA
 
Protein sequence
MRKGFLAAAA VGLLATGSMT ACGDNSTDGD QTGSGKTPKI GVILPDSKSS ARWEGADRKF 
LQEAFAEAGV EADIQNAQGD KTQFQTIADQ MITKGVTALM IVNLDSGTGR AVLDKAKSQG
VATIDYDRLT LGGSAEYYVS FDNEAVGKLQ GEGLVRCLTD SGVENPSIAY LNGSPTDNNA
TLFRNGYDSV LKPKFDAGEY QQVADDSVPD WDNAQAATIF EQQLTKTGGK IDGVLAANDG
LGNAAISVLK KNKLNGKVPV TGQDATPQGL QNVLAGDQCM TVYKAIKEEA SAAASLAIAL
AQGERKETGQ TVKDPESGRD VPAVLLTPTA VYKENVKDII ADGFVTKDEI CTGAYAPLCA
SAGIS