Gene Sare_4238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4238 
Symbol 
ID5708088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4811288 
End bp4812550 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content68% 
IMG OID641273657 
Productmonosaccharide-transporting ATPase 
Protein accessionYP_001539010 
Protein GI159039757 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4214] ABC-type xylose transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0284678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0915708 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCA CCGCCCTCCC CGACCAGGAT TCCGCGGCCA CCCCCGCCGC CGGTCCCACC 
CTCGCCAACC ACTTTCGCGG CTACGTCAGC CGGGTACGCG GCGGAGACAT CGGCGCCCTA
CCCGCCGTCC TCGGCCTGAT CGTGCTCTGT ACCGTCTTCT CGATCATGCG GCCGTCGTTC
CTCACCGCGG CCAACTTCGC CAACCTGTTC ACACAGGGGG CGGCGGTCAC GCTGATCGCC
ATGGGGCTGG TCTTCGTCCT GCTGCTCGGC GAGATCGACC TCTCCGCCGG CTTCGCCAGC
GGGGTGTGCG CCGCCGTACT GGCCAACGTG GTCACCGTCC TCGGCTACCC GTGGTACGTC
GCGGTACTCG CCGCCCTCCT CACCGGAGTG GTGATCGGCA GTACGCTTGG CATCCTGGTC
GCGAAGATCG GCATCCCGTC CTTCGTGGTC ACCCTCGCCG GTTTCCTCGC CTTCCAGGGC
CTCGTGCTAC TGCTGATGGA AGACGGCAGT AACATCTCGG TCCGGGATCC GGTGCTGGTG
GCCATCGCGA ACCGAAACCT CCCACCAGCG GTCGGCTGGA TCCTGGCCGG GCTCGCCGTC
GCCGGCTTCG CCACGGTCCA GGCGATGCGG CAGCGCACGC GCGCGCTCCG CGGTCTGGTC
ACCGACCCGC TCGCCGTGGT GCTCGCCCGG GTCGGCGGGC TGGCTGCCGT CCTGGGCACG
ACCGTCTACA TCCTCAACCA GGAACGCAGC TTCAACACTT TGATCAACTC GCTCAAGGGT
GTGCCGATCG TGGTGCCGAT CATCGCGGTG CTGTTGATCG CCTGGACCTT CGTCCTGCGG
CAGACCAGCT ACGGACGGCA CATCTATGCG GTCGGCGGCA ACAGAGAAGC GGCCCGCCGG
GCCGGCATCA ACGTCGACCG GATCCGCATC TCCGTCTTCG TGATCTGTTC CTCGATGGCC
GCGATCGGCG GCATCGTCGC AGCCAGCCGG GCCAACTCGG TCGACCCGAA CACCGGTGGC
AGTAACGTAC TGCTCTACGC CGTCGGTGCG GCGGTGATCG GCGGCACCAG CCTCTTCGGC
GGCAAGGGCC GGGTCCTCGA CGCGGTACTC GGCGGCGCAG TCGTCGCGGT GATCGACAAC
GGGATGGGTC TGATGGGCTA CAGCTCAGGG GTCAAGTACG TGGTCACCGG CGTGGTACTT
CTCCTCGCCG CCAGTGTGGA CGCGCTGTCC CGACGCCGAG CCGCCGCCAG CGGCGGCCGA
TGA
 
Protein sequence
MTSTALPDQD SAATPAAGPT LANHFRGYVS RVRGGDIGAL PAVLGLIVLC TVFSIMRPSF 
LTAANFANLF TQGAAVTLIA MGLVFVLLLG EIDLSAGFAS GVCAAVLANV VTVLGYPWYV
AVLAALLTGV VIGSTLGILV AKIGIPSFVV TLAGFLAFQG LVLLLMEDGS NISVRDPVLV
AIANRNLPPA VGWILAGLAV AGFATVQAMR QRTRALRGLV TDPLAVVLAR VGGLAAVLGT
TVYILNQERS FNTLINSLKG VPIVVPIIAV LLIAWTFVLR QTSYGRHIYA VGGNREAARR
AGINVDRIRI SVFVICSSMA AIGGIVAASR ANSVDPNTGG SNVLLYAVGA AVIGGTSLFG
GKGRVLDAVL GGAVVAVIDN GMGLMGYSSG VKYVVTGVVL LLAASVDALS RRRAAASGGR