Gene Sare_1665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1665 
Symbol 
ID5703435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1917041 
End bp1918051 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content74% 
IMG OID641271169 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_001536544 
Protein GI159037291 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGG AGAGCGTGGC CGCCGACGTG GTCGCGGAGC TGGCCGACCT GGCGGTGTGG 
TTCCCGACGC CGGCGGGGGT GGTACGGGCC GTGGACGGGG TGTCCCTCCG GGTGCGCCGT
GGCGAGACGC TGGGCCTGGT CGGCGAGTCC GGCAGCGGCA AGTCGACGAC GGGGCTGGCG
CTGCTGCGCC TGGTCGAGCC CACCGCCGGC GAGGTCCGGG TGGCGGGGCA GGACGTGACC
CGCTGGTCAC GGCGGCGGTT GCGGCGGTTG CGCCGTCGCG TCGCCATGGT GTTCCAGGAT
CCGCAGGCTT CGCTCGATCC GCGGCACACG GTCGGGGCGA GCATCGCCGA GCCGCTGGCC
GTGCACCGGC TCACCGCCGG TGGCTCGGCC CGCCGCGAGC GGGTGGCCGA GCTGCTCGAC
CTGGTCGGCC TGCGCCGCGA TCTCGCCGAC CGGCACCCGC ACGAGCTCTC CGGCGGCCAG
CGGCAGCGGG TGGGTATCGC GCGGGCCCTG GCCGGCGAGC CGGACCTGAT CGTTCTCGAC
GAACCGATCG CCTCCCTGGA CCTGAGTGTG CAGGCACAGA TCATGAACCT GCTCCGGGGA
CTCCAGCGGG AGTTGGGGCT GACCTATCTC TTCATCTCCC ACGACCTCGC CGCTGTCGAG
CACATGAGCG ACCGGGTGGC CGTGATGTAC CTCGGCCGGA TCGTGGAGAG CGGTACGCCG
GCACAGATCT GGCGAGAGCC CGCCCATCCG TACACCGCCG CGCTCCTGTC GGCCGTGCCG
GTGGCAGATC CGCCGGTGCA GCGCGGTCGG CAGCGGATCA TCCTCGCCGG TGACGTCCCG
AGCCCGATCG ACCCGCCCAC CGGCTGCCGC TTCCGGACGC GGTGTCCGCA GGCGCGGCCC
GACTGCGCCC GGACCGATCC GGTGCTGGTC GAGATCGGCT CGGGACACCA AGCGGCCTGC
CTGTTCGCGG GCGAGGCGGT GCGGGCGATG CGGGCGGACA CGGCCCGGTA G
 
Protein sequence
MTGESVAADV VAELADLAVW FPTPAGVVRA VDGVSLRVRR GETLGLVGES GSGKSTTGLA 
LLRLVEPTAG EVRVAGQDVT RWSRRRLRRL RRRVAMVFQD PQASLDPRHT VGASIAEPLA
VHRLTAGGSA RRERVAELLD LVGLRRDLAD RHPHELSGGQ RQRVGIARAL AGEPDLIVLD
EPIASLDLSV QAQIMNLLRG LQRELGLTYL FISHDLAAVE HMSDRVAVMY LGRIVESGTP
AQIWREPAHP YTAALLSAVP VADPPVQRGR QRIILAGDVP SPIDPPTGCR FRTRCPQARP
DCARTDPVLV EIGSGHQAAC LFAGEAVRAM RADTAR