Gene Sare_0667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0667 
Symbol 
ID5705003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp746728 
End bp747687 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content72% 
IMG OID641270187 
Producthypothetical protein 
Protein accessionYP_001535580 
Protein GI159036327 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.522504 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCACCC TGCCACGCCG GCTGCCGCCG GAGCCACTGA CAGTCGGTGC GCTCGCCCTC 
GCCGTGCTCG CCGTTTCGTC CTCCGCGCCG CTGGTCGCCT TCGCCGCCGC ACCAGCCCTG
GCGATCGCCT TCTGGCGCAA TCTGCTCTCG GCCGCGGTGC TCGGCCCGGC CGCCCTGGTA
CGGCGGCGGG CGGAGTTGCG GGCGTTGGTG GCACCGGCCG GACGTCGTGC CGGCTGGTAC
TGCGTCTTTT CCGGTGTCGC GCTTGCCGCG CACTTCGCCA CCTGGATGCC GAGCGCCAAG
CTCACCACTG TGGCGGCGGC CACCGCGCTG GTTGCCACCC AACCCGTTTG GCAGGGTTTG
ATCGCACAGG CCCAGGGCCG GCGGCTGCCG AGGGCGGTGT GGGTCGGTGT CGGCGTGGCC
GTCGTCGGCG CGGTACTGGC AACCGGGGCA GACTTCGCGG CGTCCAGGCC GGCGTTCCTT
GGTGACCTGC TCGCGATGAT CGGTGGCATG TTCGCCGCGG TCTACACGGC CCTGGGCGAG
CGGGCTCGCC GCACGGTCAG CACCACCACC TACACCACCG TTTGTTACGG GGTGTGCGCC
TTGCTTCTGC TCGCGGTCTG CTGGGTCGGT GGGGTGCCGC TGTCCGGCTT CGACACCCGC
ACCTGGCTCG CGGTGTTCGC CTTGGTGGCC GGTGCCCAAC TGCTGGGCCA TTCGATGTTC
ACCTACGCGC TGCGACGGGT ATCGGCCACC ACCGTCAGCA TGCTGATTCT GTTGGAGGCG
CCGGGCGCGG CCCTGATCGG CTGGGTCTGG CTGGGTCAGC TGCCGCAACC GTTGGCTCTG
CCGGGCCTGG CGCTGCTCCT GGTGGGTGTC GCGGTGGTGG TCCACACCAG CGCGCGGAAC
CGCCGCATCC GGTCCGTCCC GCCCACCGTG CCCGCCGACG CAGACCCGTC GCAGCGTTAG
 
Protein sequence
MPTLPRRLPP EPLTVGALAL AVLAVSSSAP LVAFAAAPAL AIAFWRNLLS AAVLGPAALV 
RRRAELRALV APAGRRAGWY CVFSGVALAA HFATWMPSAK LTTVAAATAL VATQPVWQGL
IAQAQGRRLP RAVWVGVGVA VVGAVLATGA DFAASRPAFL GDLLAMIGGM FAAVYTALGE
RARRTVSTTT YTTVCYGVCA LLLLAVCWVG GVPLSGFDTR TWLAVFALVA GAQLLGHSMF
TYALRRVSAT TVSMLILLEA PGAALIGWVW LGQLPQPLAL PGLALLLVGV AVVVHTSARN
RRIRSVPPTV PADADPSQR