Gene SAG1707 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1707 
Symbol 
ID1014516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1700825 
End bp1702324 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content38% 
IMG OID637316875 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionNP_688697 
Protein GI22537846 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00395265 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACAA AAGCAAAACA TGATATTCAT GGCAAACCAT ATAATCGTAC AGCCATGATC 
ACCTTATTGT TGATTGCTAC ATTTGCAGGG GTATTAAATC AGACAAGTTT GGGAACGGCT
ATCCCAACTT TGATGAATAG CTTTAACATT TCGTTGTCAA CGGCTCAACA AGCAACAACA
TGGTTTTTAT TAGCGAATGG GATTATGATT CCTGTTTCAG CTTATTTAGC GACACGTTTT
TCAACAAAAT GGTTGTATGT GACATCCTAC GTTGTTTTAC TTATTGGACT GTTAATGACG
ACGTTAGCAC CTACATCTAA CTGGAACCTT TTTTTAGTAG GACGTATTAT TCAAGCGATT
TCTGTAGGGA TTTCTATGCC TTTGATGCAG GTTGTTATGG TGAATGTTTT TCCTCCAGAG
CAGCGTGGTG CCGCTATGGG ACTTAATGGT TTGGTGGTTG GTCTTGCCCC AGCTATTGGG
CCTACATTAG CAGGGTGGAT TTTAAAACAA GAATTTCATT TTGCAGGGCA TGATTTAACA
TGGCGTGCAA TTTTCCTTCT TCCTTTACTA ATTTTAACGG TCACTACAAT TTTATCCCCC
TTTGTTCTAA AAGATGTGGT TGATAATAAG TCAGTTAAAT TGGAAGTGCC TTCCCTTATC
CTTTCAATAA TTGGCTTTGG TAGCTTTTTG TGGGGCTTCA CAAATGTGGC AACTTATGGA
TGGGGAGATA TTGGATATGT TATTTCTCCT ATTATGGTTG GTATTATTTT TATCGCCTTA
TTCATTCATC GTCAATTAAA ACTAGAAACA CCGTTCTTAG ATATCCGTGT TTTCAAAAAT
AAACAATTCT CAGTAACAAC AGCAGCTATT GCACTTTCAA TGATGGCGAT GATGGGTGTT
GAAATGATGC TACCGCTTTA CTTGCAAAAT GTTCATGGTC TCTCTGCACT TGATTCTGGT
TTAGCTTTAT TACCAGGTGC CTTGATGATG GGGATAGTTA GTCCAATTTC TGGAGCTGTA
TATGATAAAG TTGGTGCAAG ACGTATGGCT ATGATTGGTT TTACCATACT AGGTGTAGCG
ACGTTACCTT TTGTTTTCTT AACGACTACA ACACCAGATC ATTTTATTAC CCTCTTATAT
GCAGTACGTA TGTTTGGTAT TGCTATGGTT ATGATGCCAT TGACAGCAAG TGCGATGAGC
GCACTTCCAC CACATGAGGC CGCACATGGA ACAGCAGCTA ATAATACTGC TCGTCAAATT
GCTTCAGCAG TTGTAGTTGC ATTGCTTTCA AGTGTTGCTC AAAATATTAT CACCAACAAT
AAACCATCAA AAGATTTGCT AACAATGAAT CCTTTAAAAT ATGCAAATCA GATGTTGAAT
GCTAGTCTTG ATGGCTTCCA TGTTTCTTTT GCCATTGGTT TTGTATTTGC GGTTCTTGGT
TTACTGGTAT CTCTTTTCCT GAGAAAAGGG AAAATTATTG AGACGGAAAA GGAGGTATAG
 
Protein sequence
MTTKAKHDIH GKPYNRTAMI TLLLIATFAG VLNQTSLGTA IPTLMNSFNI SLSTAQQATT 
WFLLANGIMI PVSAYLATRF STKWLYVTSY VVLLIGLLMT TLAPTSNWNL FLVGRIIQAI
SVGISMPLMQ VVMVNVFPPE QRGAAMGLNG LVVGLAPAIG PTLAGWILKQ EFHFAGHDLT
WRAIFLLPLL ILTVTTILSP FVLKDVVDNK SVKLEVPSLI LSIIGFGSFL WGFTNVATYG
WGDIGYVISP IMVGIIFIAL FIHRQLKLET PFLDIRVFKN KQFSVTTAAI ALSMMAMMGV
EMMLPLYLQN VHGLSALDSG LALLPGALMM GIVSPISGAV YDKVGARRMA MIGFTILGVA
TLPFVFLTTT TPDHFITLLY AVRMFGIAMV MMPLTASAMS ALPPHEAAHG TAANNTARQI
ASAVVVALLS SVAQNIITNN KPSKDLLTMN PLKYANQMLN ASLDGFHVSF AIGFVFAVLG
LLVSLFLRKG KIIETEKEV