Gene SAG1088 details

Gene Information

Locus tag: SAG1088
Symbol:
ID: 1013892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1095548
End bp: 1096888
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 36%
IMG OID: 637316270
Product: EmrB/QacA family drug resistance transporter
Protein accession: NP_688097
Protein GI: 22537246
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
            [TIGR01167] LPXTG-motif cell wall anchor domain
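
The coordinate and length fields in this record are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are not part of the record.

```python
# Values copied from the record above.
start_bp = 1095548
end_bp = 1096888

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Number of codons in the CDS; a non-zero remainder would mean a partial codon.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1341 0 446
```

Under these assumptions the result agrees with the listed gene length of 1341 bp and protein length of 446 aa.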


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTAAAA AGGGATGGCG TATTATGCCT GCCATAATTA CAACGGCTAT ATTAGGTTTT 
TCAGGGATTC TTATTGAAAC ATCTATGAAT GTCACTTTTC CGCTTTTGAT GAAAGAATTT
GGGGTTAACC CCGCTGTTAT TCAATGGGTG ACAACAGGAA ATTTGTTAGC TGTTGCTGTG
ACTGTGCCTT TGTCAGCATT TATGATTAAA AATTTATCAG AGAGACAAAT TTTTACATTG
GCAAATGTGC TTTTTTTATC TGGTGTTTTA ATTGATAGCT TTGCCCCTAA TTTAGCTATA
TTGTTAGTAG GGAGAGTTTT GCAAGGTGTT GGGACAGGAT TAGCCCTCCC TTTATTGTTT
CATATCATTC TTACACAAAT CCCGATGGAG CGTAGAGGAC TAATGATGGG AGTAGCTGCT
ATGGTCACAC TTTTAGCACC GGCAGTTGGA CCTACTTATG GCGGTGTTAT TTCAGGAATG
TTAGGATGGA AGATGATTTT TATGCTTCTG GCACCAATAC TTATCATATC CACCTTTATA
GGTTTGGCTT CTATTCCCAA ACGTCAAGTA AGAATTAATG ATAAACTCAA TTTTCCTGCC
TTTATCAGTT TAGGTATTGG CTTGGCAACC CTTCTTTTAG CTATTGAAAA GATGTCTATT
TTTTACTTAT TAGTAGCTAT TGTTAGTTTT GTTATTTTTT ACTATTTAAA TAAACAGCTA
GAATTTTTGA ATTTGAATGT TTTTAAAGAT AAAGATTTCT CAATCTTATT GTATGGCGTC
CTGGCTTTTC AAATGATTCC TCTAGCACTT TCGTTCTTAT TACCTAACCT CTTGCAACTT
GTTTTACATC AAACTTCAAC CAAAGCTGGT TTGTTTATGT TTCCAGGTGC AATAGCAGTA
GTTTTTTTAT CCCCTTTTGC AGGTTATCTC CTGGATAAAA TTGGTGCATT TAAGCCAATT
ATGATAGGCA TCTCCCTTTC TTTGATAGGT TTAATTGGTA CAGCTATATT CATTCCTGCG
AAGTCTGTTG TAGTACTTTT AGCCTTTGAT ATCCTTACTA AAATTGGTAT GGGGATTGGA
GCAAGTAATA TGGTTACGAC AGCTTTAACA AAACTAAAGC CAGCACAGTC AGCGGATGGT
AATAGTATCT TGAATACACT ACAACAATTT GCGGGAGCTT TTGCAACCGC AGTAGCCTCA
CAAATTTTTA CCATCGGACA AGTAGCTATT CCGAAAAATG GAGCTATAAT TGGTAGTCAA
TTTGCAGTTC TATTCGTTAT CGTTGTTGTT ATCTTAGCTA TTGTAGGATT AACTTATCTT
CGAAAAAGAA AAGCAATATA A
 
Protein sequence
MTKKGWRIMP AIITTAILGF SGILIETSMN VTFPLLMKEF GVNPAVIQWV TTGNLLAVAV 
TVPLSAFMIK NLSERQIFTL ANVLFLSGVL IDSFAPNLAI LLVGRVLQGV GTGLALPLLF
HIILTQIPME RRGLMMGVAA MVTLLAPAVG PTYGGVISGM LGWKMIFMLL APILIISTFI
GLASIPKRQV RINDKLNFPA FISLGIGLAT LLLAIEKMSI FYLLVAIVSF VIFYYLNKQL
EFLNLNVFKD KDFSILLYGV LAFQMIPLAL SFLLPNLLQL VLHQTSTKAG LFMFPGAIAV
VFLSPFAGYL LDKIGAFKPI MIGISLSLIG LIGTAIFIPA KSVVVLLAFD ILTKIGMGIG
ASNMVTTALT KLKPAQSADG NSILNTLQQF AGAFATAVAS QIFTIGQVAI PKNGAIIGSQ
FAVLFVIVVV ILAIVGLTYL RKRKAI
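
The correspondence between the gene sequence and the protein sequence can be spot-checked by translating the first line of the gene sequence. The following is a minimal sketch, not the pipeline used to produce this record. The `translate` helper and `CODON_TABLE` are hypothetical names introduced for illustration. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may initiate translation.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGACTAAAA AGGGATGGCG TATTATGCCT GCCATAATTA CAACGGCTAT ATTAGGTTTT"
peptide = translate(first_line)
print(peptide)  # MTKKGWRIMPAIITTAILGF
```

The output matches the first 20 residues of the protein sequence (MTKKGWRIMP AIITTAILGF). Passing the full gene sequence to the same function would extend the check to the whole protein.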