Gene SAG1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1049 
Symbol 
ID1013853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1058869 
End bp1060410 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content36% 
IMG OID637316232 
ProductABC transporter, ATP-binding protein 
Protein accessionNP_688059 
Protein GI22537208 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000311302 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTT TAGAAGTAAA AAACTTAAGT CACGGTTTTG GTGATCGTGC TATTTTTGAG 
AATGTTTCTT TTCGCCTTTT AAAAGGTGAA CATATAGGAC TTATTGGTGC TAATGGTGAA
GGAAAATCAA CATTCATGTC AATTGTCACA GGTCATCTTC AACCTGATGA AGGAAAAATT
GAATGGTCAA AATATGTAAC TGCTGGTTAC CTTGATCAAC ATACTGTATT AGAAGAAGGT
CAAACAGTTC GTGATGTTTT GCGAACAGCA TTTGATGAAC TTTTTAAAAC AGAGGCACGT
ATCAATGATA TATATATATC CATGGCAGAT GATGGCGCAG ATGTAGATAG CCTTATGGAA
GAAGTTGGAG AACTACAAGA TCGTTTAGAA TCACGTGATT TTTATACACT TGATGCTAAA
ATTGATGAAG TTGCACGGGC ATTAGGAGTT ATGGATTTTG GTATGGATTC TGATGTTACT
GAATTATCTG GTGGTCAAAG GACAAAGGTT CTTTTAGCAA AACTGTTATT AGAAAAGCCA
GACATCCTCC TGCTTGATGA GCCTACAAAC TACCTTGATG CTGAACACAT TGATTGGTTA
AAACGCTATT TACAAAATTA TGAAAATGCT TTTGTTTTAA TCTCACACGA TATTCCATTT
TTGAACGATG TTATCAATAT TGTCTATCAT GTTGAAAATC AAGATTTAGT AAGATATTCC
GGTGATTATA CTAATTTTGA ATCAGTCTAT GCAATGAAAA AGGCTCAATT AGAAGCAGCC
TATGAACGTC AACAAAAGGA AATAGCTGAT TTACAAGATT TTGTTAATCG TAATAAAGCA
CGCGTCGCAA CACGTAATAT GGCAATGTCT CGTCAAAAGA AACTAGACAA AATGGATATT
ATTGAGTTAC AAGCAGAGAA ACCAAAGCCA AGTTTCGAAT TTAAAGAATC TCGCACACCA
GGACGTTTTA TTTTCCAAGC TAAAGATCTT CAAATTGGAT ATGATCGTGC TTTAACAAAA
CCTCTAAACC TAACATTTGA ACGTAATCAA AAAATAGCTA TTGTTGGAGC AAATGGGATA
GGAAAAACAA CATTACTTAA GAGTTTACTA GGTATTATCC CCCCTATTTC TGGTAATGTT
GAACGAGGTG ATTTTATTGA TTTAGGTTAT TTTGAACAAG AAGTACCCGG AGGAAATCGC
CAAACTCCAC TCGAAGCAGT ATGGGATGCT TTTCCTGCTT TGAACCAAGC AGAGGTTAGG
GCAGCTCTTG CTCGTTGTGG TTTAACTTCA AAGCATATTG AAAGTCAGAT TCAGGTGTTA
TCTGGTGGGG AGCAATCAAA GGTCAGATTT TGCTTATTAA TGAATCGCGA AAATAATGTC
CTTGTACTTG ACGAACCAAC CAATCATTTA GATGTAGATG CTAAAGATGA GTTAAAACGT
GCTTTAAAAG CTTATAAGGG ATCAATCTTG ATGGTATGTC ATGAACCGGA CTTTTATGAA
GGATGGATGG ATGATGTTTG GGACTTTAAC CAACTTAGTT AA
 
Protein sequence
MSILEVKNLS HGFGDRAIFE NVSFRLLKGE HIGLIGANGE GKSTFMSIVT GHLQPDEGKI 
EWSKYVTAGY LDQHTVLEEG QTVRDVLRTA FDELFKTEAR INDIYISMAD DGADVDSLME
EVGELQDRLE SRDFYTLDAK IDEVARALGV MDFGMDSDVT ELSGGQRTKV LLAKLLLEKP
DILLLDEPTN YLDAEHIDWL KRYLQNYENA FVLISHDIPF LNDVINIVYH VENQDLVRYS
GDYTNFESVY AMKKAQLEAA YERQQKEIAD LQDFVNRNKA RVATRNMAMS RQKKLDKMDI
IELQAEKPKP SFEFKESRTP GRFIFQAKDL QIGYDRALTK PLNLTFERNQ KIAIVGANGI
GKTTLLKSLL GIIPPISGNV ERGDFIDLGY FEQEVPGGNR QTPLEAVWDA FPALNQAEVR
AALARCGLTS KHIESQIQVL SGGEQSKVRF CLLMNRENNV LVLDEPTNHL DVDAKDELKR
ALKAYKGSIL MVCHEPDFYE GWMDDVWDFN QLS