Gene SAG2058 details


Gene Information

Locus tag: SAG2058
Symbol:
ID: 1014869
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 2037559
End bp: 2038806
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 39%
IMG OID: 637317224
Product: major facilitator family protein
Protein accession: NP_689044
Protein GI: 22538193
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
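
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 2037559
END_BP = 2038806
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (unspliced CDS assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1248
print(expected_protein_length(GENE_LENGTH_BP))  # 415
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.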


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0107331
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACCAT TAAATATCAA TAAGAATAAT TGGCGCGCCC TAATCGCTGC CATTGTCGCA 
TCCGGAACCG ATGATTTAAA TATCATGTTC CTTGCATTTT CTATGTCAAC TATCATTACT
GATCTTCATT TATCTGCGGC ACAGGCTGGT TGGATTGGGA CAATTACCAA CTTGGGAATG
CTCGTGGGAG GACTAATCTT TGGTTTATTA GCTGATCGCT ACAATAAATT TAAAGTTTTC
AAATGGACGA TTCTAATCTT TTCAATCGCA ACTGGTTTAG TTTTCTTCAC TACTAATTTA
TCATATCTCT ATATCATGCG CTTCATTGCA GGAATTGGCG TTGGTGGCGA ATATGGTATC
GCTATTGCCA TTATGGCAGG AATTGTCCCA ACTAATAAAA TGGGCCGGAT TTCTTCACTA
AATGGTATCG CTGGTCAAGT TGGATCTATT AGCTCAGCAC TCTTAGCAGG ATGGCTAGCT
CCAGCACTCG GTTGGCGGGG ACTTTTTCTC TTTGGCCTTC TCCCTATAGT TCTCGTCCTA
TGGATGCAAT TTGCTGTTGA TGATAAAGAT ATCTTAGATC AATATAATAC AGACGCAGAT
GATGAACCTC TAGATATCAG TATAAAAGCT TTATTTGATA CTCCTGTATT AGCCACACAG
AGCCTAGCAC TAATGGTTAT GACAACTGTT CAGATTGCCG GATACTTTGG CATGATGAAC
TGGTTACCAA CTATTATCCA AACCAACTTA AACGTTTCTG TCAAGAATTC ATCATTATGG
ATGATTGCAA CCATTCTTGG AATGTGCCTT GGCATGCTAG TATTTGGTCA ATTGTTAGAT
AAATTTGGAC CGCGTTTAGT ATACGGGTGT TTCCTTCTAT CATCTGCAAT TTGTGTCTAT
CTCTTTCAAT TCGCAACAAC AATGCCTTCT ATGATTATAG GCGGGGCAGT TGTCGGATTC
TTTGTTAATG GTATGTTTGC AGGCTATGGT GCCATGATTA CACGTCTCTA TCCACATCAT
ATTCGTTCAA CAGCTAACAA TCTTATCTTA AATGTTGGTC GTGCAATAGG TGGCTTTTCA
TCTGTTATCA TCGGAATGAT TCTAGACGTT TCAAATGTCT CTATGGTCAT GCTTTTCTTA
GCAAGTCTCT ATATCGTTAG TTTTTTATCA ATGCTAAGCA TTAAGCAATT AAAACGTCAA
AAATATCACA CTAATTTAAC ACAATTAGAT GTCAAACCAA CTGACTGA
 
Protein sequence
MSPLNINKNN WRALIAAIVA SGTDDLNIMF LAFSMSTIIT DLHLSAAQAG WIGTITNLGM 
LVGGLIFGLL ADRYNKFKVF KWTILIFSIA TGLVFFTTNL SYLYIMRFIA GIGVGGEYGI
AIAIMAGIVP TNKMGRISSL NGIAGQVGSI SSALLAGWLA PALGWRGLFL FGLLPIVLVL
WMQFAVDDKD ILDQYNTDAD DEPLDISIKA LFDTPVLATQ SLALMVMTTV QIAGYFGMMN
WLPTIIQTNL NVSVKNSSLW MIATILGMCL GMLVFGQLLD KFGPRLVYGC FLLSSAICVY
LFQFATTMPS MIIGGAVVGF FVNGMFAGYG AMITRLYPHH IRSTANNLIL NVGRAIGGFS
SVIIGMILDV SNVSMVMLFL ASLYIVSFLS MLSIKQLKRQ KYHTNLTQLD VKPTD
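
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It assumes the sequence is read in frame from its first base and that internal codons follow the standard assignments, which translation table 11 shares with table 1 (table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (standard codon assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 18 bases and last 9 bases of the gene sequence listed above.
print(translate("ATGTCACCAT TAAATATC"))  # MSPLNI
print(translate("ACTGACTGA"))            # TD
```

The two fragments reproduce the first six residues (MSPLNI) and the last two residues (TD) of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.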