Gene Information |
Locus tag | SAG2058 |
Symbol | |
ID | 1014869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus agalactiae 2603V/R |
Kingdom | Bacteria |
Replicon accession | NC_004116 |
Strand | + |
Start bp | 2037559 |
End bp | 2038806 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637317224 |
Product | major facilitator family protein |
Protein accession | NP_689044 |
Protein GI | 22538193 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
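The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the unspliced CDS ends with one stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2037559, 2038806)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Both results agree with the Gene Length (1248 bp) and Protein Length (415 aa) fields listed in the record.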
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0107331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCACCAT TAAATATCAA TAAGAATAAT TGGCGCGCCC TAATCGCTGC CATTGTCGCA TCCGGAACCG ATGATTTAAA TATCATGTTC CTTGCATTTT CTATGTCAAC TATCATTACT GATCTTCATT TATCTGCGGC ACAGGCTGGT TGGATTGGGA CAATTACCAA CTTGGGAATG CTCGTGGGAG GACTAATCTT TGGTTTATTA GCTGATCGCT ACAATAAATT TAAAGTTTTC AAATGGACGA TTCTAATCTT TTCAATCGCA ACTGGTTTAG TTTTCTTCAC TACTAATTTA TCATATCTCT ATATCATGCG CTTCATTGCA GGAATTGGCG TTGGTGGCGA ATATGGTATC GCTATTGCCA TTATGGCAGG AATTGTCCCA ACTAATAAAA TGGGCCGGAT TTCTTCACTA AATGGTATCG CTGGTCAAGT TGGATCTATT AGCTCAGCAC TCTTAGCAGG ATGGCTAGCT CCAGCACTCG GTTGGCGGGG ACTTTTTCTC TTTGGCCTTC TCCCTATAGT TCTCGTCCTA TGGATGCAAT TTGCTGTTGA TGATAAAGAT ATCTTAGATC AATATAATAC AGACGCAGAT GATGAACCTC TAGATATCAG TATAAAAGCT TTATTTGATA CTCCTGTATT AGCCACACAG AGCCTAGCAC TAATGGTTAT GACAACTGTT CAGATTGCCG GATACTTTGG CATGATGAAC TGGTTACCAA CTATTATCCA AACCAACTTA AACGTTTCTG TCAAGAATTC ATCATTATGG ATGATTGCAA CCATTCTTGG AATGTGCCTT GGCATGCTAG TATTTGGTCA ATTGTTAGAT AAATTTGGAC CGCGTTTAGT ATACGGGTGT TTCCTTCTAT CATCTGCAAT TTGTGTCTAT CTCTTTCAAT TCGCAACAAC AATGCCTTCT ATGATTATAG GCGGGGCAGT TGTCGGATTC TTTGTTAATG GTATGTTTGC AGGCTATGGT GCCATGATTA CACGTCTCTA TCCACATCAT ATTCGTTCAA CAGCTAACAA TCTTATCTTA AATGTTGGTC GTGCAATAGG TGGCTTTTCA TCTGTTATCA TCGGAATGAT TCTAGACGTT TCAAATGTCT CTATGGTCAT GCTTTTCTTA GCAAGTCTCT ATATCGTTAG TTTTTTATCA ATGCTAAGCA TTAAGCAATT AAAACGTCAA AAATATCACA CTAATTTAAC ACAATTAGAT GTCAAACCAA CTGACTGA
Protein sequence | MSPLNINKNN WRALIAAIVA SGTDDLNIMF LAFSMSTIIT DLHLSAAQAG WIGTITNLGM LVGGLIFGLL ADRYNKFKVF KWTILIFSIA TGLVFFTTNL SYLYIMRFIA GIGVGGEYGI AIAIMAGIVP TNKMGRISSL NGIAGQVGSI SSALLAGWLA PALGWRGLFL FGLLPIVLVL WMQFAVDDKD ILDQYNTDAD DEPLDISIKA LFDTPVLATQ SLALMVMTTV QIAGYFGMMN WLPTIIQTNL NVSVKNSSLW MIATILGMCL GMLVFGQLLD KFGPRLVYGC FLLSSAICVY LFQFATTMPS MIIGGAVVGF FVNGMFAGYG AMITRLYPHH IRSTANNLIL NVGRAIGGFS SVIIGMILDV SNVSMVMLFL ASLYIVSFLS MLSIKQLKRQ KYHTNLTQLD VKPTD
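The sequences above are displayed in space-separated blocks of 10 characters, so the spaces must be removed before any length or composition check. The following is a minimal sketch of that cleanup together with a GC percentage calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and how the record itself rounds its GC content value is not stated in the source.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into display blocks."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three blocks and the final block of the gene sequence shown above.
head = clean_sequence("ATGTCACCAT TAAATATCAA TAAGAATAAT")
tail = clean_sequence("CTGACTGA")

print(len(head))   # 30
print(head[:3])    # ATG
print(tail[-3:])   # TGA
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field of the record.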