Gene SbBS512_E4114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4114 
SymbolsetC 
ID6270772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3839107 
End bp3840141 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content46% 
IMG OID641727943 
Productsugar efflux transporter C 
Protein accessionYP_001882374 
Protein GI187731533 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00899] sugar efflux transporter 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.741083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGGTT TTTTCTTCAC CGGTAGCGCT ATTATGGGAA TTCTGGTCAG TCAATTTCTG 
GCAAGGCACT CCGATAAACA AGGCGACCGT AAATTACTGA TTCTGCTATG TTGCTTATTT
GGAGTGCTGG CCTGCACGCT TTTTGCGTGG AATCGCAACT ACTTCATTCT CCTCTCTACG
GGCGTACTTC TGAGTAGTTT TGCTTCTACC GCAAACCCGC AAATGTTCGC CCTCGCCCGT
GAACACGCCG ACAGAACAGG CCGTGAGACG GTCATGTTCA GTACATTTTT ACGTGCTCAG
ATCTCGCTTG CCTGGGTTAT CGGGCCACCG CTCGCTTATG AACTGACAAT AAGGTTTAGT
TTTAAAGTGA TGTATCTCAC CGCTGCCATC GCATTTGTTG TTTGCGGGCT GATAGTCTGG
TTGTTTTTGC CATCAATACA AAGAAATATT CCTGTCGTTA CCCAACCCGT AGAAATTTTA
CCCTCCACCC ATAGGAAGCG GGATACGCGG CTACTTTTTG TGGTCTGTTC AATGATGTGG
GCGGCGAATA ATCTCTACAT GATAAATATG CCGCTATTTA TTATTGATGA ACTGCATCTA
ACCGATAAAC TGGCTGGAGA AATGATTGGT ATCGCTGCCG GTCTGGAAAT TCCGATGATG
TTAATCGCAG GCTATTACAT GAAACGTATT GGCAAGCGAC TATTAATGCT CATTGCTATC
GTGAGTGGTA TGTGTTTTTA CGCCAGCGTA CTCATGGCGA CGACTCCGGC GGTTGAGCTG
GAATTGCAAA TTCTAAATGC CATCTTCCTT GGTATTCTCT GTGGTATCGG CATGCTTTAT
TTTCAGGACT TGATGCCTGA AAAAATAGGC TCTGCGACAA CGTTATATGC AAATACTTCA
CGCGTCGGCT GGATTATCGC CGGCTCTGTT GACGGAATTA TGGTTGAAAT CTGGAGCTAC
CATGCGTTGT TCTGGCTGGC GATAGGGATG TTGGGTATTG CGATGATTTG CCTGCTGTTT
ATTAAAGATA TTTAG
 
Protein sequence
MVGFFFTGSA IMGILVSQFL ARHSDKQGDR KLLILLCCLF GVLACTLFAW NRNYFILLST 
GVLLSSFAST ANPQMFALAR EHADRTGRET VMFSTFLRAQ ISLAWVIGPP LAYELTIRFS
FKVMYLTAAI AFVVCGLIVW LFLPSIQRNI PVVTQPVEIL PSTHRKRDTR LLFVVCSMMW
AANNLYMINM PLFIIDELHL TDKLAGEMIG IAAGLEIPMM LIAGYYMKRI GKRLLMLIAI
VSGMCFYASV LMATTPAVEL ELQILNAIFL GILCGIGMLY FQDLMPEKIG SATTLYANTS
RVGWIIAGSV DGIMVEIWSY HALFWLAIGM LGIAMICLLF IKDI