Gene Sbal223_3801 details


Gene Information

Locus tag: Sbal223_3801
Symbol:
ID: 7088836
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 4503410
End bp: 4504645
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 53%
IMG OID: 643462680
Product: major facilitator superfamily MFS_1
Protein accession: YP_002359701
Protein GI: 217974950
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
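
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes its stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(4503410, 4504645)
print(length)                  # 1236
print(protein_length(length))  # 411
```

Both values agree with the Gene Length and Protein Length fields of this record.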


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0888448
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAACCG TTCAACATCA GGTTCAAAAT CCATCTGTCC AAAATCAACA CAAGGGCGCC 
AGTACACTAT GGGTGTTGCT CGCCTTAGCG CTCGGGACGT TTGCCTTAGG TACCACAGAA
TTTGCTTCCA TGACTCTAGT GCCCTATATC GCCAGCGATT TAGGCGTAGA GGTTGCACAT
GTGAGTTATG CCATTAGCGC CTATGCGCTG GGGGTTGTGG TCGGTTCGCC GATTATTATG
GTGCTAGCGG TTAGGGTCAG GCGGCGAACA CTCTTGATTG CTTTAGCTGC CTTAATGGCG
GTGGCCAATG GCTTAAGTGC GTTAGCGCCA TCATTGAATT GGTTAATCTT TTTTCGTTTT
CTCAGTGGCT TGCCCCACGG TGCTTATTTC GGCGTGGCTA TGTTGCTCGC CGCCTCTTTA
GTGCCGCCAG AAATGAAGGC CCGCGCCGTA TCGCGGGTGA TTATTGGCCT TACGCTGGCG
ACGATTATCG GTGTGCCGTT TGCCACTTGG ATGGGGCAAA CTGTGGGCTG GCGCTCAGGC
ATTGGCATAG TGGCGATTTT GGCGACTATT ACCGCTGTGA TGGTGTATTT TTTAGCGCCT
GATCAGGCCG TGGCCGCTGA TGCGAGTCCC AGAAAAGAGC TACAAACCCT GAAGAATCGT
GAAGTCTGGT TGACGCTTGG CATCGCTGCG ATTGGCTTTG GCGGTATCTT TTGCGTGTAT
ACCTATCTGG CTGAAACCTT AATCCAAGTG ACGCAAGTCG AGCCGTTTAA GATCCCGATC
ATGATGGCGG TATTTGGTAT TGGCGCAACA TTGGGCACGC TAGTGTGTGG CTGGGCGGCG
GATAAGTCGG CCTTAGCGGC GGCGTTTTGG TCGTTAGTGT TAAGCACTGT GGTATTAGCG
ATTTACCCGA GTTTGACCGG ACATTATTGG GCGCTGATGC CCGTAGTATT CTTTGTCGGT
TGTGGCTTGG GACTTGCCAC CATAGTGCAA GCAAGATTGA TGGATGTGGC GCCCGATGGG
CAGGCCATGA CAGGTGCGTT AGTGCAATGT GCCTTTAATC TCGCCAATGC TATTGGTCCT
TGGGTGGGCA GTTTAGTGAT CCTGTCTGGA CAAGGGATTG CCGCGACAGG TTATGCGGCG
TCTTTGTTGT CATTAGGAGG ACTTGTGATG TGGTGGCTGA CCCACAGGGA GAGTCGCCGC
GCGGTGAGCT TAAACACAGC CAATTGCGCT GACTAG
 
Protein sequence
MKTVQHQVQN PSVQNQHKGA STLWVLLALA LGTFALGTTE FASMTLVPYI ASDLGVEVAH 
VSYAISAYAL GVVVGSPIIM VLAVRVRRRT LLIALAALMA VANGLSALAP SLNWLIFFRF
LSGLPHGAYF GVAMLLAASL VPPEMKARAV SRVIIGLTLA TIIGVPFATW MGQTVGWRSG
IGIVAILATI TAVMVYFLAP DQAVAADASP RKELQTLKNR EVWLTLGIAA IGFGGIFCVY
TYLAETLIQV TQVEPFKIPI MMAVFGIGAT LGTLVCGWAA DKSALAAAFW SLVLSTVVLA
IYPSLTGHYW ALMPVVFFVG CGLGLATIVQ ARLMDVAPDG QAMTGALVQC AFNLANAIGP
WVGSLVILSG QGIAATGYAA SLLSLGGLVM WWLTHRESRR AVSLNTANCA D
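
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input. Note that the gene begins with TTG, an alternative start codon under translation table 11, which is why the protein begins with M.

```python
def clean_sequence(blocks: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocks.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "TTGAAAACCG TTCAACATCA GGTTCAAAAT CCATCTGTCC AAAATCAACA CAAGGGCGCC"

seq = clean_sequence(first_line)
print(len(seq))                 # number of bases on this line
print(seq[:3])                  # start codon
print(gc_fraction(first_line))  # GC fraction of this line only
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.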