Gene SbBS512_E0421 details

Gene Information

Locus tag: SbBS512_E0421
Symbol:
ID: 6270729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 411406
End bp: 412698
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 53%
IMG OID: 641724649
Product: amino acid permease family protein
Protein accession: YP_001879198
Protein GI: 187732317
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1113] Gamma-aminobutyrate permease and related permeases
TIGRFAM ID:
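
The coordinate and length fields above can be cross-checked with simple arithmetic. The following Python sketch assumes the start and end positions are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the last codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 411406
END_BP = 412698
GENE_LENGTH_BP = 1293
PROTEIN_LENGTH_AA = 430


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp):
    """Residue count for a CDS whose final codon is a stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


print(span_length(START_BP, END_BP))   # 1293
print(protein_length(GENE_LENGTH_BP))  # 430
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields.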


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAACA CGGAAGGTAA TAACGGTAAC AAACCCCTCG GTCTATGGAA TGTCGTTTCC 
ATCGGTATTG GGGCAATGGT GGGGGCGGGG ATCTTCGCGC TGCTGGGGCA GGCTGCGTTG
CTAATGGAAG CCTCGACCTG GGTCGCCTTT GCTTTTGGCG GTATTGTGGC GATGTTTTCC
GGTTATGCCT ATGCGCGTCT GGGGGCGAGC TATCCCAGTA ATGGCGGCAT TATCGACTTC
TTTCGTCGCG GATTAGGCAA CGGCGTCTTT TCGCTGGCGC TCTCGTTACT GTACCTGTTG
ACGCTGGCGG TGAGCATCGC CATGGTCGCC CATGCTTTTG GCGCTTATGC CGTGCAGTTT
TTGCATGAAG GCAGCCAGGA GGAGCACCTT ATTTTGCTCT ACGCGTTGGG GAGCATTGCG
GTGATGACGC TTTTCAACTC CTTAAGCAAC CATGCGGTAG GGCGGCTGGA AGTGATCCTC
GTCGGCATTA AAATGATGAT CCTGTTATTG CTGATTATTG CCGGTGTCTG GTCGCTGCAA
CCAGCGCATA TTTCCGTCTC TGCGCCCCCC AGCTCCGGTG CGTTCTTCTC CTGTATTGGG
ATAACTTTCC TTGCCTATGC GGGCTTTGGC ATGATGGCGA ACGCGGCGGA TAAAGTGAAA
GATCCGCAGG TCATTATGCC ACGGGCGTTT CTGGTGGCGA TTGGCGTTAC CACGTTGCTT
TATATCTCGC TGGCACTGGT TTTGCTTAGC GATGTATCGG CATTAGAGTT AGAAAAATAT
GCCGATACCG CCGTAGCGCA GGCTGCTTTT CCGCTGCTCG GACATGTGGG TTATGTGATC
GTCGTCATCG GCGCTTTACT GGCGACGGCT TCAGCCATTA ACGCGAACCT GTTCGCCGTG
TTTAACATCA TGGACAACAT GGGCAGCGAA CGCGAACTGC CGAAGCTAAT GAATAAATCC
CTGTGGCGGC AGAGTACCTG GGGCAACATC ATTGTCGTGG TGTTGATTAT GCTGATGACG
GCGGCACTGA ATTTAGGCTC ACTCGCCAGC GTTGCCAGCG CCACCTTTTT GATTTGCTAC
CTGGCGGTGT TTGTGGTGGC GATCCGCCTG CGTCATGATA TTCACGCCTC GTTGCCGATT
CTTATCGTTG GTACGTTGGT GATGTTGTTG GTGATCGTTG GCTTTATCTA CAGTCTGTGG
TCCCAGGGTA GCCGTGCGTT GATATGGATT ATTGGCTCAC TCTTACTCAG CCTTATTGTG
GCAATGGTCA TGAAGCGCAA TAAAACCGTA TAA
 
Protein sequence
MMNTEGNNGN KPLGLWNVVS IGIGAMVGAG IFALLGQAAL LMEASTWVAF AFGGIVAMFS 
GYAYARLGAS YPSNGGIIDF FRRGLGNGVF SLALSLLYLL TLAVSIAMVA HAFGAYAVQF
LHEGSQEEHL ILLYALGSIA VMTLFNSLSN HAVGRLEVIL VGIKMMILLL LIIAGVWSLQ
PAHISVSAPP SSGAFFSCIG ITFLAYAGFG MMANAADKVK DPQVIMPRAF LVAIGVTTLL
YISLALVLLS DVSALELEKY ADTAVAQAAF PLLGHVGYVI VVIGALLATA SAINANLFAV
FNIMDNMGSE RELPKLMNKS LWRQSTWGNI IVVVLIMLMT AALNLGSLAS VASATFLICY
LAVFVVAIRL RHDIHASLPI LIVGTLVMLL VIVGFIYSLW SQGSRALIWI IGSLLLSLIV
AMVMKRNKTV
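
The relationship between the gene sequence and the protein sequence above can be illustrated with a short Python sketch. It builds a codon table from the standard amino acid ordering; to the best of our understanding, translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which start codons are permitted, so this table is used here as an assumption. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Strip the spaces and line breaks used to display the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display lines of the gene sequence in this record.
first_line = "ATGATGAACA CGGAAGGTAA TAACGGTAAC AAACCCCTCG GTCTATGGAA TGTCGTTTCC"
last_line = "GCAATGGTCA TGAAGCGCAA TAAAACCGTA TAA"

print(translate(first_line))  # MMNTEGNNGNKPLGLWNVVS
print(translate(last_line))   # AMVMKRNKTV
```

The two printed fragments match the first 20 and the last 10 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.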