Gene SbBS512_E4768 details


Gene Information

Locus tag: SbBS512_E4768
Symbol:
ID: 6271909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 4450010
End bp: 4451290
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 52%
IMG OID: 641728522
Product: putative membrane protein
Protein accession: YP_001882917
Protein GI: 187730651
COG category: [S] Function unknown
COG ID: [COG2733] Predicted membrane protein
TIGRFAM ID:
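
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly); the variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 4450010
end_bp = 4451290

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # codon count, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1281 426
```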


Plasmid Coverage information

Num covering plasmid clones: 51
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAC TCATTGAACT CAGACGCGCC AAAATGTTGG CGCTCTCTTT ACTGCTTATC 
GCCGCTGCTA CCTTTGTCGT TACGCTGTTT TTGCCGCCCA ATTTTTGGGT GAGCGGCGTG
AAGGCGATTG CTGAAGCGGC GATGGTCGGC GCGCTGGCGG ACTGGTTTGC GGTGGTGGCG
CTGTTTCGCC GCGTGCCGAT TCCGATCATT TCTCGTCATA CGGCGATTAT CCCGCGTAAT
AAAGACCGGA TTGGCGAAAA TCTCGGTCAG TTCGTGCAGG AAAAATTTCT CGATACCCAG
TCGCTGGTGG CATTGATTCG ACGTCACGAA CCGGCGTTGT TGATTGGCAA CTGGTTTAGT
CAGCCGGAGA ACGCCCGCCG CGTTGGTCAG CATCTGTTGC AGATCATGAG CGGTTTTCTT
GAACTGACCG ATGATGCGCG TATTCAGCGC CTGCTTAAGC GCGCGGTCCA TCGGGCGATT
GATAAGGTCG ATCTTTCCGG AACCAGTGCG TTGATGCTGG AGAGTATGAC CAAAAACGAT
CGTCATCAGG TGCTGCTCGA TACGCTGATC GCACAGTTGA TCGCCCTTCT CCAGCGCGAT
AAATCGCGCA AGTTTATTGC CCAGCAAATT GTTCGCTGGC TGGAGAGCGA GCATCCGCTG
AAAGCCAAAA TTTTGCCTAC TGAATGGCTG GGCGAACATA GCGCGGAGTT GGTTTCTGAC
GCGGTGAATT CTTTGCTTGA TGATATCAGC CGCGATCGTG CGCATCAGAT CCGTCATGCG
TTTGATCGCG CCACTTTTGC CCTGATCGAC AAGCTGAAAA ACGATCCGGA AATGGCAGCG
CGAGCCGATG CCGTAAAAAG TTATCTGAAA GAAGATGAAG CTTTTAACCG CTATCTCAGT
GAATTGTGGG GGGATTTACG GGAATGGCTG AAAGCGGATA TCAACAGTGA AGATTCTCGT
GTGAAAGAAC GTATCGCGCG GGCGGGTCAA TGGTTTGGCG AAACGTTAAT TGCCGATGAT
GCCTTGCGGG CATCGTTAAA TGGTCACCTG GAACAAGCCG CGCACCGCGT CGCGCCTGAG
TTTTCCGCAT TCCTGACGCG CCACATCAGC GATACAGTAA AAAGCTGGGA TGCTCGGGAT
ATGTCGCGGC AAATCGAGTT AAATATCGGC AAAGATCTGC AGTTTATCCG TGTCAACGGT
ACGCTGGTTG GCGGTTGTAT TGGGCTAATT TTATATTTGT TGTCGCAGCT CCCGGCCTTG
TTCCCCCTCA GCAATTTTTA G
 
Protein sequence
MNKLIELRRA KMLALSLLLI AAATFVVTLF LPPNFWVSGV KAIAEAAMVG ALADWFAVVA 
LFRRVPIPII SRHTAIIPRN KDRIGENLGQ FVQEKFLDTQ SLVALIRRHE PALLIGNWFS
QPENARRVGQ HLLQIMSGFL ELTDDARIQR LLKRAVHRAI DKVDLSGTSA LMLESMTKND
RHQVLLDTLI AQLIALLQRD KSRKFIAQQI VRWLESEHPL KAKILPTEWL GEHSAELVSD
AVNSLLDDIS RDRAHQIRHA FDRATFALID KLKNDPEMAA RADAVKSYLK EDEAFNRYLS
ELWGDLREWL KADINSEDSR VKERIARAGQ WFGETLIADD ALRASLNGHL EQAAHRVAPE
FSAFLTRHIS DTVKSWDARD MSRQIELNIG KDLQFIRVNG TLVGGCIGLI LYLLSQLPAL
FPLSNF
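
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one possible way to do this in plain Python, not the method used to generate this record. It builds the codon table by hand from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last lines of the gene sequence are used here for brevity.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

# First 30 and last 21 nucleotides of the gene sequence above.
head = translate("ATGAATAAAC TCATTGAACT CAGACGCGCC")
tail = translate("TTCCCCCTCA GCAATTTTTA G")
print(head, tail)  # MNKLIELRRA FPLSNF
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the result to be compared with the protein sequence and the GC content listed in the record.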