Gene SbBS512_E3625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3625 
Symbol 
ID6271189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3374035 
End bp3376071 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content54% 
IMG OID641727494 
Productputative lipoprotein 
Protein accessionYP_001881936 
Protein GI187732075 
COG category[R] General function prediction only 
COG ID[COG3107] Putative lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACCCT CAACATTTTC TCGTTTGAAA GCCGCGCGTT GTCTGCCTGT TGTTCTGGCA 
GCCCTGATTT TCGCCGGTTG TGGCACCCAT ACTCTCGATC AGTCCACTGC TTATATGCAG
GGCACGGCGC AGGCTGATTC TGCCTTTTAT CTTCAGCAGA TGCAGCAAAG CTCTGATGAT
ACCAGGATCA ACTGGCAATT ACTCGCCATT CGTGCACTGG TGAAAGAAGG TAAAACCAGG
CAGGCGGTTG AGTTGTTTAA CCAACTACCG CAAGAACTGA ACGATTCTCA GCGTCGCGAG
AAAACACTGC TGGCGGTAGA GATTAAACTG GCGCAGAAAG ATTTTGCTGG CGCGCAAAAC
TTGCTGGCGA AAATCACACC TGCCGATTTA GAACAAAACC AGCAAGCGCG TTACTGGCAG
GCAAAAATCG ATGCCAGCCA GGGGCGTCCT TCCATTGATT TACTGCGCGC GTTAATTGCT
CAGGAACCGC TGCTCGGCGC GAAAGAAAAA AAGCAGAATA TTGATGCCAC CTGGCAGGCG
CTCTCCTCCA TGACTCAGGA ACAGGCGAAT ACGCTGGTGA TCAACGCCGA CGAAAATATT
CTGCAAGGCT GGCTGGATCT GCAGCGCGTC TGGTTTGATA ACCGTAACGA TCCCGATATG
ATGAAAGCCG GGATCGCCGA CTGGCAGAAA CGTTATCCGA ACAATCCGGG CGCGAAAATG
CTGCCAACGC AGTTGGTTAA CGTAAAAGCG TTTAAACCAG CCTCGACCAA CAAAATCGCC
CTGCTGTTGC CGCTGAATGG CCAGGCAGCG GTATTTGGTC GCACTATTCA GCAAGGCTTT
GAAGCGGCGA AAAATATCGG CACTCAGCCA GTGGCGGCTC AGGTAGCTGC CGCACCTGCC
GCAGACGTAG CAGAACAACC TCAGCCGCAA ACTGCGGATG GCGTTGCCAG CCCGGCACAA
GCCTCGGTTA GCGATCTGAC CGGTGATCAG CCTGCGGCCC AGCCGGTGCC TGTAAGCGCC
CCGGCGACAA GCACCGCAGC GGTAAGCGCA CCCGCAAATC CATCCGCAGA GCTGAAAATC
TACGATACCT CATCACAACC ACTTAGCCAG ATCTTAAGCC AGGTTCAGCA GGATGGCGCG
AGTATTGTGG TTGGTCCGTT GCTGAAAAAT AACGTTGAAG AGTTGCTTAA GAGCAACACC
CCGCTGAACG TGCTGGCACT GAACCAGCCG GAGAATATCG AAAACCGCGT CAATATTTGT
TACTTCGCGC TTTCACCGGA AGACGAAGCG CGCGATGCAG CGCGTCATAT TCGTGACCAG
GGTAAACAAG CGCCGCTGGT GCTGATCCCA CGCAGTTCAT TGGGCGATCG CGTAGCCAAT
GCGTTTGCGC AGGAGTGGCA GAAACTGGGC GGCGGCACCG TTCTGCAACA AAAATTTGGT
TCCACCAGCG AATTACGCGC GGGTGTTAAC GGCGGTTCTG GTATCGCTTT AACGGGTACC
CCGATTACTC CCAGAGCGAC AACCGACTCC GGCATGACGA CCAACAATCC AACGCTGCAA
ACCACGCCAA CCGATGACCA GTTCACCAAT AATGGCGGTC GTGTCGATGC GGTGTACATT
GTGGCAACGC CGGGTGAAAT CGCTTTTATC AAACCGATGA TCGCCATGCG TAACGGTAGC
CAGAGCGGTG CAATGCTGTA CGCCAGCTCC CGCAGCGCAC AAGGCACCGC AGGCCCGGAT
TTCCGTCTGG AGATGGAAGG TTTGCAGTAC AGCGAAATCC CGATGCTGGC GGGCGGTAAT
CTGCCGTTAA TGCAGCAGGC ACTCAGCGCG GTGAATAACG ATTATTCACT GGCTCGCATG
TATGCGATGG GCGTCGATGC CTGGTCGCTG GCAAATCATT TCTCACAAAT GCGCCAGGTT
CAGGGTTTTG AAATCAACGG TAATACCGGA AGCCTGACGG CAAACCCGGA TTGCGTGATT
AACAGGAAGT TATCATGGCT ACAGTACCAA CAAGGTCAGG TAGTCCCCGC CAGTTAA
 
Protein sequence
MVPSTFSRLK AARCLPVVLA ALIFAGCGTH TLDQSTAYMQ GTAQADSAFY LQQMQQSSDD 
TRINWQLLAI RALVKEGKTR QAVELFNQLP QELNDSQRRE KTLLAVEIKL AQKDFAGAQN
LLAKITPADL EQNQQARYWQ AKIDASQGRP SIDLLRALIA QEPLLGAKEK KQNIDATWQA
LSSMTQEQAN TLVINADENI LQGWLDLQRV WFDNRNDPDM MKAGIADWQK RYPNNPGAKM
LPTQLVNVKA FKPASTNKIA LLLPLNGQAA VFGRTIQQGF EAAKNIGTQP VAAQVAAAPA
ADVAEQPQPQ TADGVASPAQ ASVSDLTGDQ PAAQPVPVSA PATSTAAVSA PANPSAELKI
YDTSSQPLSQ ILSQVQQDGA SIVVGPLLKN NVEELLKSNT PLNVLALNQP ENIENRVNIC
YFALSPEDEA RDAARHIRDQ GKQAPLVLIP RSSLGDRVAN AFAQEWQKLG GGTVLQQKFG
STSELRAGVN GGSGIALTGT PITPRATTDS GMTTNNPTLQ TTPTDDQFTN NGGRVDAVYI
VATPGEIAFI KPMIAMRNGS QSGAMLYASS RSAQGTAGPD FRLEMEGLQY SEIPMLAGGN
LPLMQQALSA VNNDYSLARM YAMGVDAWSL ANHFSQMRQV QGFEINGNTG SLTANPDCVI
NRKLSWLQYQ QGQVVPAS