Gene SbBS512_E0724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0724 
SymbolompC 
ID6270232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp683945 
End bp685075 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content50% 
IMG OID641724910 
Productouter membrane porin protein C 
Protein accessionYP_001879438 
Protein GI187733369 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.140015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTA AAGTACTGTC CCTCCTGGTC CCAGCTCTGC TGGTAGCAGG CGCAGCAAAC 
GCTGCTGAAG TTTACAACAA AGACGGCAAC AAATTAGATC TGTACGGTAA AGTAGACGGC
CTGCACTATT TCTCTGACAA CAAGTCAGAA GACGGCGACC AGACCTATGT ACGTCTTGGT
TTCAAAGGCG AAACTCAGGT TACTGACCAG CTGACCGGTT ACGGCCAGTG GGAATATCAG
ATCCAGGGCA ATACCTCTGA AGACAACAAA GAAAACTCCT GGACCCGTGT GGCATTCGCA
GGTCTGAAAT TCCAGGATGT AGGTTCTTTC GACTACGGTC GTAACTACGG CGTTGTTTAC
GACGTAACTT CCTGGACCGA CGTACTGCCA GAATTCGGTG GCGACACCTA CGGTTCTGAC
AACTTCATGC AGCAGCGTGG TAACGGCTTC GCGACCTACC GTAACACCGA CTTCTTCGGT
CTGGTTGACG GTCTGAACTT TGCTGTTCAG TACCAGGGCA AAAACGGTAG CGTAAGCGGC
GAAGGCATGA CCAACAACGG TCGTGGCGCT CTGCGTCAGA ACGGCGACGG TGTCGGCGGT
TCTATCACTT ATGATTACGA AGGCTTCGGT ATCGGTGCTG CAGTTTCCAG CTCCAAACGT
ACTGATGATC AAAATGGTAG CTACATCAGC AATGGTGTAG TTCGTAACTA CATCGGTACT
GGCGACCGTG CTGAAACCTA CACTGGTGGT CTGAAATACG ACGCTAACAA CATCTACTTG
GCTGCTCAGT ACACCCAGAC CTACAACGCA ACTCGCGTAG GTTCCCTGGG TTGGGCGAAC
AAAGCACAGA ACTTCGAAGC TGTTGCTCAG TACCAGTTCG ACTTCGGTCT GCGTCCGTCT
GTAGCATACC TGCAGTCTAA AGGTAAAAAC CTGGGTGTCA TCAATGGTCG TAACTACGAC
GACGAAGATA TCCTGAAATA TGTTGATGTT GGCGCGACCT ACTACTTCAA CAAAAACATG
TCCACCTATG TTGACTACAA AATCAACCTG CTGGACGACA ACCAGTTCAC TCGTGACGCT
GGCATCAACA CTGATAACAT CGTAGCTCTG GGTCTGGTTT ACCAGTTCTA A
 
Protein sequence
MKVKVLSLLV PALLVAGAAN AAEVYNKDGN KLDLYGKVDG LHYFSDNKSE DGDQTYVRLG 
FKGETQVTDQ LTGYGQWEYQ IQGNTSEDNK ENSWTRVAFA GLKFQDVGSF DYGRNYGVVY
DVTSWTDVLP EFGGDTYGSD NFMQQRGNGF ATYRNTDFFG LVDGLNFAVQ YQGKNGSVSG
EGMTNNGRGA LRQNGDGVGG SITYDYEGFG IGAAVSSSKR TDDQNGSYIS NGVVRNYIGT
GDRAETYTGG LKYDANNIYL AAQYTQTYNA TRVGSLGWAN KAQNFEAVAQ YQFDFGLRPS
VAYLQSKGKN LGVINGRNYD DEDILKYVDV GATYYFNKNM STYVDYKINL LDDNQFTRDA
GINTDNIVAL GLVYQF