Gene SbBS512_E2411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2411 
Symbol 
ID6270558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2207428 
End bp2208459 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content50% 
IMG OID641726408 
ProductDNA internalization-related competence protein ComEC/Rec2 homolog 
Protein accessionYP_001880890 
Protein GI187731750 
COG category[R] General function prediction only 
COG ID[COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily) 
TIGRFAM ID[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCATTTAA GCGGGCCGTT AATCCTGGAG CAAGGGTTAT GGTTTCTTGC CGACCGGTCT 
TTGGCTTTAC TTTTCTGGGG GTTAAAGAGT TTGCCAGAAG GGTGGATCAA CATTGCTGAA
CGTTGGCAAT GGCTATCATT TTCCCCATGG TTCTTACTGG TGGTATGGCG ATTAAACGTC
TGGCGAACGT TGCCAGCAAT GTGTGTGGCT GTAGGCTTGC TGATGTGCTG GCCGCTGTGG
CAAAAACCTC GACCTGACGA GTGGCAGGTG TACATGCTTG ATGTCGGGCA AGGGCTGGCA
ATGGTGATAG CCAGAAACGG CAAAGCGATT CTCTATGACA CAGGACTGGC CTGGCCCGAA
GGGGATAGTG GGCAACAACT GATTATCCCC TGGCTCCACT GGCATAATCT TGAACCGGAA
GGCGTTATTC TGAGTCATGA ACATCTGGAT CACCGGGGAG GGCTGGACTC AATATTGCAC
ACATGGCCGA TGTTATGGAT CAGAAGTCCG TTAAACTGGG AACATCATCA GCCCTGTGTG
CGTGGCGAAG CGTGGCAATG GCAAGGATTG CGTTTCAGCG TGCACTGGCC TTTACAAGCT
AGCAACGATA AAGGAAATAA CCATTCCTGT GTGGTTAAGG TTTATGACGG GACGAATAGC
ATTCTTCTAA CCGGTGATAT TGAAGTCCCC GCTGAACAAA AGATGCTAAG CCGTTACTGG
CAGCAAGTGC AGACAACATT GCTTCAGGTA CCTCACCATG GCAGTAATAC CTCATCATCG
TTGCCATTAA TTCAGCGAGT GAATGGAAAA GTGGCACTCG CATCGGCATC GCGCTATAAC
GCATGGCGAT TGCCCTCTAA TAAAGTTAAG CATCGCTATC AACAGCAAGG ATATCAATGG
CTTGATACTC CACATCAGGG TCAAGTGACG GTCAATTTTT CAGCGCAAGG CTGGCGGATT
AGCAGCCTCA GAGAGCAAAT TTTACCTCGT TGGTATCATC AGTGGTTTGG CGTGCCAGTG
GATAACGGGT AG
 
Protein sequence
MHLSGPLILE QGLWFLADRS LALLFWGLKS LPEGWINIAE RWQWLSFSPW FLLVVWRLNV 
WRTLPAMCVA VGLLMCWPLW QKPRPDEWQV YMLDVGQGLA MVIARNGKAI LYDTGLAWPE
GDSGQQLIIP WLHWHNLEPE GVILSHEHLD HRGGLDSILH TWPMLWIRSP LNWEHHQPCV
RGEAWQWQGL RFSVHWPLQA SNDKGNNHSC VVKVYDGTNS ILLTGDIEVP AEQKMLSRYW
QQVQTTLLQV PHHGSNTSSS LPLIQRVNGK VALASASRYN AWRLPSNKVK HRYQQQGYQW
LDTPHQGQVT VNFSAQGWRI SSLREQILPR WYHQWFGVPV DNG