Gene SbBS512_E3302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3302 
Symbol 
ID6269856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3076056 
End bp3077663 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content51% 
IMG OID641727202 
Productputative integrase 
Protein accessionYP_001881655 
Protein GI187733613 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAACT CAAGAACCTA CCTTTATCAG CGTAACGGTG TCTTTTACAT CCGTCTTAGA 
ATGAAGAGCA CTGGTCGCCT GACCGCTTCG CTTCCCTCTC ATAACCGTTA TAAACTAGCG
TCAGTATCAC TACGGACTAA GGACAGACGC ACCGCTATGG CTCACTCACG GCACATCAAG
TCAGCACTTA AAGCAATCCA CGCAGATAAC CCTAACGCCT CTTATGAGGA GCTACGGGAG
CACCTGAAGA CCATTGTCGA GTGGGAGCTT AGTGTCAGCC GTGATGACCT GAACGACCCA
GAGTCCTACC AGCTTTACGT TGACCAGTAC GATGACATTA AGTCCAACCT TCGGGAAGCT
GTAGCGACTG AGCGTCTGAC TGTAGACCAG CATCGCTATA TCAATGACGT TATCGGTGTG
CTTAAGGCCT GTCAGGACAG ACTCAGGGGT GATAGCTCTG GCCTGTTGTC TTACCTTGAG
CCTGAGACCA GTAGTCTAAG ACCGTCCGTC TCCCTATCTG TATTGGCAGA TCCTGAAGTC
CCCGAACCTA AAGCTCTTAC CCTCGCGTCT CTCATTGAGC AGTACGAGCA GGAGAACGCC
CAGAACTGGA AGCCAGCGAC CCTAAGCGAG AACCGAGCGT CGCACTCCAC GCTAATCGAG
ATATTCGACT ATCTGGATAT TCAGGATGTC GGCAAGGCGA CCCGTTCAGA TATGCTCAGG
GTCCGTGAAG TCCTCCAGCA GATACCGAAG AACCGCAAGC AACGCTTTAA GTCTATGCCG
CTGTCTGACC TGCTGAACCG GGAGTCTAAG ACTGACTGTT TGGATGTCGT TACCATCAAC
AATAAATATC TTATCAAGAT GGCTGCTGTG TTTAAGTGGG CGGTACGCAA CGACTTGATA
GCTAAGAACC TGACCGAAGG TCTTGAGCTG AAAGTCCCGC AGCGTAAAGC CTCCGACGCT
CGTGATGCGT TCAGTCCTGA GCAGGTAGGG CAACTACTGG TCGCCGCTAA AGCGTACTCT
CAGAAGACCG CAGGTAAGCC ATACCATTAT TACGTCACCG CTCTGGCTGC GATCACTGGA
GCAAGACTTA ACGAGGTGGC CCAACTTCAG GTCAAAGACG TCAGGGTCAC TGAGGCCGGG
ACCGTGTATA TCCACATTAA CGAAGACGAT AGCAGCCTGC CGGGTAAGAG CGTTAAGAAT
GCGCATAGCG ACCGCTGTGT CCCGCTGGTC GATGGCGCTT ATGGTTTTGT ATTGGCTGAC
TTTGTGTCGC TTGTGGAAGA CCGTAGAAAG GCTGAGGGTG ATAATGCTAT GGTATTCGAT
GGCCTCAAGC TGATGAAGAA CGGCTACGGT GAGCAGGTGA GCAAATGGTT TAACCGAACG
TTGCTACCTA AAGTCCTTGC TGACCGTAGC GGCTTAGCGT TTCACTCGTT TAGACATACT
GTTGCGACCC AGTTAAAACA ACATGGGGTC GAGTTAGCAT ATGCTCAGGC GATTATGGGC
CATTCGAGCG GGTCCATAAC TTACGACCGT TACGCGAAAG AGGTTGAGGT AGATAGATTG
GTTAATGTGA TGGCTGGTGT TTATAAGGAG ACTGGAGTAA ATGGCTGA
 
Protein sequence
MLNSRTYLYQ RNGVFYIRLR MKSTGRLTAS LPSHNRYKLA SVSLRTKDRR TAMAHSRHIK 
SALKAIHADN PNASYEELRE HLKTIVEWEL SVSRDDLNDP ESYQLYVDQY DDIKSNLREA
VATERLTVDQ HRYINDVIGV LKACQDRLRG DSSGLLSYLE PETSSLRPSV SLSVLADPEV
PEPKALTLAS LIEQYEQENA QNWKPATLSE NRASHSTLIE IFDYLDIQDV GKATRSDMLR
VREVLQQIPK NRKQRFKSMP LSDLLNRESK TDCLDVVTIN NKYLIKMAAV FKWAVRNDLI
AKNLTEGLEL KVPQRKASDA RDAFSPEQVG QLLVAAKAYS QKTAGKPYHY YVTALAAITG
ARLNEVAQLQ VKDVRVTEAG TVYIHINEDD SSLPGKSVKN AHSDRCVPLV DGAYGFVLAD
FVSLVEDRRK AEGDNAMVFD GLKLMKNGYG EQVSKWFNRT LLPKVLADRS GLAFHSFRHT
VATQLKQHGV ELAYAQAIMG HSSGSITYDR YAKEVEVDRL VNVMAGVYKE TGVNG