Gene SbBS512_E4070 details

Gene Information

Locus tag: SbBS512_E4070
Symbol:
ID: 6268955
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3800493
End bp: 3801812
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 52%
IMG OID: 641727910
Product: site-specific recombinase, phage integrase family
Protein accession: YP_001882342
Protein GI: 187731467
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
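
The coordinates and lengths listed above are consistent with one another if the coordinates are 1-based and inclusive and the final codon is a stop that is not counted in the protein length. Both points are assumptions, not statements from the record. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Amino acids encoded by a CDS whose final codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3800493, 3801812)
print(length, protein_length(length))  # 1320 439
```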


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGTTTG GGTACATTGC TTTGGGTACA CCCCCTATTC AAAGTTTGGA TATTCGCAAA 
ATGCCAAAAC TAACAGACAT GCAGATCCGC GCATGGATTA AGAGCGGAGA GCGATTCGAG
GGGCGGGCAG ACGGTAACGG TTTATACCTA CGTTACCGTG AAGCCGACAA AACCCCCACA
TGGAGATTCC GCTATAAACT CGCAGGGAAG TCCCGCGCCA TGCTAATTGG TTCGTATAGC
GAGCTATCAC TATCAAAGGC CAGAGAGACA GCCAAAGAGC TATCGGCTCG CGTTGCGCTG
GGCTATGACG TTGCAGGAGA GAAGCAGAAG CACAAAACCG AAGCACTGGC GAAGATGGAA
GCAGAGAAGA ACGCTATGCG CGTTTCAGAG CTTGCCGCTG AATACTTTGA GCGTCAGATC
CTCCCGCGCT GGAAGCACCC CGATATACTC CGCCGCCGTA TCGACAAAGA TATAAACCCC
TGCATTGGCA GCATGAAGGT AGAGGACGTG AAACCGCGCC ATATCGATGA CATGCTGAAA
GGTATTGTTG ACCGTGGAGC GCCGACCATA GCAACGGACG TGCTGAGATG GACGCGCCGC
ATATTCGACT ACGGAATCAA ACGGCACGCG CTAGAGATTA ACCCCTGTTC AGCCTTTGAG
GTGGCAGACG CCGGAGGGAA AGAAGCTGCC CGTGACCGCT GGTTAACCCG CGATGAGTTA
ATCCAGCTAT TCAAAGCCAT GCGCACGGCT AAGGGATTCA GTCGCCAGAA CGAAATCACG
TTCAAATTAC TGTTAGCGTT ATGCGTCCGC AAAATGGAAT TATGCGCCGC ACGATGGGAA
GAGTTTGATT TAGATGGTGC GGTATGGCAT TTGCCGGAAG AACGCAGCAA AAACGGAGAC
CCTATTGATA TACCTCTACC TTCCCCAGCC GTTGAATGGT TGAGAGAGCT ACACACCTTT
TCATGTAATA GCGCATGGGT GCTTCCGGCC AGGAAGATGC AAAACAGAAT GATCCCACAT
ATTCAGGAAA GCACTTTACC CGTAGCACTG GCTAAGGTTC GCGCCGAAAT GCCGGATGTG
CCTAATTTCA CGATTCACGA CTTTCGACGC ACCGCACGTA CTCATTTAGC AGCGTTGGGT
GTTGATCCTG TTGTGGCGGA ACGATGCCTC AATCATCGCA TTAAGGGCGT AGAGGGGATT
TATAACCGCC ATCAGTATTT TGATGAGCGT AAAGCAGCAC TGGCACAGTG GGCTGATCTG
CTAGTGGCAC TGGAAAGCGG AAAAGACTAC AACGTAACGC CTCTCAGAAG GGCGAACTAA
 
Protein sequence
MRFGYIALGT PPIQSLDIRK MPKLTDMQIR AWIKSGERFE GRADGNGLYL RYREADKTPT 
WRFRYKLAGK SRAMLIGSYS ELSLSKARET AKELSARVAL GYDVAGEKQK HKTEALAKME
AEKNAMRVSE LAAEYFERQI LPRWKHPDIL RRRIDKDINP CIGSMKVEDV KPRHIDDMLK
GIVDRGAPTI ATDVLRWTRR IFDYGIKRHA LEINPCSAFE VADAGGKEAA RDRWLTRDEL
IQLFKAMRTA KGFSRQNEIT FKLLLALCVR KMELCAARWE EFDLDGAVWH LPEERSKNGD
PIDIPLPSPA VEWLRELHTF SCNSAWVLPA RKMQNRMIPH IQESTLPVAL AKVRAEMPDV
PNFTIHDFRR TARTHLAALG VDPVVAERCL NHRIKGVEGI YNRHQYFDER KAALAQWADL
LVALESGKDY NVTPLRRAN
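
Both sequences above are printed in space-separated blocks of 10 residues, so the whitespace has to be removed before any analysis. A minimal Python sketch for the gene sequence follows. It assumes plain A/C/G/T input, the helper names are illustrative, and the stop codons are those of translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text):
    """Drop spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_like_complete_cds(seq):
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Last row of the gene sequence shown above, used as a small example.
last_row = "CTAGTGGCAC TGGAAAGCGG AAAAGACTAC AACGTAACGC CTCTCAGAAG GGCGAACTAA"
seq = clean_sequence(last_row)
print(len(seq), ends_like_complete_cds(seq))  # 60 True
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported in the record.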