Gene SbBS512_E0767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0767 
Symbol 
ID6270701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp719456 
End bp720697 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content52% 
IMG OID641724949 
Productprophage integrase 
Protein accessionYP_001879477 
Protein GI187731694 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.496617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGAA AGGCAACCCC ACTCACCACC ACACAGATCA AAGCAGCTAA ACCAGCAGAA 
AAAGAATACA CCTTACAGGA TGGCGGCGGG CTTTTTCTCG TTATTAAGCC GTCTGGTTCG
AAACTATGGC GATTCAACTA CTATCGACCT TCGAACAAAA AACGAACACT CATTAGTTTG
GGATCGCTTG ATGAAGTCTC CCTTGCTGAT GCCAGAAAAC GCCGTAGCGA GTACAGGACG
TTAATTGCGG CAGGAACCGA CCCGCAGGAA TACGAACGGC AAAAACGCGA AGCAGAGGCT
CGACGACAGG GCAACACGTT CGAAAATGTG GCCGCATTGT GGTACGAAAT GAAAAAAAAC
CAGAATCTCG CCCACAATAC GATCAAGGAC ATCTGGCGCT CGCTAGAGAA ATATGTATTC
CCGTATATCG GAAACACACC GATAGATACT CTCACCGCTC GCCGCTTTGT TGAGATACTC
ACGCCCATCA AGGCACGTGG CAACCTGGAA ACACTGAAAC GCGTTTTACA GCGCATCAAT
GAAGTGATGG ATTTTGCTGC CAACAGTGGA ATGATTGACA TCAACACCGC CGCGAACGTT
CGCAAGACAT TCCCTTCTCC CACCAAAAAG CACATGCCAA CCATCCGGCC GGAACAACTA
CCGCAGCTAA TGCACGATTT ATCGATCGCC AGCATAGAAC GGCAAACCAG ATTACTGATT
GAGTGGCAGT TGTTAACCGC AACCCGCCCA GCCGAAGCCT CTGCCGCACG GTGGGAAGAA
ATCAACCTTG AGGCGGCAAC CTGGACGATA CCAGCCGGAC GCATGAAGAT GCGCCGTGAT
CATGTGATCC CCCTTTGCGC TCAGGCGATG GCAGTGCTTG AAGCCATGAA GCCGATTAGC
GCACGAAGGG AGCATGTTTT CCCAAGTCTT AAAAATCCAG TGCAACCAAT GAGCAGCCAG
ACAGCAAACG CAGCATTACG GCGAATGGGT TACACTGGTG TGCTGGTATC TCACGGACTA
CGCGCCATAT TCAGCACAGC GGCGAACGAA GAAGGATTTG AGCCGGACGT AATCGAGGCC
GCACTCGCAC ACGTGGACAC GAACGAAGTT AGACGGGCAT ACAACCGAAG CAACTACCTG
GAAAAACGTA AAGTGTTAAT GTGCTGGTGG GGTGATTTTG TGGAAGCAGC CGCAACCGGA
ACAACCATCG CCAGCGGGCA CAGAGGATTA CGCGGAAGGT AA
 
Protein sequence
MARKATPLTT TQIKAAKPAE KEYTLQDGGG LFLVIKPSGS KLWRFNYYRP SNKKRTLISL 
GSLDEVSLAD ARKRRSEYRT LIAAGTDPQE YERQKREAEA RRQGNTFENV AALWYEMKKN
QNLAHNTIKD IWRSLEKYVF PYIGNTPIDT LTARRFVEIL TPIKARGNLE TLKRVLQRIN
EVMDFAANSG MIDINTAANV RKTFPSPTKK HMPTIRPEQL PQLMHDLSIA SIERQTRLLI
EWQLLTATRP AEASAARWEE INLEAATWTI PAGRMKMRRD HVIPLCAQAM AVLEAMKPIS
ARREHVFPSL KNPVQPMSSQ TANAALRRMG YTGVLVSHGL RAIFSTAANE EGFEPDVIEA
ALAHVDTNEV RRAYNRSNYL EKRKVLMCWW GDFVEAAATG TTIASGHRGL RGR