Gene SbBS512_E0979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E0979 
Symbol 
ID6271978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp902152 
End bp903414 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content56% 
IMG OID641725129 
Productsite-specific recombinase, phage integrase family protein 
Protein accessionYP_001879655 
Protein GI187730111 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000000444866 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTTAA ACGATTCTAA AATCCGCAAG TTAAAACCTT CTTCCCGTCC GGTAAAACTC 
TCCGACGCTC ACGGTCTGTA TCTGCTCGTC AATCCGGGCG GTTCACGCAT CTGGTATCTC
AAATATCGTT TCAACGGAAA AGAATCCAGA GTCAGTCTTG GCGCATACCC GCTGGTCTCA
CTGGCAGAAG CCAGGCAACA GCGCGACGGT ATCCGCAAGC TACTGGCGCA GAATATCAAC
CCGGCGCAAC AGCGCATGGC AGAGAAATCC GCCTGCTCCC CTGAAAAGTG TTTTAAGGCG
GTGGCGCTGG CCTGGCACAA AACCAACAAA AAATGGTCGG CTGATTATGC CGCCCGTATT
CTCGCCAGTA TGGAAAACCA TATCTTCCCG GCGGTTGGTC ACCTGCCCGT TGCTGCGCTT
AAAACGCAGG ATTTCACGGC TTTGTTGCGG GTTATCGAGA ATAAAGGCTT TCTGGAAGTC
GCGTCCCGAA CCCGGCAGCA ACTCAGCAAC ATCATGCGCT ATGCCGTTCA GCAGGGACTT
ACCGACAGTA ATCCGGCGCA GCATCTGGAA GGTGTAACTG CCTCCCCCGT CAAGAATCAC
TATCCCGCTT TACCGCTGGA GCGATTGCCT GAACTGCTTG ACCGCATTGG CGACTACCGG
CAGGGCCGGG AGTTAACCCG GCTGGCGGTG GTGCTGACGT TGCACCTGTT CATCCGTTCC
AGCGAACTGC GTTTCGCCCG CTGGAGTGAG ATTGATTTCA GGCACAAAAT CTGGACCATC
CCCGCAACCC GCGAGGCCAT TGATAAAGTA CGGTTTTCGG GGCGTGGCGC AAAAATACGC
ACCCCGCATA TCGTACCGCT CTCCTGCCAG GCGATTGCCA TTCTGAAACA GATACAGGAG
CTTTCCGGCC ATCTGGATCT GGTGTTTCCC GGCGACCATA ATCCGTACAA GCCAATGAGC
GAAAACACCA CCAACCGGGC GCTGCGTCTG ATGGGGTATG ATACGAAAAC TGAAATCTGC
GGGCATGGAC TCAGGGCAAT GGCCTGTAGC GCCCTGGTGG AGTCGGATCT GTGGTCACGC
GATACAGTGG AGCGGCAAAT GAGCCACCAG GAGCGCAACA GCGTGCGTGC GGCATATGTG
CATAAGGCGG AGCATCTGGA GGCCCGAAAG GCCATGATGC AGTGGTGGTC GGATTATCTG
GATGTGTGCC GCGAGGGGTA TGTCGCGCCG TACATTTATG CGCGGCAGCA TGAGGCAGCC
TGA
 
Protein sequence
MFLNDSKIRK LKPSSRPVKL SDAHGLYLLV NPGGSRIWYL KYRFNGKESR VSLGAYPLVS 
LAEARQQRDG IRKLLAQNIN PAQQRMAEKS ACSPEKCFKA VALAWHKTNK KWSADYAARI
LASMENHIFP AVGHLPVAAL KTQDFTALLR VIENKGFLEV ASRTRQQLSN IMRYAVQQGL
TDSNPAQHLE GVTASPVKNH YPALPLERLP ELLDRIGDYR QGRELTRLAV VLTLHLFIRS
SELRFARWSE IDFRHKIWTI PATREAIDKV RFSGRGAKIR TPHIVPLSCQ AIAILKQIQE
LSGHLDLVFP GDHNPYKPMS ENTTNRALRL MGYDTKTEIC GHGLRAMACS ALVESDLWSR
DTVERQMSHQ ERNSVRAAYV HKAEHLEARK AMMQWWSDYL DVCREGYVAP YIYARQHEAA