Gene SbBS512_A0078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_A0078 
Symbol 
ID6273404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010660 
Strand
Start bp48513 
End bp50114 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content52% 
IMG OID641728735 
Producttransposase family 
Protein accessionYP_001883126 
Protein GI187734384 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value7.38697e-26 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAATG AACTCCCCGA TGATATTGAG CTGCTTAAAG CCATGTTGCG TAAGCAACAG 
AGTCGGCTTC GACAGTATGC CTGTCAGGTC GCGGGCTATG AGCAGGAAAT TGAACGGCTG
AAAGCGCAAC TCGACAGGTT GCGTCGTATG CTGTTCGGCC AGAGTTCAGA GAAAAAGCGT
CATAAGCTTG AAAATCAGAT CCGACAGGCA GAAAAACGAC TGTCGGAACT GGAAAACCGA
CTGAACACAG CCAGAAATCT TCTGGAAGAT GCATCGTCAG TCACAGATTC ACCTGACACC
AGTCCCCCGT CAGAAAACCC GATCGCCAGT AAGCCTGAAT CCCCGGGACG AAAATCTTCA
CGAAAACCGC TGCCGGCAGA ACTTCCCCGG GAGACACATC GCCTTCTGCC TGCTGAAACC
AGTTGCCCGG CCTGTGGAGG TGTTCTGAAA GAAATGGGGG AAACAATCTC AGAGCAACTG
GATATCATTA ATACCGCCTT TAAAGTTATC GAAACCATAC GTCCCAAACT GGCCTGTAGC
CGGTGTGATG TCATCGTTCA GGCACCACTT CCCCCTAAAC CGATCGAACG CGGTTATGCC
AGTGCAGGGT TACTTGCACG GATCCTGGTC AGCAAATATA TGGAACATAT CCCTTTATAT
CGCCAGTCAG AAATATACGC GCGACAGGGC GTGGAGCTGA GCCGTAATAC CATGGTGCGC
TGGGTATCAG AAATGGCAGA CAAACTCCGT CCTCTGTATA TAGCGCTGAA TGACTATGTT
CTGGAGGCAG GAAAGGTGCA CGCAGATGAC ACTCCGGTGA AAGTACTGGC CCCGGGGAAC
GGAAAGACGA AAACGGGTCG TCTGTGGGTA TACGTCAGGG ATGATCGTAA TGCGGGTTCA
TCCCTGCCGG CAGCCGTCTG GTTCGCGTAT TCGGCAGATC GCAAAGGAGA ACATCCGCAG
CTCCACCTGG CAAAGTATCA GGGCGTACTG CAGGCTGATG CCTATGCAGG TTATAACGTA
CTGTACGAAA CGGGCCGGGT GAAGGAAGCC GGGTGCCTGG CCCACGCCCG CCGAAAAATC
CATGACGAGG ATGTGCGCCG TCCGACAGAA ATGACTCAGG AAGCGCTCAG ACGGATAGCA
GAGTTATACG ACATAGAAGC GGAGATACGT GGCAGTCCGG CAGAGGAACG GCTTGCAGTC
AGAAAAGCCA GAAGCGTCCA GTTGATGCAG TCATTGTACG ACTGGATACA GTTGCAGAGG
AAAACGCTGT CGAAACATGC GGAGATGGCG AAGGCGTTCG ACTATATCCT GAATCACTGG
AATGCGCTGA ACGAGTTCTG TCGTGACGGC TGGGTGGAAA TAGACAACAA CATCGGTGAA
AACGCGTTAC GATCGGTGGC GGTTGGAAGA AAAAATTATC TCTTTTTCGG CTCAGACAAG
GGAGGAGAAA GTGCGGCGAT CATCTACAGT CTGCTGGTCA CCTGCAAACA GAACGAAGTG
GAGCCGGAGG ACTGGTTGCG CGAAGTGATC GAGAAGCTCA ATGACTGGCC GTCGAACCAA
GTGCATGAAC TGCTGCCCTG GAACTTCTCG TCTGTAAAAT AA
 
Protein sequence
MNNELPDDIE LLKAMLRKQQ SRLRQYACQV AGYEQEIERL KAQLDRLRRM LFGQSSEKKR 
HKLENQIRQA EKRLSELENR LNTARNLLED ASSVTDSPDT SPPSENPIAS KPESPGRKSS
RKPLPAELPR ETHRLLPAET SCPACGGVLK EMGETISEQL DIINTAFKVI ETIRPKLACS
RCDVIVQAPL PPKPIERGYA SAGLLARILV SKYMEHIPLY RQSEIYARQG VELSRNTMVR
WVSEMADKLR PLYIALNDYV LEAGKVHADD TPVKVLAPGN GKTKTGRLWV YVRDDRNAGS
SLPAAVWFAY SADRKGEHPQ LHLAKYQGVL QADAYAGYNV LYETGRVKEA GCLAHARRKI
HDEDVRRPTE MTQEALRRIA ELYDIEAEIR GSPAEERLAV RKARSVQLMQ SLYDWIQLQR
KTLSKHAEMA KAFDYILNHW NALNEFCRDG WVEIDNNIGE NALRSVAVGR KNYLFFGSDK
GGESAAIIYS LLVTCKQNEV EPEDWLREVI EKLNDWPSNQ VHELLPWNFS SVK